Job Responsibilities and Duties:
- Perform end-to-end quality assurance of data feeds and data sets.
- Provide support for data triage and assessment.
- Identify and document areas for improvement in workflows or systems.
- Attend regular standup meetings.
- Provide input to code reviews.
- Cross-train on existing collection tools.
- Support monitoring, alerting, and reporting (e.g., dashboards).
- Support new use cases.
- Research and document options for collecting or aggregating data from a variety of web-based and internal Sponsor platforms.
- Evaluate web-based platforms' ability to detect or deny access.
- Make recommendations on approaches to acquire information.
- Use appropriate tools and programming languages, such as Python, to collect and process data from a variety of sources.
- Use Sponsor-network APIs to programmatically access data.
- Create, maintain, and enhance systems in support of data exploitation.
- Create or improve custom collection scripts written in Python.
- Create or improve scripts leveraging APIs for collection needs.
- Automate data cleanup and conditioning of collected data.
- Automate data management and dissemination steps.
Experience:
- Demonstrated experience programming in Python.
- Demonstrated experience analyzing questions, formulating requirements, determining suitable analytic approaches, evaluating results, and communicating findings to partners and stakeholders.
- Demonstrated experience working with data in a variety of structured and unstructured formats.
- Demonstrated experience with a variety of database tools, such as SQL and Presto, and with data lakes/S3 data.
- Demonstrated experience with data visualization tools, such as notebook-based visualization libraries, and especially Elasticsearch, Kibana, and Tableau.
- Demonstrated ability to translate complex technical findings into an easily understood narrative in graphical, verbal, or written form.
- Demonstrated experience with AI/ML, such as natural language processing, in a production environment.
Desired Skills:
- Demonstrated experience programming in common compiled or interpreted languages such as Python and R.
- Demonstrated experience with data management tools such as Hadoop, MapReduce, or similar.
- Demonstrated experience with technical operations.
- Demonstrated experience with technical targeting.
- Demonstrated experience conducting data science using the Apache Zeppelin and Jupyter notebook platforms and Spark/PySpark.
- Demonstrated experience collaborating with colleagues to develop customer-tailored products.
Clearance:
- Must be a U.S. citizen.
- Active DoD Top Secret/SCI clearance with polygraph.