Senior AI/ML Data Scientist
Labor Category: Data Engineer (Expert)
Vendor Note: Selected candidate must reside within two (2) hours of SSA Headquarters in Woodlawn, MD. Selected candidate must be willing to work onsite at least 2 days a week.
Key Required Skills:
Strong knowledge of AI/ML/LLM, Python, NLP, and Generative AI, and experience with the clinical domain.
Position Description:
Staying updated on new methods in NLP, ML, and Generative AI.
Understanding real-world challenges and developing automated data solutions.
Developing, testing, and deploying new techniques for NLP understanding.
Scalable development and deployment of ML and Generative AI approaches, such as Large Language Models (LLMs).
Training and optimizing NLP/LLM models and creating Python-based pipelines.
Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution.
Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem.
Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems.
Provide accurate, timely, complex, and sophisticated data analysis.
Skills Requirements:
FOUNDATION FOR SUCCESS (Basic Qualifications)
Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science, with industry experience in NLP, data science, and AI/ML/LLM engineering.
Minimum 8 years of Data Scientist experience.
Must be able to obtain and maintain a Public Trust. Contract requirement.
FACTORS TO HELP YOU SHINE (Required Skills)
These skills will help you succeed in this position:
Experience with Natural Language Processing (NLP), Generative AI, and Large Language Models (LLMs).
Fluency in Python programming, version control and collaboration with Git, standard Python packages (e.g., pandas, NumPy, Matplotlib), and ML frameworks.
Knowledge of TensorFlow, PyTorch, scikit-learn, NLTK, Azure ML (optional), and Amazon Web Services EC2.
Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search.
Expert knowledge in conducting data analysis and applying advanced statistical concepts and machine learning methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models.
Experience with ML model deployment and operations practices such as DevOps, MLOps, and LLMOps.
Experience with NLP and Generative AI tools and libraries such as regular expressions, spaCy, and LangChain, as well as text annotation tools and semantic frameworks.
Experience with statistical and machine learning software such as pandas and scikit-learn.
Prior experience working on applications that relate to the clinical domain.
Ability to clean and process large amounts of real-world data.
Experience retrieving and manipulating data from a variety of data sources, including Db2, Oracle, SQL Server, Hadoop, and flat files.
Experience with database management systems, e.g., MySQL, SQLite, and SQL.
Either experience with, or the ability and willingness to learn, distributed processing via the Hadoop ecosystem, i.e., Spark, Impala, and Hive.
Excellent analytical skills to identify potential risks and propose effective solutions.
Clear communication skills to convey complex technical concepts to various partners.
Ability to collaborate with cross-functional teams.
Problem-solving skills and proven written and verbal communication with a range of audiences, including executive leadership.
HOW TO STAND OUT FROM THE CROWD (Desired Skills)
Showcase your knowledge of modern development through the following experience or skills:
Prior experience with federal or state government IT projects.
Prior experience working on applications that relate to the clinical domain.
Experience working in an analytical research environment.
Experience with statistical and machine learning software such as pandas and scikit-learn.
Experience in parallel processing, such as GPU programming with CUDA.
Mathematica
Experience using markup languages such as LaTeX and HTML.
Natural Language Processing for anomaly detection
Education:
Bachelor's degree with 12 years of experience.
Must be able to obtain and maintain a Public Trust. Contract requirement.
ABOUT THE COMPANY
Headquartered in Leesburg, Virginia, Zenius Corporation is a HUBZone-certified small business. Zenius specializes in providing Grants Management, IT Modernization, Acquisition Management, and Financial Management services to Federal agencies. Zenius was selected by Inc. 5000 as one of the fastest-growing companies in the US in 2024 and in the DC Metro Area for two years in a row, 2020 and 2021. Zenius was also listed by the Financial Times as one of the fastest-growing companies in the Americas in 2021. Zenius is a 2019 Best of Leesburg winner (Business Management Consultant category).
BENEFITS
Zenius Corporation is a very employee-oriented company. Join us now and help us grow!
We offer a competitive benefits package that includes paid holidays and paid time off; medical insurance, including health, vision, and dental coverage; 401(k) matching; a Flexible Spending Account; and flexible schedules as per business needs. We also work with our employees on training and professional certification plans that benefit the employee.
EQUAL OPPORTUNITY EMPLOYER:
Zenius Corporation provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran, in accordance with applicable federal, state, and local laws. Zenius complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.
Zenius Corporation expressly prohibits any form of unlawful employee harassment based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.