*** Selected candidate must reside within two (2) hours of the Client's headquarters in Woodlawn, MD.
*** Selected candidate must be willing to work onsite at least 2 days a week.
Position Description:
- Staying updated on new methods in NLP, ML, and Generative AI.
- Understanding real-world challenges and developing automated data solutions.
- Developing, testing, and deploying new techniques for NLP understanding.
- Scalable development/deployment of ML and Generative AI approaches, such as Large Language Models (LLMs).
- Training and optimizing NLP/LLM models and creating Python-based pipelines.
- Determine the nature of analytic problems, evaluate options, and offer recommendations for resolution.
- Advise on the methods and data needed and/or available to evaluate the (intelligence or data) problem.
- Collaborate with data collectors and analysts to identify and close gaps on complex monitoring problems.
- Provide accurate, timely, complex, and sophisticated data analysis.
Key Required Skills:
- Strong knowledge of AI/ML/LLM, Python, NLP, and Generative AI, and experience with the clinical domain.
Requirements
Basic Qualifications
- Bachelor's degree with 12+ years of experience.
- Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science with industry experience in NLP, data science, and AI/ML/LLM engineering.
- Minimum of 8 years of Data Scientist experience.
- Must be able to obtain and maintain a Public Trust (contract requirement).
Required Skills
- Experience with Natural Language Processing (NLP), Generative AI, and Large Language Models (LLMs).
- Fluency in Python programming, version control and collaboration with Git, standard Python packages (e.g., pandas, NumPy, Matplotlib), and ML frameworks.
- Knowledge of TensorFlow, PyTorch, scikit-learn, NLTK, Azure ML (optional), and Amazon Web Services EC2.
- Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search.
- Expert knowledge in conducting data analysis and applying advanced statistical concepts and machine learning methods to build, train, test, and evaluate a variety of supervised and unsupervised analytic models.
- Experience with ML model deployment and operations such as DevOps, MLOps, and LLMOps.
- Experience with NLP and Generative AI tools and libraries such as regular expressions, spaCy, and LangChain, as well as text annotation tools and semantic frameworks.
- Experience with statistical and machine learning software such as pandas and scikit-learn.
- Prior experience working on applications that relate to the clinical domain.
- Ability to clean and process large amounts of real-world data.
- Experience retrieving and manipulating data from a variety of data sources, including Db2, Oracle, SQL Server, Hadoop, and flat files.
- Experience with database management systems, e.g., MySQL, SQLite, SQL.
- Either experience with, or the ability and willingness to learn, distributed processing via the Hadoop ecosystem, i.e., Spark, Impala, and Hive.
- Excellent analytical skills to identify potential risks and propose effective solutions.
- Clear communication skills to convey complex technical concepts to various partners.
- Ability to collaborate with cross-functional teams.
- Problem-solving skills and proven written and verbal communication with various audiences, including executive leadership.
Desired Skills
- Prior experience with federal or state government IT projects.
- Prior experience working on applications that relate to the clinical domain.
- Experience working in an analytical research environment.
- Experience with statistical and machine learning software such as pandas and scikit-learn.
- Experience in parallel processing, such as GPU programming with CUDA.
- Mathematica
- Experience using markup languages such as LaTeX and HTML.
- Natural Language Processing for anomaly detection
Salary: $175,000 to $180,000 with benefits
Benefits:
1. Holiday Benefit: 10 holidays per year.
2. Vacation Benefit: 10 vacation days per year, accrued on a weekly basis.
3. Sick Leave Benefit: 5 personal/sick leave days per year, accrued on a weekly basis.
4. Medical Insurance Reimbursement Benefit: Medical Insurance Allowance (QSEHRA) reimbursement. Eligibility date depends on enrollment; employee purchases own plan.
5. AFLAC Supplemental Insurance Plan: available.
6. 401(K) Retirement Plan: 401(K) Retirement Savings Plan.