We are seeking a Senior Data Scientist with extensive expertise in large language models (LLMs) and machine learning to drive advancements in resume and skills extraction for title normalization. This role focuses on leveraging LLMs to analyze unstructured text data create machine learning pipelines and deploy models in production environments.
Key Responsibilities
- Conduct research and implement LLMbased solutions to improve resume and skills extraction processes.
- Develop and maintain machine learning pipelines using Python and SQL for data processing and modeling.
- Build deploy and optimize models for understanding unstructured text data including skills credentials and titles.
- Collaborate with crossfunctional teams to ensure insights and models align with business goals and meet production constraints.
- Stay updated with the latest research in LLMs and machine learning to integrate cuttingedge methods into solutions.
Qualifications :
Required Skills
- Strong experience with LLMs including implementing and deploying them in production environments.
- Expertise in machine learning deep learning (e.g. MLPs or computer vision) and natural language processing (NLP).
- Proficiency in Python for creating data and machine learning pipelines.
- Solid understanding of SQL and Java for data processing and model integration.
- Familiarity with code versioning tools like Git.
- Experience in model development deployment and writing models from scratch.
- Selfstarter with strong communication skills and the ability to collaborate effectively.
NicetoHave Skills
- Familiarity with working on unstructured text data extraction.
- Experience with deep learning methods for resume parsing and title normalization.
Additional Information :
This role is an exciting opportunity for an experienced Data Scientist to lead impactful projects at the intersection of LLMs machine learning and data extraction.
Remote Work :
Yes
Employment Type :
Contract