This is a remote position.
We are seeking a Senior ML Engineer (NLP Generative AI). to join our team. As a Senior ML Engineer focusing on LLM and Gen AI solutions you will collaborate directly with the CTO and Product team to design and implement sophisticated applications enabling search retrieval and conversation through stateoftheart language models. This role requires a combination of technical expertise and business insight to transition hypotheses and ML experiments into fullyfledged AI/ML products with realworld impact.
Responsibilities:
- Collaborate with crossfunctional teams to understand business requirements and design endtoend AI solutions for medical device regulatory workflows.
- Analyse and preprocess diverse datasets to extract meaningful information for AI applications including the development of custom algorithms for document layout identification and processing.
- Extract key information from raw regulatory data and develop domain specific knowledge graphs. Implement graphbased RAG (RetrievalAugmented Generation) solutions using LLMs like GPT Falcon and LLaMA.
- Finetune pretrained transformerbased models for domainspecific natural language understanding conversation and summarization.
- Apply and research techniques to optimize existing LLM training and serving focusing on improving model quality and performance.
- Conduct experiments and benchmarking to assess model performance and optimize hyperparameters. Optimize LM prompts and hyperparameters using various optimizers and evaluation metrics.
- Streamline the development and deployment of Generative AI models using cloud infrastructure and tools such as cognitive search VectorDB and frameworks like LangChain LlamaIndex Haystack DSPY Semantic Kernel Ray and AutoGen.
- Stay uptodate with advancements in Generative AI and LLMs applying new techniques and methodologies to enhance our models.
- Document research findings methodologies and model architectures. Prepare technical reports and presentations for both technical and nontechnical stakeholders.
- Collaborate with engineering and ML Ops teams to transition AI innovations into scalable and reliable operational solutions.
- Ensure compliance with legal regulatory and organizational guidelines in the use and functioning of NLP tools and LLMs.
Requirements
- A Master s or Ph.D. degree in Computer Science Artificial Intelligence or a related field. Strong background in deep learning generative models and NLP.
- 8 years experience in field of AI/ML with experience in the endtoend development of AI products/solutions.
- Experience in handling largescale text based datasets is highly desirable.
- Knowledge of medical device regulatory domain is preferred.
- Knowledge of the latest developments in field of Generative AI and related tools and techniques.
- Proficiency with relational NoSQL and graph databases.
- Strong programming skills in Python and familiarity with relevant AI/ML libraries and tools. Strong mathematical and statistical skills including linear algebra matrix operations and probability theory.
- Deep understanding of probabilistic modelling Bayesian inference HMM topic modelling sentiment analysis NER text classification word embeddings machine translation sequence labelling dependency parsing information extraction and statistical language modelling.
Benefits
- Work Location: Remote
- 5 days working
A Master s or Ph.D. degree in Computer Science, Artificial Intelligence, or a related field. Strong background in deep learning, generative models, and NLP. 8+ years experience in field of AI/ML with experience in the end-to-end development of AI products/solutions. Experience in handling large-scale text based datasets is highly desirable. Knowledge of medical device regulatory domain is preferred. Knowledge of the latest developments in field of Generative AI and related tools and techniques. Proficiency with relational, NoSQL, and graph databases. Strong programming skills in Python and familiarity with relevant AI/ML libraries and tools. Strong mathematical and statistical skills, including linear algebra, matrix operations and probability theory. Deep understanding of probabilistic modelling, Bayesian inference, HMM, topic modelling, sentiment analysis, NER, text classification, word embeddings, machine translation, sequence labelling, dependency parsing, information extraction, and statistical language modelling.