Sr Data Scientist - REMOTE

Simple Solutions

Posted on : 28-08-2024

Employer Active

1 Vacancy

Jobs by Experience

10 years

Job Location

Raleigh, NC - USA

Salary

Not Disclosed

Job Description

This is a remote position.

Share only senior profiles with 12 years of experience.

Senior Data Scientist II

Raleigh, NC (Hybrid). Remote is also fine.

As a data scientist on our team, you will work on new product development in a small team environment, writing production code in both run-time and build-time environments. You will help propose and build data-driven solutions for high-value customer problems by discovering, extracting, and modeling knowledge from large-scale natural language datasets, including matter and contract repository, invoice/legal spend data, and work management. You will prototype new ideas, collaborating with other data scientists as well as product designers, data engineers, front-end developers, and a team of expert legal data annotators. You will get the experience of working in a start-up culture with the large datasets and many other resources of an established company.

RESPONSIBILITIES
Develop and implement LLM-based applications tailored for in-house legal
Fine-tune and deploy large language models to enhance their performance on legal text processing tasks
Evaluate and help maintain our data assets and training/evaluation data sets
Design and build pipelines for preprocessing, annotating, and managing legal document datasets
Collaborate with legal experts to understand requirements and ensure models meet domain-specific needs
Conduct experiments and evaluate model performance to drive continuous improvements
Interface with other technical personnel or team members to finalize requirements
Work closely with other development team members to understand moderately complex product requirements and translate them into software designs
Successfully implement development processes, coding best practices, and code reviews for production environments

REQUIREMENTS
Formal training in machine learning: dimensionality reduction, clustering, embeddings, and sequence classification algorithms
Experience with deep learning frameworks such as PyTorch, TensorFlow, and Hugging Face Transformers
Practical experience in Natural Language Processing methods and libraries such as spaCy, word2vec, TensorFlow, Keras, PyTorch, Flair, and BERT
Practical experience with large language models, prompt engineering, fine-tuning, and benchmarking, using frameworks such as LangChain and LlamaIndex
Strong Python background
Knowledge of AWS, GCP, Azure, or another cloud platform
Understanding of data modeling principles and complex data models
Proficiency with relational and NoSQL databases as well as vector stores (e.g., Postgres, Elasticsearch/OpenSearch, ChromaDB)
Knowledge of Scala, Spark, Ray, or other distributed computing systems highly preferred
Knowledge of API development, containerization, and machine learning deployment highly preferred
Experience with ML Ops/AI Ops highly preferred

PREFERRED QUALIFICATIONS
MS in Data Science, Computer Science, Statistics, Machine Learning, or a related field
2+ years of relevant work experience
Or an undergraduate degree in a relevant field and 4+ years of relevant work experience

Employment Type

Full Time

Company Industry

Civil Engineering

About Company

Simple Solutions
