Turnitin is a recognized innovator in the global education space. For more than 20 years Turnitin has partnered with educational institutions to promote honesty consistency and fairness across all subject areas and assessment types. Over 16000 academic institutions publishers and corporations use our products and services.
At Turnitin working remotely is our default. We respect local cultures embrace diversity and we respect personal choice. Turnitin is headquartered in Oakland with offices in Dallas Pittsburgh Newcastle (UK) Stockholm (Sweden) Cologne (Germany) Amsterdam (Netherlands). Our diverse community of colleagues is unified by a shared desire to make a difference in education. Our remotefirst culture allows for every employee to get the same access to learning and career opportunities and it enables us to think differently about where and how we recruit talent from all kinds of diverse backgrounds.
Machine Learning is integral to the continued success of our company. Our product roadmap is exciting and ambitious. You will join a global team of curious helpful and independent scientists and engineers united by a commitment to deliver cuttingedge wellengineered Machine Learning systems. You will work closely with product and engineering teams across Turnitin to integrate Machine Learning into a broad suite of learning teaching and integrity products.
We are in a unique position to deliver Machine Learning used by hundreds of thousands of instructors teaching millions of students around the world. Your contributions will have global reach and scale. Billions of papers have been submitted to the Turnitin platform and hundreds of millions of answers have been graded on the Gradescope and Examsoft platforms. Machine Learning powers our AI Writing detection system gives automated feedback on student writing investigates authorship of student writing revolutionizes the creation and grading of assessments and plays a critical role in many backend processes.
Responsibilities and Requirements:
We expect Senior Machine Learning Scientists to be versatile and have a wellbalanced set of skills. You will focus on model training and maintenance with significant capacity for research (developing novel model architectures) dataset construction and model hardening (preparing the model and code for production pipelines).
Daytoday your responsibilities are to:
- Work with subject matter experts and product owners to determine what questions should be asked and what questions can be answered.
- Work with subject matter experts to curate generate and annotate data and create optimal datasets following responsible data collection and model maintenance practices.
- Answer questions and make trainable datasets from raw data using efficient SQL queries and scripting languages visualizing when necessary.
- Develop and tune Machine Learning models following best practices to select datasets architectures and model parameters.
- Utilize adopt and finetune Language Models including thirdparty LLMs (through prompt engineering and orchestration) and locally hosted LMs.
- Stay current in the field read research papers experiment with new architectures and LLMs and share your findings.
- Optimize models for scaled production usage.
- Communicate insights as well as the behavior and limitations of models to peers subject matter experts and product owners.
- Write clean efficient and modular code with automated tests and appropriate documentation.
- Stay up to date with technology make good technological choices and be able to explain them to the organization.
Qualifications :
Required Qualifications:
- Experience working with text data to build Deep Learning and ML models both supervised and unsupervised. Experience with deep learning in other modalities such as vision and speech would be a strong bonus.
- A strong understanding of the math and theory behind machine learning and deep learning.
- Software engineering background with at least 35 years of experience (we use Python SQL Unixbased systems git and github for collaboration and review).
- Machine / Deep Learning development skills including experiment tracking (we use AWS SageMaker Hugging Face transformers PyTorch scikitlearn Jupyter Weights & Biases).
- An understanding of Language Models using and training / finetuning and a familiarity with industrystandard LM families.
- Masters degree or PhD in Computer Science Electrical Engineering AI Machine Learning applied math or related field with relevant industry experience or outstanding previous achievements in this role. A Computer Science background is required as opposed to statistics or pure mathematics. Were an applied science group leaning towards deep learning and therefore software development proficiency is a prerequisite.
- Excellent communication and teamwork skills.
- Fluent in written and spoken English.
Would be a plus:
- Familiarity in coding for atscale production ranging from best practices to building backend API services or standalone libraries.
- Essential devops skills (we use Docker AWS EC2/Batch/Lambda).
- Familiarity in building frontends (LLMs or more standard React Javascript Flask) for simple demos POCs and prototypes.
- Experience with advanced prompting finetuning or training an LLM opensource or cloud using industry accepted platforms (such as mosaic.ai or stochastic.ai).
- Showcase previous work (e.g. via a website presentation open source code).
Remote Work :
Yes
Employment Type :
Fulltime