drjobs Lead machine learning engineer Model Architectures amp Capabilities Team العربية

Lead machine learning engineer Model Architectures amp Capabilities Team

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

London - UK

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

The company

Nebius is a modern technology venture offering strategic partnerships to leading companies around the world empowering them to create their own local hyperscaler platforms and become trustworthy providers of cloud services and technologies in their own regions. In addition to innovative software and hardware including server racks designed inhouse Nebius provides a launchready business model and customizable tools for support sales and marketing.

Our aim is to empower our partners to create their own IT infrastructure and deliver cuttingedge disruptive cloud solutions to local markets while keeping security and compliance with international standards like ISO and GDPR top priorities.

Were a global company with offices in the Netherlands Israel and Serbia.


Our team

Nebius was founded by a core team of engineers and business professionals with a proven track record of using cloud technologies to create value for other businesses. We know from experience that cuttingedge technologies can only make an impact if their innovation is matched by the level of the experts managing them so as Nebius expands our core priority is to attract the most qualified enthusiastic and driven individuals we can to join our growing team.

Nebius Large Language Models (LLM) team is dedicated to pushing the boundaries oflanguage modelling technology.We are focused on developing a stateoftheart LLM technological stack that spans webscaledata collection foundational model training and alignment. Our overarching objective is topioneer cuttingedge language generation technology for both internal use and customerapplications driving the evolution of the next generation of AIpowered products.


The role

We are currently in search of the team lead for the Model Architectures & Capabilities team.The team is responsible for pushing forward the capabilities of the models small and largethat we train inhouse. This includes finding model architectures that efficiently achieve thedesired capabilities scaling these models to the limits of our hardware and exploring novelideas that could potentially expand of what is possible.

In this position your responsibility will be to:

  • Lead the team responsible for model architectures and capabilities
  • Define strategy and tactics i.e. figure out what research and engineering directions topursue to push the technology forwards and help the team plan and execute theexperiments that will get us there efficiently
  • Ensure high standards of engineering and research activities within the team
  • Keep improving the design our internal infrastructure for training large models toensure it keeps being fast and flexible despite the technology moving forwards
  • Mentor our engineers and researchers

We expect you to have:

  • A profound understanding of theoretical foundations of machine learning
  • Deep expertise in modern deep learning for language processing and generation
  • Substantial experience with pretraining large models on huge clusters
  • Good understanding of performance aspects of large neural network training (shardingstrategies custom kernels hardware features etc.)
  • Strong software engineering skills (we mostly use python)
  • Deep experience with modern deep learning frameworks (we use jax)
  • Proficiency in contemporary software engineering approaches including CI/CDversion control and unit testing
  • Strong communication and leadership abilities

      It would be an added bonus if you had:

      • Bachelors degree in Computer Science Artificial Intelligence Data Science or arelated field. Masters or PhD preferred
      • Track record of building and delivering products (not necessarily MLrelated) in adynamic startuplike environment
      • Experience in engineering complex systems such as large distributed data processingsystems or highload web services
      • Opensource projects that showcase your engineering prowess
      • Excellent command of the English language alongside superior writing articulationand communication skills

      Does all that sound like your kind of challenge Then join us!

      Employment Type

      Full Time

      Company Industry

      About Company

      Report This Job
      Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.