drjobs Senior ML Data Engineer

Senior ML Data Engineer

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Plano - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Job Title: Senior ML Data Engineer

Work Location: Plano TX (Remote)

Job Description:

Senior ML Data Engineer Feature Engineering ETL

We seek a talented Senior Data Engineer with ML feature engineering expertise to join our Consumer ML team This role involves designing and implementing advanced feature engineering and ETL pipelines to enable robust machine learning applications The ideal candidate has handson experience with Databricks a deep understanding of the medallion architecture and a proven track record of supporting the endtoend ML lifecycle Experience with MLFlow and an aptitude for creating data driven insights are highly desirable.

Key Responsibilities

  • Feature Engineering Data Integration Develop and maintain feature engineering pipelines using Databricks to support ML models effectively
  • Data Pipeline Development Integrate diverse data sources eg clickstreams user Behaviour demographic data to create user Behaviour features profiles for complex ML tasks
  • Medallion Architecture Design and implement ETL ELT pipelines aligned with the bronze silver and gold layers of the medallion architecture
  • Model Support Build data pipelines to support ML model training calibration and deployment leveraging MLFlow for experiment tracking and performance monitoring
  • Query Optimization Low Latency Pipelines Design low latency production ready data pipelines to support realtime and batch model inference
  • CICD Practices Apply CICD principles for seamless pipeline deployment
  • Data Governance Ensure pipelines comply with security and regulatory standards particularly for handling PII and maintain metadata and master data across the data catalogue
  • Collaboration Work closely with ml scientists ml engineers and other stakeholders to align data transformation with business objectives

Qualifications

  • 7 years in data engineering and at least 4 years focusing on ML feature engineering ETL pipeline development and data preparation for ML
  • Proven experience managing pipelines on Databricks using Apache Spark with a strong understanding of the medallion architecture
  • Familiarity with ML lifecycle management with MLFlow experience as a strong plus and advanced skills in Apache Spark PySpark for big data processing and analytics
  • Proficient in Python for data manipulation and SQL for query optimization
  • Experience building pipelines for realtime and batch model serving in production environments and knowledge of CICD practices for ETLELT pipeline development
  • Expertise in metadata and master data management within technical data catalogues
  • Understanding of data security and compliance especially with sensitive data like PII

S No.

Skill

Experience (In years)

Last Used (In Yrs)

1

Data Engineering

2

Machine Learning

3

Python

4

ETL

5

Advance SQL

6

PySpark

7

MLFlow/Apache Air Flow

Thanks & Regards

Naveen Paila

Email:

Employment Type

Full Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.