MLOps Engineer
Irving TX
Job Description :
- Model Development :Collaborate with data scientists to develop train and validate machine learning models.
Implement algorithms and techniques suitable for realtime data processing and inference.
Design and implement robust deployment pipelines for machine learning models in realtime environments.
- Utilize cloud services (G CP or AWS) for deploying and scaling machine learning models.
System
Architect and optimize endtoend machine learning solutions that integrate seamlessly with existing infrastructure.
- Ensure solutions are built for scalability maintainability and high availability.
- Performance Monitoring:
Monitor model performance and ensure realtime systems are operating at optimal levels.
- Implement logging tracking and alerting mechanisms to identify and address model drift or system failures.
Collaboration:
Work closely with crossfunctional teams including data engineers software developers and product managers to align on project goals and deliverables. Communicate technical concepts to nontechnical stakeholders effectively.
Create and maintain documentation for model development deployment processes and system architecture.
- Document best practices and contribute to knowledgesharing initiatives within the team.
Continuous Improvement:
Stay uptodate with the latest trends in machine learning and cloud technologies.
- Proactively identify areas for improvement in existing processes and models.
- Qualifications:
Education:
Bachelors degree in Computer Science Data Science Mathematics or a related field. Masters degree preferred.
3 years of experience in machine learning engineering data science or related fields.
- Proven experience with realtime model deployment on cloud platforms (AWS GCP).
Familiarity with tools like TensorFlow PyTorch Scikitlearn or similar libraries.
Proficient in programming languages such as Python Java or Scala.
Strong understanding of data structures algorithms and machine learning concepts.
- Experience with containerization technologies (Docker Kubernetes) for model deployment.
- Knowledge of cloud services like AWS SageMaker Google AI Platform or similar.
Soft
Strong analytical and problemsolving skills.
Excellent communication skills with the ability to articulate complex ideas to diverse audiences.
- Ability to work in a fastpaced agile environment and manage multiple priorities.
- Preferred Qualifications:
Experience with big data technologies (e.g. Apache Spark Hadoop).
- Familiarity with monitoring and observability tools (e.g. Prometheus Grafana).
Understanding of CI/CD pipelines for machine learning (e.g. MLflow Kubeflow).