drjobs Data Analytics w Data Bricks - HYBRID العربية

Data Analytics w Data Bricks - HYBRID

Employer Active

drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Bethlehem, PA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Job Description

This is a contracttohire position with our Insurance client.
  • Must be strong on SQL Python and Databricks
  • Will undergo 1 hour live coding exercise during the interview
Location: Bethlehem PA. Resource will be required to work onsite a minimum of 3 days per week in the Bethlehem PA office. Local candidates preferred. If your candidate is not local to Bethlehem they will be required to relocate and work onsite from Day 1.


You will:
Collaborate with data scientists and analysts to understand data requirements and translate them into scalable high performant data pipeline solutions.
Support data discovery & data preparation for model development. Perform detailed analysis of raw data sources by applying business context and collaborate with crossfunctional teams to transform raw data into curated & certified data assets to be used for ML and BI use cases.
Collaborate with data science and data engineering team to build scalable and reproducible machine learning pipelines for training and inference.
Implement machine learning models into operations and processes via batch streaming and API methods.
Monitor and troubleshoot data pipeline performance identifying and resolving bottlenecks and issues.
Develop test and maintain robust tools frameworks and libraries that standardize and streamline the data & machine learning lifecycle.
Contribute to developing and maintaining endtoend MLOps lifecycle to automate machine learning solutions development and delivery.
Implement robust monitoring framework for model performance.
Collaborate with crossfunctional teams of Data Science Data Engineering business units and various IT teams.
Create and maintain effective documentation for project and practices ensuring transparency and effective team communication.

You Have:
Bachelors or masters degree with 5 years of experience in Computer Science Data Science Engineering or a related field.
4 years of experience in working with Python SQL PySpark and bash scripts. Proficient in software development lifecycle and software engineering practices.
2 years of handson experience in using Databricks platform
3 years of handson experience in operationalizing Machine Learning solutions which are used in live production processes.
2 years of experience and proficiency in API development using FastAPI frameworks and familiarity with containerization technologies like docker or Kubernetes.
3 years of experience in developing and maintaining robust data pipelines data to be used by Data Scientists to build ML Models.
3 years of experience working with Cloud Data Warehousing (Redshift Snowflake Databricks SQL or equivalent) platforms and experience in working with distributed framework like Spark.
Solid understanding of machine learning life cycle data mining and ETL techniques.
Experience with machine learning frameworks (like Keras or PyTorch) and libraries (like scikitlearn xgboost).
Handson experience in building and maintaining tools and libraries which have been used by multiple teams across organization.
Proficient in understanding and incorporating software engineering principles in design & development process.
Hands on experience with CI/CD tools (e.g. Jenkins or equivalent) version control (Github Bitbucket) Orchestration (Airflow Prefect or equivalent)
Excellent communication skills and ability to work and collaborate with cross functional teams across technology and business.

Good to have:
Familiarity with deep learning frameworks and deploying deep learning models for production use cases.
Familiarity in using GPU compute either for model training or inference.
Understanding of Large language models (LLM) and MLOps lifecycle for operationalizing LLM models.

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.