Overview
The Data Engineer plays a crucial role in harnessing data to drive insights and improve decision-making within our organization, particularly in the healthcare sector. This position is fundamental because it builds and maintains the data infrastructure that enables data analysts and data scientists to work effectively. Leveraging Google Cloud Platform (GCP), the Data Engineer will be responsible for designing efficient data pipelines, ensuring seamless data integration, and optimizing data storage solutions. This role requires a solid understanding of healthcare data regulations and the ability to handle sensitive information securely and efficiently. With the healthcare industry rapidly embracing data-driven strategies, the Data Engineer will help identify opportunities to improve patient care, streamline operations, and enhance the overall healthcare experience. Working closely with cross-functional teams, the Data Engineer will also play a pivotal role in implementation projects that require data management solutions, making this position essential for the successful delivery of healthcare services powered by advanced analytics.
Key Responsibilities
- Design, build, and maintain scalable data pipelines in GCP.
- Integrate diverse healthcare data sources and formats.
- Implement ETL processes to extract, transform, and load data efficiently.
- Collaborate with data scientists and analysts to understand data requirements.
- Optimize data storage solutions using BigQuery and other GCP tools.
- Develop data models that support analytics and reporting.
- Ensure compliance with healthcare data regulations and privacy standards.
- Monitor data pipeline performance and troubleshoot issues.
- Document data architecture and maintenance processes.
- Utilize SQL and Python for data manipulation and transformation.
- Implement data governance practices to ensure data quality.
- Participate in code reviews and contribute to team knowledge sharing.
- Work with stakeholders to gather requirements and deliver data solutions.
- Evaluate and adopt new technologies to enhance data capabilities.
- Support machine learning initiatives with well-structured datasets.
- Stay updated on healthcare data trends and innovations.
Required Qualifications
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Minimum of 3 years of experience in data engineering, particularly within the healthcare domain.
- Proven experience with Google Cloud Platform tools and architecture.
- Strong proficiency in SQL and data manipulation languages.
- Experience with ETL tools and data integration techniques.
- Solid understanding of data governance and data quality metrics.
- Proficiency in Python or other programming languages for data processing.
- Familiarity with BigQuery, Cloud Storage, and Dataflow.
- Experience working with healthcare data types and standards (e.g., HL7, FHIR).
- Strong analytical and problem-solving skills.
- Excellent verbal and written communication abilities.
- Ability to collaborate effectively with interdisciplinary teams.
- Strong attention to detail and commitment to data accuracy.
- Experience with data visualization tools is a plus.
- Knowledge of machine learning concepts is an advantage.
healthcare,gcp,cloud storage,python,data quality metrics,bigquery,data modeling,sql proficiency,etl processes,google cloud platform (gcp),sql,data governance,dataflow