Overview:
The Data Engineer plays a crucial role in our organization responsible for developing constructing testing and maintaining architectures such as databases and largescale processing systems. They will work closely with the data science team and business stakeholders to understand their requirements and provide the infrastructure to support their work.
Key Responsibilities:
- Design and develop data pipelines and ETL processes using Snowflake Azure (ADLS ADF Synapse) and/or AWS (Redshift EMR Glue).
- Collaborate with data architects modelers and IT team members on project goals and solution methods.
- Extract transform and load data from various sources into Snowflake and other data platforms.
- Implement and support data infrastructure solutions using Informatica DataStage and other relevant tools.
- Manage and optimize data pipeline performance troubleshoot data quality issues and ensure data integrity.
- Deploy Machine Learning models and algorithms into production environment.
- Design and develop scalable data processing and analytics solutions on cloud platforms.
- Create and maintain technical documentation related to data engineering processes and systems.
- Collaborate with crossfunctional teams to address business needs and drive business value through data solutions.
- Assist in implementing data governance and security best practices.
Required Qualifications:
- Bachelors degree in Computer Science Engineering or a related field.
- Proven experience in building and optimizing big data data pipelines architectures and data sets.
- Proficiency in SQL and experience with largescale databases (e.g. Snowflake Redshift).
- Handson experience with cloud platforms such as Azure and/or AWS.
- Experience with ETL tools like Informatica DataStage and data orchestration tools like Azure Data Factory.
- Strong programming skills in Python Java or Scala.
- Experience with data modeling data warehousing and building data lakes.
- Understanding of data management and storage principles.
- Ability to work in a fastpaced collaborative and agile development environment.
- Excellent problemsolving and communication skills.
etl,scala,glue,java,python,data warehousing,informatica,data modeling,datastage,data engineering,aws,data lakes,azure,sql,snowflake