Must Have Technical Capabilities (If applicable) :
Proficiency in GitHub CI/CD pipelines Terraform and AWS services like Lambda API Gateway and S3.
Strong knowledge of Spark SQL and PySpark for processing large datasets.
Hands on experience databricks AWS Glue and Airflow.
Basic understanding of system design principles for building scalable and maintainable systems.
Familiarity with data modelling techniques and data warehouse design (Kimball star schema snowflake schema).
Excellent problemsolving skills and ability to work with analysts to resolve datarelated queries.
Experience with Power BI for building interactive dashboards and reports.
Exposure to containerization technologies like ECR and ECS.
Essential Job Functions & Duties :
Learning and Adaptation :
Stay up to date with the latest trends and technologies in data engineering and analytics.
Quickly adapt to new tools platforms and methodologies.
Platform Optimization :
Analyse and provide suggestions for improving the efficiency reliability and scalability of the data platform.
Drive innovation in data processes and workflows.
Data Engineering & Infrastructure :
Work with modern tools and technologies such as GitHub CI/CD Terraform AWS Lambda API Gateway S3 ECR ECS Spark SQL and PySpark.
Support the development and deployment of scalable data pipelines and services using databricks AWS Glue or Airflow.
Collaboration :
Partner closely with analysts to resolve queries and ensure they have access to the right data for insights.
Act as a bridge between technical teams and business stakeholders.
Data Modelling :
Apply knowledge of data modelling techniques including Kimbal methodology star schemas and snowflake schemas to design efficient data warehouses and marts.
Reporting & Visualization (Good to Have) :
Utilize tools like Power BI to create reports and dashboards making data accessible and actionable for stakeholders.
Other Responsibilities :
A team player with strong communication skills capable of collaborating across technical and nontechnical teams.
Someone with a growth mindset eager to learn new technologies and contribute to
continuous improvement.
An analytical thinker who can provide creative solutions to complex data challenges.