Job Title: PySpark Developer
Location: Louisville, KY
Type: Full Time
Job Description:
- Experience building complex, highly scalable, and reliable data pipelines using technologies such as Databricks, Spark, Kafka, Hive, Parquet, and Avro
- Software development experience with high-level languages such as Python, Scala, and Java, as well as SQL and NoSQL databases
- Experience automating data flows with pipeline orchestration tools such as Azure Data Factory, Airflow, and Prefect
- Experience establishing and promoting high standards in DevOps, security, CI/CD, monitoring, data validation, and testing
- Experience with cloud platforms (preferably Azure) and with building enterprise-level data lakes
- Experience working with Terraform
- Experience using Docker and Kubernetes
- Experience with Microservices and RESTful APIs
- Understanding of data catalogs, data governance, and data lineage
- Experience with integration of data from multiple data sources
- Knowledge of various ETL techniques
- 5+ years of experience in data engineering