Job descriptions:
We operate in an Azure Databricks Lakehouse. Well need a person with:
Azure experience ADF for orchestration ADLS for storage AzureDevOps for CI/CD
Databricks experience all compute/ETL leverages Databricks and is programmed leveraging Spark (PySpark SparkSQL)
PowerShell experience this is our scripting language of choice
SQL proficiency its used everywhere (TSQL PostgreSQL)
Proficiency with parquet and delta formats
Additionally they will need experience in:
SDLC CI/CDWe follow a standard deployment process (dev test prod) that includes peerreviewed code. They need to be comfortable with standard DevOps practices.
Should have a deep understanding of indexes and partitioning.
Should be proficient optimizing code for performance (able to read a DAG determine where CBO is using most resources)
Should be proficient in writing code in a matter that it can run repeatedly and produce the same state (we have a custom SQL Deployment framework)
Start date for this requirement is Feb 1st 2025
Mandatory Areas
Must Have Skills
Skill 1 Yrs of Exp 6 Experience with Azure platform ecosystem products such as: Synapse ADF Azure Active Directory Data Lake Storage (ADLS)
Skill 2 Yrs of Exp 6 Experience with one or more ETL platforms: ADF NiFi Alteryx SSIS
Skill 3 Yrs of Exp 6 Experience in one or more of: Java Perl Python PowerShell groovy bash
Skill 4 Yrs of Exp 6 Experience with one or more databases: SQL Server ADW Postgres
Skill 5 Yrs of Exp 6 Experience with Jira Git Azure DevOps CI/CD
Domain Experience (If any ) NA
Must have Certifications None
Location Remote is fine ; rate will vary based on location ;
Onsite Requirement Prefer onsite but remote Ok for technically strong candidates
Number of days onsite 3 days
Note : The candidate should have Azure data experience.
For your ease I would recommend finding good Azure data engineer with skillsets with background in scripting and PowerBI or other similar visualization tools. The engineer should have a problem resolution mindset and should be ready to work as a Data platform SRE