Sr Lead Data Engineer Spark SQL Data Bricks Jobs in The AES Group in Seattle, WA - USA

Sr Lead Data Engineer Spark SQL Data Bricks

The AES Group

Posted on : 10-12-2024

Employer Active

1 Vacancy

Job Location

Seattle, WA - USA

Monthly Salary

Not Disclosed

Job Description

Role: Sr Lead Data Engineer (Apache Spark: Spark Core, Spark SQL, Spark Streaming; Databricks)
Location: Hybrid (Seattle WA)

Job description
This position contributes to our success by building enterprise data services for analytic solutions. This position is responsible for the design, development, testing, and support of data pipelines that enable continuous data processing for data exploration, data preparation, and real-time business analytics. Models and acts in accordance with our guiding principles.

Top 3 Skills Needed

Data Engineering: 5 years
Databricks: 5 years
Spark / DevOps practices: 5 years

Years of Experience: 5 years

Technology requirements:

Proficiency in Apache Spark, including Spark Core, Spark SQL, and Spark Streaming.
Proficiency in languages such as Python for data processing and scripting.

Basic Qualifications/ Experience:

Experience in designing and implementing ETL processes using Databricks notebooks for efficient data extraction, transformation, and loading.
In-depth knowledge of the Databricks Unified Analytics Platform and its features for collaborative big data analytics.
Understanding of data modeling concepts for designing database structures.
Proficiency in working with both relational databases and NoSQL databases.
Integration of data from diverse sources, including APIs, databases, streaming data, and external data feeds.
Implementation of processes to ensure data quality, including data validation, cleansing, and error handling.
Knowledge of cluster management, optimization, and scaling for efficient data processing.
Optimization of Spark jobs and Databricks clusters for better performance.
Proficiency in cloud platforms such as Azure for building scalable and flexible data architectures.
Use of tools like Apache Airflow, ADF, or Databricks to orchestrate and schedule data workflows.

Degree or certifications required:

Education (minimum education level, degree, or certification necessary): Bachelor's degree in computer science, management information systems, or a related discipline

Skills (minimum skills required):

5-7 years: Architect and design large-scale, high-performance distributed systems
5-7 years: SQL platform
2 years: Exposure to a NoSQL platform is a plus
5 years: Hadoop, YARN, MapReduce, Pig or Hive, Spark
2 years: Data platform implementation on Azure or AWS is a plus

Key Responsibilities:

Responsibilities and essential job functions include but are not limited to the following:
Demonstrate deep knowledge and the ability to lead others in the data engineering team to build and support non-interactive (batch, distributed) & real-time, highly available data pipelines and technology capabilities
Translate strategic requirements into business requirements to ensure solutions meet business needs
Work with infrastructure provisioning & configuration tools to develop scripts to automate deployment of physical and virtual environments; to develop tools to monitor usage of virtual resources
Assist in the definition of architecture that ensures that solutions are built within a consistent framework
Lead resolution activities for complex data issues
Define & implement data retention policies and procedures
Define & implement data governance policies and procedures
Identify improvements in team coding standards and help in implementation of the improvements
Leverage subject matter expertise to coordinate issue resolution efforts across peer support groups, technical support teams, and vendors
Develop and maintain documentation relating to all assigned systems and projects
Perform systems and applications performance characterization and trade-off studies through analysis and simulation
Perform root cause analysis to identify permanent resolutions to software or business process issues
Lead by example by demonstrating the Client mission and value

Nice-to-Haves:

Knowledge of data security best practices and the implementation of measures to ensure data privacy and compliance.
Implementation of monitoring and logging solutions to track the health and performance of pipelines.
Familiarity with monitoring platforms like Datadog and New Relic.
Azure

Employment Type

Full Time
