drjobs Sr SRE Site Reliability Engineer العربية

Sr SRE Site Reliability Engineer

Employer Active

drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Alexander City - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Job Description

Sr. SRE ( Site Reliability Engineer) Data DevOps/ DataOps/ No SQL Kafka Databricks Kubernetes Kafka Terrafoam

Imp Note

This is a Sr. SRE role and not devops role

Kubernetes skill level expert is required

Kafka skill level expert is required

Terraform skill level expert is required

Databricks skill level intermediate is ok

NOSQL Database Cassandra Mongo PostGres very imp for this role

Pl match skills before submitting resumes

Core skills needed

Azure Clous AKS Scalability monitoring deployment check logs ensure node and pod health.

Databases include Cassandra Mongo PostGres

Databricks Notebooks There are a lot of jobs on Databricks experience with Databricks to know how a notebook is created and run run queries against the database and finding discrepancies and perform fixes.

Based microservices responsible for deployment scripting language is python.

Should have an understanding around terraform.

Emphasis on Logs and Monitoring (datadog and splunk)

Summary of Experience

  • Requires 1012 years experience in the IT industry
  • Requires 9 years of software and DevOps development engineering
  • Experience in working with cloud environment Azure preferred.
  • Experience with Kubernetes Azure Kubernetes (AKS) preferred.
  • Experience with using Kafka Event Hub NATS or any messaging broker.
  • Experience with Cassandra PostgresSQL Mongo Elastic Search Cosmos DB
  • Experience on Azure DevOps Jenkins/ Python / Terraform / Ansible
  • Experience with Databricks
  • Experience with DataDog Splunk or other logging and APM tools.
  • Experience in working with Linux environment.

Summary of Key Responsibilities

Responsibilities and essential job functions include but are not limited to the following:

Responsible for health of production system

Develop monitoring dashboards

Configure alerts and automate process for system recovery

Monitor alerts and take proactive steps to resolve system issues

Troubleshoot production issues

Lead production troubleshooting calls

Responsible for patches and updates on production systems.

Design and build cuttingedge multimicro service solutions to support Starbuckss growth worldwide.

Helping CI/CD team during rolling out application and infrastructure globally.

Collaborates with development team other Information Technology (IT) teams developer leads. Initiates process improvements for new and existing systems.

Participates in a production support rotation that includes pager responsibilities.

Ability to accurately break down complex application designs into component deliverables and estimate design and development timelines

Employment Type

Full Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.