SRE Site Reliability Engineer

Employer Active

This job posting is outdated, and the position may be filled.

Job Location

Alexander City - USA

Salary

Not Disclosed

Job Description

Role: Sr. SRE (Site Reliability Engineer)

Location: Seattle, WA (must come to the office 3 days a week)

Job Type: Contract (12 months)

Core skills needed

Azure Cloud and AKS: scalability, monitoring, and deployment; checking logs and ensuring node and pod health.

Databases include Cassandra, Mongo, and Postgres.

Databricks notebooks: there are a lot of jobs on Databricks. Requires Databricks experience, including knowing how a notebook is created and run, running queries against the database, finding discrepancies, and performing fixes.

Microservices: responsible for deployment; the scripting language is Python.

Should have an understanding of Terraform.

Emphasis on logs and monitoring (Datadog and Splunk).

Summary of Experience

  • Requires 10-12 years of experience in the IT industry.
  • Requires 9 years of software and DevOps development engineering.
  • Experience working in a cloud environment; Azure preferred.
  • Experience with Kubernetes; Azure Kubernetes Service (AKS) preferred.
  • Experience using Kafka, Event Hub, NATS, or any messaging broker.
  • Experience with Cassandra, PostgreSQL, Mongo, Elasticsearch, and Cosmos DB.
  • Experience with Azure DevOps / Jenkins / Python / Terraform / Ansible.
  • Experience with Databricks.
  • Experience with Datadog, Splunk, or other logging and APM tools.
  • Experience working in a Linux environment.
  • In-depth understanding of Computer Science fundamentals in object-oriented design, data structures, algorithms, and problem solving.
  • Experience building complex, scalable, high-performance software systems that have been successfully delivered to customers.
  • Demonstrated knowledge of best practices for the design and implementation of large-scale systems, as well as experience in taking such systems from design to production.
  • Experience building and operating mission-critical, highly available (24x7) systems.
  • Ability to work well with a team in a fast-paced agile development environment.
  • Bachelor's in Computer Science or equivalent work experience.
  • Excellent communication, analytical, and problem-solving skills.
  • Extensive understanding of SDLC and Scrum methodologies.

Job Summary and Mission

We are seeking an experienced, self-motivated Senior Engineer who is technically very strong, with a strong Linux background, deep knowledge of microservices, backend storage design, NoSQL databases, and distributed systems, and very good troubleshooting skills. Typical activities include production monitoring, creating monitoring dashboards, setting up alerts, and triaging alerts, coupled with the ability to drive efforts and solution improvements effectively across various IT and business functions. In this role, the person will be responsible for setting up monitoring dashboards and alerts, maintaining production systems, deploying code to production, monitoring alerts, resolving issues, and leading production troubleshooting calls. The role also involves working with Product Owners and other developers to implement highly scalable, reactive application platform solutions in cloud-based Linux environments.

Summary of Key Responsibilities

Responsibilities and essential job functions include but are not limited to the following:

  • Responsible for the health of the production system.
  • Develop monitoring dashboards.
  • Configure alerts and automate processes for system recovery.
  • Monitor alerts and take proactive steps to resolve system issues.
  • Troubleshoot production issues.
  • Lead production troubleshooting calls.
  • Responsible for patches and updates on production systems.
  • Design and build cutting-edge multi-microservice solutions to support Starbucks's growth worldwide.
  • Work with cross-functional teams for ongoing design efforts and systems support.
  • Automate password and certificate rotations on application and DB servers.
  • Help the CI/CD team roll out applications and infrastructure globally.
  • Collaborates with the development team, other Information Technology (IT) teams, and developer leads. Initiates process improvements for new and existing systems.
  • Coaches and mentors other team members. Performs cross-training and facilitates information sharing among team members.
  • Participates in a production support rotation that includes pager responsibilities.
  • Ability to accurately break down complex application designs into component deliverables and estimate design and development timelines.

General IT Skills

  • Experience in application support, problem diagnosis, and resolution.
  • Expert in interpretation of functional requirements.
  • Development of technical design specifications for complex projects.
  • Expert in industry-standard development methodologies.
  • Experience in middleware integration using tools like webMethods.
  • Integrate application support efforts with concurrent, parallel application development efforts.

Employment Type

Full Time
