drjobs Site Reliability Engineer العربية

Site Reliability Engineer

Employer Active

drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Chicago, IL - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Job Description

Job Description:

As a Site Reliability Engineer you will be responsible for maintaining and improving the reliability availability and performance of our systems. You will collaborate closely with development operations and security teams to build and automate scalable infrastructure monitor system health and address issues before they impact users. The ideal candidate will have a strong background in both software development and systems engineering with a passion for automation monitoring and continuous improvement.

Key Responsibilities:

  • Design implement and manage highly available and scalable infrastructure in cloud and onpremises environments.
  • Develop and maintain automation scripts and tools to streamline operations deployments and monitoring.
  • Monitor system performance and availability using monitoring tools (Prometheus Grafana Nagios etc.) and respond to incidents to minimize downtime.
  • Work closely with development teams to design and deploy reliable efficient and secure services.
  • Conduct root cause analysis of incidents and implement solutions to prevent recurrence.
  • Implement and manage CI/CD pipelines to automate code deployment and infrastructure changes.
  • Optimize system performance capacity and cost by identifying bottlenecks and areas for improvement.
  • Develop and enforce best practices for incident management disaster recovery and business continuity.
  • Participate in oncall rotations to ensure 24/7 support for critical systems and services.
  • Collaborate with security teams to ensure systems are secure and compliant with relevant standards and regulations.

Qualifications:

  • Bachelors degree in Computer Science Information Technology or a related field; relevant certifications (AWS Certified DevOps Engineer Google Professional SRE) are a plus.
  • Minimum of 35 years of experience in site reliability engineering systems engineering or a related role.
  • Strong experience with cloud platforms (AWS Azure Google Cloud etc.) and containerization technologies (Docker Kubernetes etc.).
  • Proficient in scripting and programming languages (Python Go Bash etc.) for automation and tooling.
  • Experience with configuration management tools (Ansible Puppet Chef etc.) and infrastructure as code (Terraform).
  • Solid understanding of networking security and system administration.
  • Experience with CI/CD tools and practices (Jenkins GitLab CI CircleCI etc.).
  • Excellent problemsolving skills and the ability to troubleshoot complex systems.
  • Strong communication and collaboration skills with a focus on teamwork and knowledge sharing.
  • Ability to work in a fastpaced environment and manage multiple priorities effectively.

Remote Work :

No

Employment Type

Full Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.