drjobs Site Reliability Engineer

Site Reliability Engineer

Employer Active

drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Alexander City - USA

Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Job Description

Job Title: Site Reliability Engineer (SRE) (US Citizen Only)

Location: Remote

We are currently seeking candidates who meet the following qualifications

Responsibilities:

  • Ensure high availability performance and scalability of production systems and infrastructure.
  • Develop and implement automation for system provisioning scaling and monitoring using tools like Terraform Ansible or similar.
  • Monitor system reliability through key metrics and dashboards ensuring uptime and performance standards.
  • Respond to incidents troubleshoot issues and drive root cause analysis and longterm solutions.
  • Collaborate with software development teams to ensure reliable deployment pipelines and to improve system performance.
  • Optimize system performance and troubleshoot any bottlenecks or inefficiencies in applications and infrastructure.
  • Manage and maintain CI/CD pipelines for automated deployment and continuous integration.
  • Develop and maintain system documentation playbooks and runbooks for streamlined operations.
  • Participate in an oncall rotation for system monitoring and incident resolution.

Qualifications:

  • Bachelors degree in Computer Science Engineering or related field.
  • Experience in a Site Reliability Engineer DevOps or related role.
  • Strong experience with cloud platforms (AWS Azure or Google Cloud) and cloudnative architectures.
  • Proficiency in scripting languages such as Python Bash or Go for automation tasks.
  • Experience with containerization and orchestration tools (e.g. Docker Kubernetes).
  • Expertise in infrastructureascode tools (Terraform CloudFormation or Ansible).
  • Experience with monitoring logging and observability tools (e.g. Prometheus Grafana ELK Datadog).
  • Strong understanding of Linux/Unix system administration and networking principles.
  • Knowledge of load balancing high availability and disaster recovery techniques.
  • Experience with CI/CD pipelines and tools like Jenkins GitLab or CircleCI.
  • Excellent troubleshooting skills and ability to handle complex technical problems.

Preferred Skills:

  • Experience with distributed systems and microservices architecture.
  • Strong understanding of database systems (SQL and NoSQL) and database performance tuning.
  • Experience in version control systems like Git and experience with GitOps workflows.
  • Certification in cloud technologies (e.g. AWS Certified SysOps Administrator Azure DevOps Engineer) is a plus.
  • Experience working in an Agile/DevOps environment.
  • Federal Experience is a plus.
  • Required Security clearance.

    If you meet these qualifications please submit your application via link provided in Linkedin.
    Kindly do not call the general line to submit your application.

Employment Type

Full Time

Company Industry

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.