Job Title: Site Reliability Engineer (SRE) (US Citizen Only)
Location: Remote
We are currently seeking candidates who meet the following qualifications.
Responsibilities:
- Ensure high availability, performance, and scalability of production systems and infrastructure.
- Develop and implement automation for system provisioning, scaling, and monitoring using tools like Terraform, Ansible, or similar.
- Monitor system reliability through key metrics and dashboards, ensuring uptime and performance standards are met.
- Respond to incidents, troubleshoot issues, and drive root cause analysis and long-term solutions.
- Collaborate with software development teams to ensure reliable deployment pipelines and to improve system performance.
- Optimize system performance and troubleshoot any bottlenecks or inefficiencies in applications and infrastructure.
- Manage and maintain CI/CD pipelines for automated deployment and continuous integration.
- Develop and maintain system documentation, playbooks, and runbooks for streamlined operations.
- Participate in an on-call rotation for system monitoring and incident resolution.
Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Experience in a Site Reliability Engineer, DevOps, or related role.
- Strong experience with cloud platforms (AWS, Azure, or Google Cloud) and cloud-native architectures.
- Proficiency in scripting languages such as Python, Bash, or Go for automation tasks.
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Expertise in infrastructure-as-code tools (Terraform, CloudFormation, or Ansible).
- Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Datadog).
- Strong understanding of Linux/Unix system administration and networking principles.
- Knowledge of load balancing, high availability, and disaster recovery techniques.
- Experience with CI/CD pipelines and tools like Jenkins, GitLab, or CircleCI.
- Excellent troubleshooting skills and ability to handle complex technical problems.
Preferred Skills:
- Experience with distributed systems and microservices architecture.
- Strong understanding of database systems (SQL and NoSQL) and database performance tuning.
- Experience with version control systems like Git and with GitOps workflows.
- Certification in cloud technologies (e.g., AWS Certified SysOps Administrator, Azure DevOps Engineer) is a plus.
- Experience working in an Agile/DevOps environment.
- Federal experience is a plus.
- Security clearance is required.
If you meet these qualifications, please submit your application via the link provided on LinkedIn.
Kindly do not call the general line to submit your application.