Job Title: Chaos Engineer
Contract Duration: 12 months with potential for extension
Location: 100% remote
Visa Restrictions: None (however H1Bs must be held by your firm)
Top 3 Must Have Skills:
- 4+ years of experience in chaos engineering, reliability engineering, or a similar role
- Strong proficiency in Golang and experience building chaos tools using Golang, GraphQL, and Elasticsearch
- In-depth understanding of distributed systems, microservices architecture, and containerization technologies (such as Docker and Kubernetes)
Position Description:
The Chaos Engineer will test computer systems to ensure they can withstand unexpected disruptions.
Approach will focus on the concepts underlying chaos theory, specifically random and unpredictable
behavior. The goal of the Chaos Engineer is to build confidence in the systems' capabilities to withstand
turbulent conditions and unexpected disruptions in production.
What You'll Get to Do:
Design, develop, and maintain chaos engineering tools using Golang, GraphQL, and Elasticsearch
to inject faults and simulate failure scenarios in systems.
Collaborate with cross-functional teams to identify potential weaknesses in our infrastructure
and applications and develop mitigation strategies to prevent outages and performance
degradation.
Develop and implement chaos experiments to validate the effectiveness of our systems under
various failure conditions.
Work closely with the engineering, operations, and QA teams to ensure that our chaos
engineering practices are aligned with the overall objectives of the organization.
Analyze system performance and incident data to continuously improve the reliability and
resilience of the systems.
Participate in on-call rotations to provide support for production incidents and ensure the
smooth operation of services.
Stay current on industry trends and advancements in chaos engineering, and continuously
explore opportunities to enhance both tools and processes.
You'll Bring these Qualifications:
4+ years of experience in chaos engineering, reliability engineering, or a similar role.
Bachelor's degree in Computer Science, Information Technology, or related discipline.
Strong proficiency in Golang and experience building chaos tools using Golang, GraphQL, and
Elasticsearch.
In-depth understanding of distributed systems, microservices architecture, and containerization
technologies (such as Docker and Kubernetes).