The WorkWave Team is seeking an experienced Lead / Senior Lead Site Reliability Engineer (SRE) to drive reliability, scalability, and operational excellence across our cloud-based infrastructure. This role is crucial in ensuring high availability, monitoring, and streamlined deployment processes across various environments, including AWS and hybrid systems. The Lead / Senior Lead SRE will work closely with cross-functional teams to optimize system reliability and efficiency, actively contributing to a robust infrastructure that supports business growth.
Responsibilities
Design, manage, and optimize scalable infrastructure across cloud environments, with a focus on reliability, availability, and performance.
Implement comprehensive monitoring and observability systems to ensure proactive issue detection and resolution.
Lead incident response for critical infrastructure issues across cloud platforms, drive root cause analysis, and implement corrective measures to minimize recurrence.
Collaborate with cross-functional teams to create efficient, automated CI/CD pipelines that support cloud, hybrid, and on-prem deployments, enabling smooth and reliable delivery.
Apply IaC best practices across environments, using tools that ensure consistent provisioning, configuration, and management of resources in cloud environments.
Ensure new services meet reliability and scalability requirements across all environments before deployment. Conduct capacity planning and performance tuning to adapt to business needs.
Develop and maintain comprehensive documentation for infrastructure, deployment workflows, monitoring configurations, and incident management procedures, providing clear guidance across teams.
Provide mentorship and technical guidance to team members, sharing knowledge of best practices in reliability engineering and infrastructure management.
Research and integrate new tools and technologies to improve the efficiency, scalability, and resilience of our SRE processes across cloud and hybrid infrastructures.
Qualifications :
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
4-5 years of experience in Site Reliability Engineering or DevOps, with a focus on multi-environment infrastructure and cloud platforms.
Strong track record of managing and optimizing infrastructure in production environments, including incident management and system troubleshooting.
Proficient in CI/CD pipeline automation and infrastructure-as-code practices across cloud and hybrid environments.
Skills and Competencies
Remote Work :
No
Employment Type :
Full-time