Collaborate with engineering to maintain 24/7 service uptime, using your expertise to build self-healing, automated systems that proactively address potential issues. (Don't worry, we have a fair on-call rotation and compensate with time off!)
Define and monitor Service Level Objectives (SLOs) and mission-critical metrics, ensuring our systems meet the highest reliability standards.
Develop incident response playbooks and build robust monitoring and alerting capabilities to ensure swift and efficient issue resolution.