drjobs
DevOps Lead
drjobs DevOps Lead العربية

DevOps Lead

Employer Active

drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs

Job Location

drjobs

New Delhi - India

Monthly Salary

drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Job Description

Were a European company with a mission to revolutionize the way Brazilian customers engage with financial and entertainment services. We are a company that takes Agile seriously and we give a lot of autonomy to leaders and teams to execute the best strategy for the company.

We are passionate about what we do adept to simplicity and eager to meet people who have this same vision so we can build together!

Our mission:

Provide technology and services to create unique digital entertainment experiences.

Position Overview:

We are seeking an experienced DevOps Lead to provide support for our enterprise infrastructure. In this role you will be responsible for supporting our systems including participation in an oncall rotation. Beyond support you will collaborate on improving incident response enhancing observability and collaborating with technical teams to strengthen the overall resilience and improve the performance of our workloads. Your work will involve close cooperation with crossfunctional teams bridging the gap between development and operations and cultivating a culture of collaboration and ongoing enhancement.

Responsibilities:

As a key member of our team your responsibilities will include:

  • Providing crucial support for production systems and playing a vital role in issue triage.
  • Actively participating in our oncall rotation promptly responding to availability incidents and assisting service engineers.
  • Establishing comprehensive monitoring and alerting systems to detect and respond to issues in realtime.
  • Monitoring systems to ensure adherence to system SLO/SLA reviewing and following up on production incidents.
  • Collaborating with crossfunctional teams to enhance incident response and resolution times conducting thorough postmortems for continuous improvement.
  • Proactively identifying and addressing system reliability issues performance bottlenecks and implementing preventive measures to minimize downtime.
  • Working closely with engineering teams to identify and address system limitations.
  • Participating in the Change Management process by reviewing RFCs to ensure adherence to the Definition of Done and actively supporting software and hardware deployments.
  • Championing automation in workflows and tools to improve the reliability and scalability of services.
  • Developing and implementing comprehensive monitoring and alerting systems using Datadog for realtime issue detection and response.
  • Writing and reviewing code creating documentation and troubleshooting distributed systems.
  • Collaborating with teams to optimize incident response resolution times and conducting postmortems for ongoing improvement.

What you need:

  • 3 years of handson experience in DevOps Site Reliability Engineering (SRE) Security roles.
  • 1 years of experience in leading teams or a similar senior role.
  • Proven track record of building and optimizing CI/CD pipelines to streamline software delivery processes.
  • Mandatory expertise in Azure demonstrating a comprehensive understanding of cloud infrastructure.
  • Essential experience in managing and monitoring Kubernetes applications
  • Strong troubleshooting skills in networking and infrastructure issues ensuring uninterrupted system operations.
  • Prior experience in successfully managing customerfacing systems in a 24/7 environment including handling escalations with a focus on customer satisfaction.
  • Proficiency in scripting and automation using Terraform
  • Familiarity with triaging and escalation policies/protocols using OpsGenie or PagerDuty.
  • Handson experience with monitoring and observability tools such as Datadog Prometheus Grafana and the ELK stack.
  • Excellent communication and documentation skills to facilitate clear and effective collaboration within the team.
  • Any additional experience with Cloudflare monitoring is considered a significant plus.

What we offer:

  • Startup environment: challenging collaborative fast and fun where you will have the opportunity to learn bring innovation and interact with colleagues from different nationalities
  • Autonomy: freedom for you to give ideas and create improvements in processes
  • Competitive salary package
  • A remotefirst culture;
  • 11s culture and feedback loops
  • Access to free online courses to foster your personal growth;



Remote Work :

No

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.