Senior Site Reliability Engineer SRE

Wakapi

Posted on : 19-03-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

Mendoza - Argentina

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 19-03-2025

Job Description

The Role:
We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to join our Platform
Engineering team. The ideal candidate will have a strong understanding of DevOps and
Service Level Management (SLM) metrics. As well as experience working in eventdriven
infrastructure projects using tools like Terraform New Relic Kubernetes AWS and
Kafka.
As a representative of Platform Engineering you will play a critical role working with other
engineering teams to ensure our platform infrastructure tooling fulfils their needs and has apositive impact on Developer Experience.
As well as helping them determine the right settings and thresholds for triggering alerts orautomations on their applications.

Responsibilities:

Scalability and High Availability: Design implement and maintain scalable and
highly available systems using load balancing autoscaling patterns canary
releases and bluegreen deployments.
Monitoring Logging and Observability: Develop and maintain monitoring and
logging dashboards using tools like New Relic Prometheus Grafana and Datadog.
Ensure observability through metrics tracing log aggregation and alerting.
Alerting and Automation: Help teams determine the right settings and thresholds
for triggering alerts or automations on their applications. Understand that each
application has different performance requirements such as varying acceptable
response times or resource constraints.
System Performance and Reliability: Monitor optimize and ensure system
reliability and performance using tools like New Relic to:
o Apply DORA metrics to measure and improve development and operational
performance.
o Ensure compliance with SLM metrics like SLAs SLOs and SLIs by tracking
uptime response times and resolution times.
Resiliency: Implement and advocate for Chaos engineering practices to ensure
system resiliency.
Collaboration: Work with crossfunctional teams to enhance platform engineering
practices and gathering the right information for metrics analysis.

Requirements:
Proven experience working with InfrastructureasCode tooling like Terraform
for infrastructure management.
Strong understanding of scalability and high availability patterns including load
balancing autoscaling canary releases and bluegreen deployments.
Strong understanding of DevOps metrics (like DORA) and their application in
measuring and improving development and operational performance.
Strong understanding of Service Level Management (SLM) metrics (like SLAs
SLOs and SLIs). And their importance in defining monitoring and ensuring
compliance from the services bound to them.
Experience with monitoring logging and observability tools like New Relic
Prometheus Grafana and Datadog.
Experience working with Kafka and improving performance of eventdriven realtime data processing and streaming projects and architectures.
Familiarity with tooling used for SLM DevOps and DORA metrics like Apache
Dev Lake Grafana and New Relic.
Experience working with AWS Azure or GCP for cloud infrastructure management.
Experience working with CI/CD pipeline tools such as GitHub Actions Jenkins
GitLab CI or similar.
Analytical Skills. Ability to analyze and interpret metrics to drive improvements.
Strong communication skills to effectively collaborate with team members and
stakeholders.
Nicetohaves
Familiarity with ObservabilityasCode tooling and practices.
Familiarity with Chaos engineering practices for system resiliency.

C LS K

Wakapi Web

Employment Type

Full Time

Company Industry

Key Skills

Apply Now

About Company

Wakapi

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Senior Site Reliability Engineer SRE

Wakapi

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Senior Power Engineer

Senior data engineer Remoto

Senior data engineer - Remote

Senior Software Engineer USA - 100 Remote LATAM talent exclusively

Senior Testledare

Lead Software Engineer Java

Cloud Engineer GCP - UK Remote

Lead Software Engineer React Full-stack