Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailAbout
Our client is a technological innovation consultancy with a highly qualified team. They help companies accelerate towards digital transformation.
Role
Our client is seeking a highly qualified and experienced Observability Engineer to join our platform engineering team. The ideal candidate will play a crucial role in ensuring the reliability performance and scalability of their systems by introducing and implementing specialized monitoring and alerting practices.
Responsibilities
Design implement and manage observability solutions to monitor the health and performance of our systems and applications.
Create and manage dashboards alerts and reports to provide actionable insights into system behavior and performance.
Utilize preferred tools like Datadog and OpenTelemetry to build comprehensive observability platforms.
Troubleshoot and resolve production issues related to observability and monitoring.
Develop and maintain infrastructure as code using tools such as Terraform to manage observability across multicloud environments.
Develop and maintain documentation for observability solutions including best practices and standards.
Collaborate with development and operations teams to ensure seamless integration of observability practices into the CI/CD pipeline.
Engage with internal teams to promote observability best practices and ensure consistent adoption across the organization.
Continuously evaluate and implement new observability tools and technologies to improve monitoring and alerting capabilities.
Provide mentorship and guidance to junior engineers on observability best practices and tools.
Requirements
Experience: Minimum of 4 years in modern observability with a focus on cloud technologies and automation tools.
Proven experience in designing and implementing observability solutions using open standards.
Proficiency in administration of observability platforms such as Datadog and OpenTelemetry.
Experience with container orchestration using Kubernetes.
Expertise in infrastructure as code particularly with Terraform.
Solid understanding of cloud hypervisors specifically Azure and GCP.
Previous experience in software development is highly valued.
Excellent problemsolving skills and ability to troubleshoot complex issues.
Strong communication and collaboration skills with a proven ability to work effectively in a team environment.
Ability to learn and adapt quickly to new technologies and methodologies.
Experience with other monitoring and observability tools.
Knowledge of additional cloud platforms and services.
Familiarity with DevOps practices and tools.
Understanding of security best practices and how they apply to observability.
Proven ability to work independently and as part of a team.
Full Time