Job Description
IT consulting firmisseeking a highly qualified and experienced Observability Engineer to join their platform engineering team. The ideal candidate will play a crucial role in ensuring the reliability performance and scalability of our systems by introducing and implementing specialised monitoring and alerting practices.
Responsibilities:
- Design implement and manage observability solutions to monitor the health and performance of our systems and applications.
- Create and manage dashboards alerts and reports to provide actionable insights into system behavior and performance.
- Troubleshoot and resolve production issues related to observability and monitoring.
- Develop and maintain infrastructure as code.
- Develop and maintain documentation for observability solutions including best practices and standards.
- Collaborate with development and operations teams to ensure seamless integration of observability practices into the CI/CD pipeline.
- Engage with internal teams to promote observability best practices and ensure consistent adoption across the organization.
- Provide mentorship and guidance to junior engineers on observability best practices and tools.
Requirements
- Experience: Minimum of 4 years in modern observability with a focus on cloud technologies and automation tools.
- Proven experience in designing and implementing observability solutions using open standards.
- Proficiency in administration of observability platforms such as Datadog and OpenTelemetry.
- Experience with container orchestration using Kubernetes.
- Expertise in infrastructure as code particularly with Terraform.
- Solid understanding of cloud hypervisors specifically Azure and GCP.
- Previous experience in software development is highly valued.
- Excellent problemsolving skills and ability to troubleshoot complex issues.
- Strong communication and collaboration skills with a proven ability to work effectively in a team environment.
- Ability to learn and adapt quickly to new technologies and methodologies.
- Knowledge of additional cloud platforms and services.
- Familiarity with DevOps practices and tools.
- Understanding of security best practices and how they apply to observability.
- Proven ability to work independently and as part of a team.