About the Role:
We are seeking a Site Reliability Engineer (SRE) or Infrastructure and Data Architect with expertise in infrastructure architecture, cross-functional SRE practices, and Python development. The ideal candidate will possess strong analytical skills, particularly in data engineering, and preferably have a background in observability tools such as ELK (Elasticsearch, Logstash, Kibana) and Grafana.
Key Responsibilities:
- Infrastructure Design and Implementation:
- Architect and maintain robust infrastructure solutions to ensure system scalability, reliability, and efficiency.
- Collaborate with cross-functional teams to design and implement cloud-native and on-prem solutions.
- Site Reliability Engineering (SRE):
- Develop, deploy, and maintain automation tools to optimize operational workflows and incident response.
- Enhance system reliability and reduce manual intervention by implementing SRE best practices.
- Data Engineering and Analytics:
- Analyze large datasets to identify patterns, optimize processes, and improve decision-making.
- Design and implement pipelines for data transformation, storage, and retrieval.
- Python Development:
- Write high-quality, maintainable, and efficient Python code for automation, integrations, and system enhancements.
- Develop scripts and tools to support operational and engineering needs.
- Observability and Monitoring:
- Set up and maintain observability tools such as ELK and Grafana to monitor system health and performance.
- Implement alerting mechanisms to proactively identify and resolve system issues.
Key Requirements:
- Technical Expertise:
- Strong experience in infrastructure architecture and SRE practices.
- Proficient in Python programming and development.
- Familiarity with modern observability tools (ELK Stack, Grafana).
- Knowledge of data engineering and analytics.
- Analytical Skills:
- Ability to analyze complex systems and datasets for optimization and troubleshooting.
- Strong problem-solving skills and attention to detail.
- Experience:
- 5 years of experience in a similar role involving infrastructure, SRE, or data architecture.
- Hands-on experience with cloud platforms and DevOps practices is a plus.
- Soft Skills:
- Excellent communication and collaboration skills to work with cross-functional teams.
- Proactive mindset and ability to work in a fast-paced environment.
Preferred Qualifications:
- Background in observability platforms, especially ELK and Grafana.
- Prior experience in large-scale system design and data-intensive environments.