We are enabling the transition to software-defined vehicles, supported by electrified and intelligently connected architectures, which will combine to power the future of mobility.
We're seeking a highly motivated technical lead to establish a Cloud Site Reliability Engineering practice within the Active Safety and User Experience division, which is responsible for building the next generation of autonomous driving solutions for some of the biggest car manufacturers in the world.
In this role, you will provide guidance on best practices, tools, and processes for a world-leading Cloud Site Reliability Engineering (SRE) function and lead the application of SRE solutions to critical workloads across CI/CD, simulation, and AI.
ROLES AND RESPONSIBILITIES
Design, build, and operate cloud-based software engineering tools that are elastic, resilient, and secure
Design integrations with industry-leading observability platforms and implement AI-based alerting systems.
Implement cybersecurity best practices for threat detection and multi-platform identity management.
Identify and resolve performance bottlenecks and other issues that affect the reliability and scalability of our systems
Develop and maintain automation scripts and tools to streamline operations
Collaborate with development teams to ensure that our systems and services meet their needs and are easy to use
Work with other SREs to design and implement solutions for monitoring, logging, and alerting of our cloud infrastructure and services
Continuously evaluate and improve our cloud infrastructure and processes to ensure that they are efficient, scalable, and secure
Actively participate in project team meetings to ensure standard practices are followed and any concerns are quickly addressed.
Keep up to date on the latest industry trends and technologies.
EXPERIENCE / SKILL REQUISITES:
Bachelor's degree in Computer Science or a related field, or equivalent experience
5 years of experience in a DevOps, MLOps, Cloud Site Reliability Engineering, or similar role (e.g. Platform Engineering, Cloud Operations)
Proficiency in one or more programming/scripting languages (Python, Go, Perl, etc.)
Strong experience with cloud-based infrastructure platforms such as AWS, GCP, or Azure
Experience with containerization technologies such as Docker and Kubernetes
Good experience with one or more Infrastructure as Code technologies, e.g. Ansible, Terraform, CloudFormation
Working knowledge of monitoring, logging, and alerting tools such as Datadog, Prometheus, Grafana, and Splunk
Strong problem-solving and troubleshooting skills
Excellent communication and collaboration skills
Excellent problem-solving and debugging skills, and experience with Agile development practices
Good team player who follows Agile development methodologies
Good interpersonal and communication skills; English language proficiency is a must.
Ability to learn and adapt to new technologies, and a passion for continuous improvement.
Full Time