Provide L3 support for a private cloud including oncall rotation
Work closely with the internal engineering team and provide input on testing of new component releases and infrastructure upgrades as well as performance capacity and monitoring
Create and improve processes for support including training documentation customer engagement incident problem and change management
Contribute to internally developed CLIs and APIs to automate SREs activities and platforms automation
Work together with L2 teams and other L3 team members internationally.
Qualifications:
5 to 10 years of relevant experience in platforms maintenance/development
Experience in a least one programming language
Experience with maintaining complex production systems with cloud and legacy technologies
Proven Kubernetes and Docker experienceKnowledges of monitoring stack (Grafana Prometheus Splunk) usage
Strong organizational skills and ability to manage multiple tasks and highpressure situations for outage resolution
Communicate effectively with various user groups e.g. developers and engineers as well as remote team members
change management,splunk,cloud technologies,automation,docker,customer engagement,programming language,prometheus,grafana,l3 support,incident management,kubernetes,problem management