A cloud engineer is responsible for the reliability and performance of the digital platform that we run for our clients. You will be working with various teams including the development team/s to manage the hosted or cloud native applications in an efficient and secure manner.
Your responsibilities will include
- To collaborate with specialist teams within the organisation and thirdparty suppliers to deliver outstanding results for our clients
- Actively participate in our shiftleft strategy via tools automation process and selfservice functions to empower the business and other IT teams
- Manage business stakeholders through clear and accurate communication
- Configure and finetune hybrid and cloud native systems
- Monitoring and managing systems within the scope of the service
- Maintaining data integrity and access control by making use of operational tools
- Monitoring cloud costs and explore cost optimization strategies
- Plan and take preventive actions to mitigate risk of service disruptions
- Planning and patching systems based on generated software vulnerability reports
- Responding to alerts and escalation of incidents with the aim to analyze and resolve them
- Execute root cause analysis into P1/P2 incidents
- Fulfilling service requests raised
- Deploy release packages from a CI/CD solution
- Ensure training material and technical documentation on new and existing operating process are kept uptodate
- Reporting key metrics to the service manager
- Participate and contribute to our Agile way of working including planning retrospectives and showcases
- Document and track issues and reasonable steps taken within the ticket management system
- Provide afterhours support within a defined roster
Who you are
- You must have a strong technical aptitude and an organized process driven work ethic.
- You will have strong written and verbal communication skills and a track record for providing high customer satisfaction.
- At least 3 years experience in an Online Systems/Web application support
- Exposure to APM monitoring alerting and identifying trends with dashboards
- Experience with installation configuration managing and operating Linux systems
- Knowledge of best practices and IT Operations in an alwaysup and always available service
- Have a good understanding and experience working with one of three major Public cloud providers (AWS MS Azure GCP)
- Hands on experience with Docker and Kubernetes will be an added advantage
- Experience in querying logs using log aggregation tools like ELK SPLUNK etc.
- Working knowledge of SQL and NoSQL database like MySQL/Oracle/PostgresSQL/DynamoDB/MongoDB/Cassandra/Redis/Memcache
- Has experience working with JIRA and Confluence
- Prior working experience in an Agile environment will be an advantage
- An understanding of ITIL practices would be ideal
- You must be willing to work in shifts.