The individual will be responsible for monitoring our exchange service status refining deployment pipeline monitoring troubleshooting and identifying the root cause of issues.
Focus on ensuring the stability of the service and helping devteam to deploy features to production.
Responsible for handling and solving issues when service goes wrong.
Responsible for designing a better deployment pipeline.
Build tools to monitor systems and identify issues.
Candidate Requirements:
Fluent in Chinese (MUST)
Terraform
Familiar with GCP AWS or other cloud services.
Have experience in CICD Workflow.
Familiar with Docker kubernetes and have experience in using Kubernetes to manage productiongrade cluster.
Familiar with mysql mq and redis.
Familiar with monitoring system prometheus be able to customize grafana dashboard.
Familiar with logging system ELK or EFK.
Coding ability: python or golang.
24/7 oncall when there are urgent issues to handle and must be handled in a timely responsible manner.
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.