Site Reliability Engineer

This job posting is outdated and the position may be filled.
Job Location: India

Monthly Salary: Not Disclosed

Vacancy: 1

Job Description

Role: Site Reliability Engineer
Experience: 5-10 Years
Location: Remote

About the role:

Your primary role will be to ensure that customers' infrastructure is consistently available, meeting contractually agreed SLA targets. You will be supporting always-available platforms, building new systems, upgrading and patching existing ones, and supporting application deployments. You will also act as a point of escalation for the regional support centres.

You will serve as a technical liaison between the customer and the engineering team, working to ensure that the criticality of each problem is fully understood and that the problem is satisfactorily resolved in a time-sensitive manner. By handling technical problems with a high degree of professional acumen, you will deliver a positive problem-solving experience to customers who subscribe to the remote environment management services.

Responsibilities:

  • Support an always-available, cloud-based platform
  • Support application deployments, build new systems, and upgrade and patch existing ones
  • Ensure that there is adequate monitoring and reporting to be able to identify problems and resolve/escalate them as appropriate
  • Participate in the building of tools and processes to support the infrastructure and improve automation for manual elements
  • Identify operations improvements to strengthen organisational processes and practices and enhance support quality and customer satisfaction
  • Maintain accurate and up-to-date documentation on the current infrastructure and system support documents such as run books
  • Ensure that mission-critical data is backed up as required
  • Resolve critical/complex issues where analysis of situations or data requires an in-depth evaluation of variable factors
  • Proactively monitor customers' technical issues and drive support requests and escalations to a satisfactory resolution
  • Escalate and communicate any service-related technical incidents, ongoing service interruptions, and/or problems to the relevant Service Delivery/Support personnel, and ensure that any fixes are understood, documented, and communicated to team members
  • Resolve incidents in a positive and supportive manner and work effectively with outside vendors to provide high-quality and responsive services to Flex clients
  • Work closely with the engineering teams to help resolve bugs and deliver solutions in a timely fashion
  • Learn on the job and explore new technologies with minimal supervision
  • There will be some need for on-call work and/or weekend work to cover live events

Key Skills:

  • Minimum of 5 years' experience with the Linux operating system (ideally with some form of Linux administration certification)
  • Experience in scripting and server-side programming languages (Shell, Python, Java, Go)
  • Understanding of protocols/technologies like HTTP, SSL, SQL, JSON
  • Understanding of application clustering/load balancing concepts and technologies
  • Experience administering a Java application server (e.g. Apache Tomcat)
  • Experience with Amazon Web Services (AWS), GCP, or Microsoft Azure technologies
  • Experience automating deployments using tools such as Terraform, Ansible, Docker, Kubernetes, and Rancher is a big plus
  • Experience administering database and messaging systems like MariaDB, MongoDB, RabbitMQ, and Redis is a big plus
  • Highly motivated in customer support services and client satisfaction
  • Experience in troubleshooting/debugging complex technical issues
  • Excellent written and oral communication skills in English
  • Must be able to articulate technical solutions to all audiences
  • Ability to work independently and as a team player


infrastructure,cloud-based product development,aws,linux,python,go

Employment Type

Remote

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting