Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via email
Job Title: Site Reliability Engineer
Location: 2 days from San Jose CA office 3 days remote
Duration: Longterm contract
Must have: Need Hands on experience in Kubernetes and NFS.
Job Responsibilities:
Creating and supporting automation scripts (shell/ansible/python) for infrastructure deployments validations and monitoring to improve operational tasks.
Scheduling monitoring scripts using cron and airflow.
Monitoring using tools including Dynatrace Apica Grafana etc.
Database handling
Build CICD pipelines
Incident handling and problem management
Mandatory Skills:
Experience in Ansible/ Python
Monitoring Tools Dynatrace/Apica/Grafana
Required Experience:
14 plus years of IT Infrastructure experience
Extensive experience working with Linux Flavors like RHEL/centos OS shells filesystems and utilities
Experience in programming languages like Python ansible
Knowledge of distributed computing and experience working with container orchestration frameworks including onprem and rancher Kubernetes and good knowledge on Kubernetes objects
Experience working with Storage ONTAP is preferable: volume aggregates backups DR planning
Experience scheduling monitoring scripts using Cron and Airlfow
Experience with monitoring tools including Dynatrace Apica Grafana etc
Database knowledge including SQL and NoSQL DBS
Experience building CICD pipelines (preferred)
Cloud platform knowledge (specifically AWS) is required
Full Time