Job Requirements:
Note: Shared Infrastructure Below indicate (RancherK8s Containers VMs Kafka Apache Flink/Spark EDB PostgreSQL Redis Cache or equivalent S3/Cloudian Apigee etc.) Strong experience with Python scripting
Responsibilities:
- Design implement and maintain shared infrastructure services for AWS and OnPremises VMware environments.
- Collaborate with crossfunctional teams to establish connectivity and integration between cloud and onpremises resources.
- Design and implement data replication sync and failover strategies to ensure high availability and disaster recovery across environments.
- Develop and maintain CI/CD pipelines for automated deployment and configuration management.
- Monitor and optimize shared services performance scalability and reliability.
- Troubleshoot and resolve issues related to shared infrastructure services and replication mechanisms.
- Document configurations processes and procedures related to shared services and replication engineering.
- Stay updated with industry best practices and emerging technologies related to cloud and onpremises infrastructure.
- As a DevOps engineer setup CI/CD pipeline for application deployment and infra components deployment using terraform
- Work in the DevOps team to build new shared infrastructure services for on premises failover environment: RancherK8s Containers VMs Kafka Apache Flink/Spark EDB PostgreSQL Redis Cache or equivalent S3/ Cloudian Apigee etc
Skills required
- 8 years experience with global enterprise networking operations data center management Infrastructure Services in AWS and VMware you could be a great fit for this role. Strong experience with Python scripting
- Relevant certifications such as AWS network certification or VMware Network and Security certifications
- 8 years of experience in designing network and workload isolation network segmentation network security policy definition and network standards (DNS & Subdomain routing etc.)
- 8 Years of compute network storage and security services in both AWS and VMware Environments
- 8 years experience in developing and executing strategies for improving security and reliability across all systems and services
- 8 Years of experience in setting K8s using Rancher AWS EKS or similar services. Ability to deploy CIS CSI Ingress controller Reverse Proxy and Other instrumentation around Kubernetes clusters
- 5 years of experience in shared infrastructure services in AWS and VMware environments such as Kafka Stream Data Pipes (Flink/Spark/Kinesis) Redis Cache Apigee (API gateway).
- Strong understanding of security principles practices and technologies including encryption authentication access control and network security. Proven experience with reliability engineering practices such as monitoring alerting incident response and performance tuning.
- Proven experience with reliability engineering practices such as monitoring alerting incident response and performance tuning.
- Proven experience with DevOps practices such as CICD and Infrastructure as a code.
- Nice to have: Proficiency in scripting and automation tools such as Python Bash Ansible or Terraform.
- Nice to have: Experience in implementing Network and infrastructure compliance with financial industry standards and regulatory requirements
- Previous experience in Implementing and maintaining monitoring alerting and incident response processes. Optimize system performance and automate repetitive tasks to improve efficiency
- Experience with DevOps practices and tools such as CI/CD pipelines GitOps and infrastructure as code.
- Experience with cloud platforms (AWS Azure GCP) and container orchestration systems (Kubernetes Docker).
- Excellent problemsolving skills and the ability to work under pressure in a fastpaced environment.
- Strong communication and interpersonal skills with the ability to influence and inspire teams.