Hello
My name is Rishabh Chaturvedi. I just received details on a great job that I believe you would be a great fit for. Please take a look below and share your interest. If not interested I would also appreciate if you can recommend me someone looking for a similar role.
Job Title: Site Reliability Engineer/SRE
Location: Midtown NYC NY Hybrid
Duration: Full Time Role
Interview :: Video
Visa U.S Citizen Green Card Holder
Must have LinkedIn profile
Need Local Profiles
Role Description:
We need a senior (10 years) Site Reliability Engineer/SRE with excellent experience working with AWS (Certifications preferred). Candidates must have experience in architecting implementing and managing monitoring tools such as Prometheus/Grafana CloudWatch Splunk NewRelic and ELK in the cloud. Strong Linux OSlevel and commandline/scripting knowledge and configuration management principles as well as Experience with computer provisioning on a Cloud based platform using Terraform and/or Cloud formation.
Number of years working with:
Total IT experience:
Years working with: SRE
Years working with: AWS
Years working with: Linux
Years working with: Terraform/Cloud
Responsibilities
- Build highly available solutions across the entire SDLC stack with primary focus on an internet facing fintech site.
- Develop and maintain tools to support the development environment on MacOS and Linux tool environment with focus on improving developer productivity.
- Maintain site reliability with a focus on building highly scalable systems integrating resiliency and high availability at all levels.
- Develop software and tooling to secure and automate cloud infrastructure building software delivery capabilities with fully automatic workflows.
- Design and operation of a Kubernetes environment for container management and orchestration.
- Participate in oncall rotations to help understand the system while helping build tools for automation.
-
Qualifications
- 10 years of DevOps TechOps or SRE experience with 5 years of AWS experience
- Microservices (Docker Kubernetes) experience in a production environment strongly desired
- Strong Linux OSlevel and commandline/scripting knowledge and configuration management principles
- Working knowledge of databases such as MongoDB Postgres DynamoDB
- Experience in architecting implementing and managing monitoring tools such as Prometheus/Grafana CloudWatch Splunk NewRelic and ELK in the cloud
- Coding beyond simple scripting with strong opinions on maintainable/reusable code in Python Ruby or Java desired
- Experience with computer provisioning on a Cloud based platform using Terraform and/or Cloud formation
- Experience with distributed systems design maintenance and handson troubleshooting/debugging skills
- Exceptional analytical skills able to apply knowledge and experience in decisionmaking to arrive at creative and commercial solutions
- Experience building a Microservice based architecture
- Excellent written and verbal communication skills
- Experience in updating runbooks tools and documentation that help the team to respond to incidents proactively
- Able to design and implement complex but easily managed automated infrastructure
- A desire to share teach and learn as part of a team
- AWS certifications are a plus