Requirements
Skills
Must Have
- Experience in REST and Web API support
- Experience in cloud-based application support
| Skill | Category |
| Linux & Shell Scripting | 1 |
| Monitoring Tool (Splunk/Dynatrace or other) | 1 |
| ITIL/ITSM | 1 |
| Troubleshooting | 1 |
| PL/SQL, SQL | 1 |
| Jenkins | 2 |
| CI/CD | 2 |
| Groovy Scripting/YAML | 2 |
| Linux Shell Scripting & Git/Bitbucket | 2 |
| Ansible/Chef | 2 |
Site Reliability Engineering:
o Serve as the primary contact responsible for ensuring application scalability, performance, and resilience.
o Practice sustainable incident response and blameless postmortems while taking a holistic approach to problem solving and optimizing time to recover.
o Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability.
DevOps/Automation:
o Tackle complex development, automation, and business process problems. Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.
o Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead in DevOps automation and best practices.
o Increase automation and tooling to reduce toil and manual intervention.
ITSM Practices:
o Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns.
The ideal candidate will have experience in many of these areas:
BS degree in Computer Science or a related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
Coding or scripting exposure.
Appetite for change and pushing the boundaries of what can be done with automation. Be curious about new technology, infrastructure, and practices to scale our architecture and prepare for future growth.
Experience with algorithms, data structures, scripting, pipeline management, and software design.
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Interest in designing, analyzing, and troubleshooting large-scale distributed systems.
Willingness and ability to learn, to take on challenging opportunities, and to work as a member of a matrix-based, diverse, and geographically distributed project team.
Ability to balance doing things right with fixing things quickly. Flexible and pragmatic while working towards improving the long-term health of the system.
Comfortable collaborating with cross-functional teams to ensure that expected system behaviour is understood and monitoring exists to detect anomalies.
Preferred Qualifications:
Coding experience in one or more of the following: C, Java, Python, Go.
Experience with algorithms, data structures, scripting, pipeline management, and software design.
Experience working across development, operations, and product teams to prioritize needs and build relationships is a must.
Experience in an SRE role or related field.
Background in cloud-native tooling and orchestration technologies (Kubernetes preferred).
Experience with monitoring tools such as Splunk or Dynatrace.
Experience with Java, J2EE, web services (SOAP/REST), and Spring/Spring Boot is a plus.
Experience in production support environments and ITIL processes.
Experience with industry-standard CI/CD tools like Git/Bitbucket, Jenkins, Maven, Artifactory, Groovy, and Chef. Experience designing and implementing an effective and efficient CI/CD flow that gets code from dev to prod with high quality and minimal manual effort is required.
Experience developing and maintaining cloud solutions on Azure, GCP, or AWS in accordance with best practices.
Understanding of:
o Client-server relationships
o Network concepts (Layer 1 to Layer 3)
o Stack trace analysis (TCP dumps, heap dumps, CPU/memory analysis, thread dumps).
o Load balancers and application firewalls.
o Operating System navigation.
o Logging and monitoring methods, standards, and tools.
o High availability and business continuity planning
o Caching concepts
o Configuration management