Role1: Sr. DevOps SRE Engineer with Monitoring tools experience
Location: Dallas, TX
Duration: Long Term
Required Skills:
- Must have 10+ years overall experience and should have prior experience as an SRE engineer
- Installing, Integrating, maintaining, and monitoring tools like Splunk, New-relic, Prometheus, Grafana
- Setting up the dashboards and monitoring, a good understanding of going thru logs and understanding the issues
- Good Understanding of AWS Cloud Services (EC2, S3, SNS, SQS, Lambda, VPC, ALB), Docker, Kubernetes, Tomcat servers, and other application servers
- Excellent knowledge of Jenkins, GitLabs CI/CD, Java Build (Maven / Gradle), NPM Builds
- Complete understanding of the DevOps process is an advantage
- Experience with Python / Shell scripting is an advantage
- Familiarity with the various technical landscapes of multi-channel business architectures
- Experience in all phases of software development, including design, configuration, testing, debugging, implementation, and support of large-scale, business-centric, and process-based applications.
Roles and Responsibilities:
- Build software to help operations and support teams with monitors and dashboards
- Participating in On-call support to clients and maintaining the support tickets without escalations
- Monitoring availability and taking a holistic view of the system's health
- Fixing support escalation issues, Documenting knowledge, and Conducting training
- Provide primary operational support and engineering for multiple large distributed software applications
- Ensure the performance, quality, and responsiveness of the applications.
- Ability to understand the logs and login to AWS servers/environments quickly and provide the steps to solve the issues
- Good Verbal and Communication is a plus.
- Should be Self-Driven and able to learn new Technologies.