Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailJob Title: SRE DevOps Engineer
Duration: FULL-TIME
Responsibilities:
Supporting the Ops teams to diagnose and provide solutions for systemic issues in the platforms
Solve Operational problems
Work on-call shifts responding to alerts
Providing oversight into mission critical projects
Conduct operations workshops and increase operational effectiveness within the organization
Design and implement operational processes, deployment guidelines, and feedback loops to ensure successful deployment/operations of Software Robotics & Citizen development technology platforms
Bring service enhancements to the Software Robotics & Citizen development Services by means on standardized & improved monitoring and reporting capabilities
Perform monthly analytics of historic Ops tickets to enable proactive issue identification and enabling of issue avoidance or self-healing capabilities
Identify, develop, test, debug and implement improvements using Digital Transformation techniques to optimize operational efficiencies for Level 3 Ops functions and improve platform reliability
Persuade and influence others through strong and comprehensive communication and diplomacy skills
Set and maintain acceptable performance and availability thresholds (Service Level Objectives) by working closely with Ops and system engineers to drive adoption of modern reliability practices like SLI, SLOs, error budget policies, actionable alerts, self-healing, proactive capacity management and change/release management practices.
Monitor and help stabilize services in production. Standardize & Improve monitoring and reporting capabilities, dashboard creation using tools like Splunk, AppDynamics, etc
Perform Compliance Reviews of Vulnerabilities/EOVS
Engage on Gating/Production Readiness Review(PRR) for new services/platforms/products
Participate on CoB/DR Drills
Standardize change and release management
Proactively perform demand forecasting, capacity planning and anomaly detection
Review and engage on release of new products/services from testing through go-live
Standardize change and release management for Platforms
Contribute towards architecture reviews, making sure product designs are technically sound with contingencies thought through.
Technical documentation focused towards driving post-mortems, technical/process guidance, automation, operational improvement, etc.
Review current platforms & application architecture and engage with server & product engineers for setting up SDI for the existing platform with focused drive to enable new platforms.
Qualifications:
10 years of relevant Level 3 SME or Engineering experience in platforms covering VMWare Horizon, RPA and low-code/no-code automation products with 3years of recent progressive experience as an SRE
Experience in working with programming languages (Python, Java, C, JavaScript)
Working knowledge of AWS, containers (Kubernetes, Docker, etc) is preferred
Working knowledge of CI/CD Tools (BitBucket, Jenkins, RLM, uDeploy, Travis)
Experience in Automations (using Terraform, Chef, Ansible, Unix Shell Scripting, PowerShell scripting, Python, PowerBI, products like Automation Anywhere, Selenium, Appian, and other RPA/low-code/no-code applications)
Expert hands-on scripting experience with Linux and windows server platforms
Experience with enabling monitoring and observability (Appdynamics, SysTrack, Splunk PagerDuty, New Relic, Datadog) and usage of schedulers (Autosys, etc)
Able to understand core infrastructure issues and guide ops on troubleshooting
Platform (Middleware) support experience on RPA and low-code/no-code platforms like Automation Anywhere, Work Fusion, Appian, Boardwalk, Selenium, or other automation technologies.
Able to understand RPA components and its integration with Horizon
Proficiency in MS Office (Word, Excel, Powerpoint)
Has worked with and experienced in using ITSM tools (ServiceNow or others)
IT Support based qualifications / training
Critical Competencies:
Strong verbal and written communication
Learn and able to adapt to changing environment. Willing to learn and have a can-do attitude
Ability to perform complex and varied assignments in support of the team's scope of work
Analytical thinking and experience with data analysis tools and methodologies
Planning, multi-tasking, and prioritization skills
Team-oriented and collegial approach to addressing challenges
Strong people, process, and business focus
ITIL certified
Being part of a team, you will be expected to work un-supervised and manage your workload effectively
Strong communication skills
Good interpersonal and teamwork skills
Full Time