-
Five or more years of experience as a full stack Linux Systems/Application Support Engineer
-
Familiarity with Tomcat, MySQL/Percona support and SQL queries, RHEL, RabbitMQ, Elasticsearch, nginx, haproxy
-
Understanding of HA design, cross-site replication, local and global load balancers, etc
-
Experience with Security Hardening & Vulnerability/Compliance, OS patching
-
Strong knowledge of performance monitoring, metrics, capacity planning, and management
-
Hands on Scripting & Programming - REACT, Java, JavaScript, Python, bash, Ansible, YAML, etc.
-
Understanding of data parsing and regex syntax
-
Experience with application onboarding - capturing requirements, understanding data sources, application relationships, manage meetings, training, etc
-
Familiarity with Splunk, HP OMi/Infrastructure agents, APM/New Relic, Oracle OEM, Catchpoint, syslog events, SNMP events, Zabbix, ServiceNow, etc
-
Understanding of CMDB and asset relationships, topology maps, and alert enrichment
-
Develop new processes to prevent problem recurrence and automated recoveries
-
Strong data analytics and centralized reporting (ex. Grafana dashboard integration)
-
Identify opportunities to improve architecture/engineering practices
-
Strong skills in creating documentation - engineering runbooks, support procedures, user onboarding and support documentation
-
Familiarity with Confluence and JIRA
-
Mentor staff to replace manual processes with automation
-
Collaborate across all levels of the organization to drive the SRE model
-
Familiarity with supporting enterprise container based platforms
-
Data ingestion & enrichment from various sources, webhooks, and REST APIs with JSON/XML payloads
-
Strong knowledge of Unix/Linux based systems, and experience troubleshooting applications running on these systems
-
Experience with software design lifecycle, including testing, implementation, and delivery
-
Ability to apply a systematic approach to solve problems with a sense of ownership and drive through resolution
-
Ability to recognize and identify root cause events in order to assist application teams and train the AI/ML
-
Effective communication skills with the ability to articulate technical details to diverse audiences