- 2 Sr. Level SRE Engineers/ Architects
Must Haves:
- 5 years of experience in an SRE or similar role with a focus on trading systems or financial services.
- Expertise in monitoring tools (Dynatrace Splunk Grafana) and Kubernetes.
- Strong understanding of DevOps methodologies and tools.
- Proven track record in incident management root cause analysis and implementing resilient system designs.
- Experience with deployment strategies (blue/green canary etc.) and managing complex distributed systems in a cloud environment.
- Excellent problemsolving communication and teamwork skills.
- Solid understanding of onprem and hybrid cloud infrastructure (VMware Linux Windows Azure) and container orchestration (Kubernetes Docker).
- Fairly good understanding of MongodB Kafka and IBM mainframe DB2 (preferred)
- Conversant with WebLogic Java technology stacks including spring boot (Not Expert level skillset)
Nicetoknow:
- Dynatrace Basics:
- Can you explain how Dynatrace OneAgent works and how it collects data from monitored applications
- Splunk Fundamentals:
- How would you use Splunk to search and filter log data to identify errors or anomalies in an application
- Monitoring and Alerting:
- Describe how you would set up monitoring and alerting for a critical service using Dynatrace.
- Log Analysis with Splunk:
- Can you provide an example of a Splunk search query youve used to troubleshoot a specific issue
- Performance Optimization:
- How do you use Dynatrace to identify and address performance issues in a web application
- Incident Response:
- Describe your role in an incident response scenario and how you used Dynatrace or Splunk to diagnose and resolve the issue.
- Infrastructure Monitoring:
- How would you monitor the health and performance of a Kubernetes cluster using Dynatrace
- Data Visualization:
- How do you create dashboards in Splunk to visualize key metrics and trends for a service
- Collaboration with Development Teams:
- How do you work with development teams to implement observability and monitoring solutions using Dynatrace and Splunk
- Continuous Improvement:
- How do you use data from Dynatrace and Splunk to drive continuous improvement in application performance and reliability
Dynatrace,Splunk,Financial Services,Kubernetes,DevOps