Key Responsibilities:
- Linux Systems Management: Administer and optimize Linux-based servers and environments, ensuring high availability, scalability, and performance.
- CI/CD Pipeline Development: Design, implement, and maintain Jenkins pipelines to automate the build, test, and deployment processes.
- Configuration Management with Ansible: Automate system configuration, provisioning, and deployment using Ansible to ensure consistency and reliability.
- Scripting and Automation: Write Python scripts to automate repetitive tasks, manage infrastructure, and integrate with different tools and services.
- Version Control: Manage source code repositories using Git/Bitbucket, enforce branching strategies, and ensure smooth collaboration between development teams.
- Kafka: Configure, monitor, and optimize Kafka clusters, ensuring high-throughput, low-latency messaging across services.
- Aerospike Management: Administer and optimize Aerospike NoSQL databases for high performance, scalability, and fault tolerance.
- Monitoring and Alerting with Prometheus & Grafana: Set up and maintain Prometheus for real-time monitoring and alerting of infrastructure and application performance. Visualize monitoring data through Grafana dashboards.
- Log Management with Elasticsearch & Kibana: Implement centralized logging solutions using Elasticsearch for storage and Kibana for visualizing and analyzing logs. Proactively monitor logs to detect and resolve issues.
- Collaboration: Work closely with developers, QA, and IT teams to improve deployment processes, code quality, and overall system reliability.
- Troubleshooting and Incident Management: Respond to critical incidents, perform root cause analysis, and implement preventive measures to ensure high availability and reliability of services.
Requirements
Required Skills:
- Linux: In-depth knowledge of Linux operating systems, including experience with shell scripting, package management, and system administration tasks.
- Jenkins: Hands-on experience with Jenkins for building, testing, and deploying applications in a CI/CD pipeline.
- Ansible: Proficient with Ansible for configuration management, automation, and orchestration tasks.
- Python: Strong Python programming skills, especially for automating tasks, managing cloud infrastructure, and integrating with APIs.
- Git/Bitbucket: Solid experience with Git, including branching, merging, and versioning strategies. Familiarity with Bitbucket for repository management.
- Kafka: Good experience with Kafka, including the configuration and management of Kafka brokers, producers, consumers, and stream processing.
- Aerospike: Strong working knowledge of Aerospike for high-performance, scalable, distributed NoSQL database solutions.
- Prometheus: Experience setting up and managing Prometheus for infrastructure monitoring and alerting.
- Grafana: Proficient in creating and managing Grafana dashboards for visualizing monitoring metrics and performance data.
- Elasticsearch: Hands-on experience working with Elasticsearch for log storage, indexing, and query performance.
- Kibana: Experience with Kibana for visualizing, exploring, and analyzing logs and metrics collected in Elasticsearch.
Stakeholder management and control to ensure timely project delivery while maintaining a healthy environment. Drive new programs while accounting for risks and threats to project progress. Participate in quarterly planning and associated activities. Ensure project-level effort estimation and capacity planning to arrive at project timelines for new product creation and launches. Track program progress and work with the product owner to keep project scope from getting out of hand.