PLEASE NOTE: THIS IS AN EQUITY-ONLY ROLE, AND INTERVIEWS WILL COMMENCE IN FEBRUARY 2025.
A stealth-mode startup client is seeking a DevOps Engineer to design, implement, and maintain scalable, secure, and reliable infrastructure for an innovative web and mobile platform. This role will focus on automation, CI/CD pipeline management, cloud infrastructure optimization, and ensuring system uptime and resilience.
The ideal candidate will have strong expertise in cloud platforms, infrastructure as code (IaC), and CI/CD pipelines, and a passion for building high-performance deployment systems.
To apply, please provide a CV, your compensation requirements (including salary expectations for when funding is secured), and a cover letter/note that explains why you are interested and how you meet the requirements. Please note that submissions missing any of the requested information will be automatically rejected.
Key Responsibilities:
- Design and maintain scalable, secure, and reliable infrastructure on cloud platforms (e.g. AWS, GCP, Azure).
- Implement Infrastructure-as-Code (IaC) using tools such as Terraform, CloudFormation, or Ansible.
- Ensure systems are highly available and fault-tolerant, capable of supporting global traffic.
- Build, manage, and optimize CI/CD pipelines for continuous integration, deployment, and delivery.
- Automate deployment workflows to ensure smooth and efficient software releases.
- Monitor and troubleshoot CI/CD pipeline issues promptly.
- Implement and manage monitoring and alerting systems (e.g. Prometheus, Grafana, Datadog).
- Set up logging and observability tools (e.g. ELK stack, Splunk) to track system health and diagnose issues.
- Proactively identify and resolve performance bottlenecks and infrastructure vulnerabilities.
- Enforce security best practices across infrastructure, CI/CD pipelines, and deployment processes.
- Implement role-based access controls (RBAC) and ensure data encryption at rest and in transit.
- Ensure compliance with data privacy and security standards (e.g. GDPR, CCPA, ISO 27001).
- Develop and maintain disaster recovery (DR) and backup strategies to ensure system resilience.
- Perform regular disaster recovery drills to verify system integrity under failure scenarios.
- Optimize resource allocation and cost efficiency on cloud infrastructure.
- Ensure systems can scale dynamically to meet fluctuating traffic demands.
- Collaborate closely with Backend, Frontend, and Data Engineering teams to align deployment and infrastructure strategies.
- Provide technical guidance to team members on infrastructure and deployment-related challenges.
- Maintain comprehensive technical documentation for infrastructure, pipelines, and operational workflows.
- Ensure team members have access to clear playbooks and troubleshooting guides.
Requirements:
- Minimum 4 years of experience in DevOps, Site Reliability Engineering (SRE), or a similar role.
- Excellent command of written and spoken English.
- Previous startup experience would be an advantage.
- Hands-on experience with AWS, GCP, or Azure cloud platforms.
- Proficiency with IaC tools such as Terraform, CloudFormation, or Ansible.
- Expertise in CI/CD tools like Jenkins, GitLab CI, CircleCI, or similar.
- Experience with Docker and Kubernetes for container orchestration and management.
- Proficiency with monitoring and observability tools (e.g. Prometheus, Grafana, Datadog, ELK Stack).
- Understanding of cloud security protocols, IAM roles, and data encryption standards.
- Proficiency in scripting languages like Bash, Python, or Go.
- Strong experience with Git workflows and version control systems.
- Strong analytical and troubleshooting skills to resolve infrastructure and deployment issues efficiently.
- Ability to create clear and detailed technical documentation for infrastructure and processes.
Ideal Candidate Profile:
- A proactive engineer passionate about building reliable, scalable, and high-performing systems.
- Detail-oriented, with a strong focus on security, compliance, and operational excellence.
- Adaptable to rapidly changing requirements in a fast-paced startup environment.
- Collaborative, with excellent cross-functional communication skills to align technical goals.
- Excited about staying ahead of emerging trends and technologies in DevOps and infrastructure management.
- Committed to fostering a culture of automation and efficiency across the team.
Compensation & Benefits:
Equity-only at present, transitioning to a salaried, full-time, permanent position when funding is secured.
Remote and flexible working arrangements, the opportunity to be part of something potentially epic, and, in due course, potential opportunities for global travel and access to industry conferences and workshops.