drjobs Site Reliability Engineer mfd

Site Reliability Engineer mfd

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Belgrade - Serbia

Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Job Description

Are you an experienced developer or DevOps engineer Do you want the freedom to work remotely and want to grow in the new field of site reliability at an internationally successful software and education company Well than take our reliability to the next level as part of our Site Reliability Engineering team :)

*** Please note ENGLISCH and GERMAN is a MUST on this position. Please do not apply if you do not speak both languages ***

Who is Digistore24

We are one of the fastestgrowing tech companies in Europe.

What drives us We shape the digital future! Our mission is to empower people with our software and expertise to share their knowledge online enabling them to fulfill their dream of an own business. As a result millions of people gain access to information that helps them reach their goals. To keep pace with our growth we aim to expand our teams sustainably. We emphasize working with experts and strong personalities who share our values regardless of their location.

Your new dream job

  • Automation and Infrastructure as Code (IaC): You automate repetitive tasks deployments and system management to reduce human error and improve efficiency. This might involve creating scripts CI/CD pipelines or automating infrastructure provisioning.
  • Reliability and Performance Optimization: You continuously improve the system uptime by identifying bottlenecks and optimizing system architecture.
  • Capacity Planning and Scaling: You assess and predict system resource requirements (CPU memory storage) to ensure the infrastructure can scale with increasing demand. Implement autoscaling solutions to handle load spikes without human intervention ensuring systems remain performant under various conditions.
  • System Monitoring and Incident Response: Continuously monitor system performance uptime and reliability using tools like Prometheus Grafana or ElasticSearch. The goal is to detect and respond to issues before they impact users. Manage and respond to incidents outages and failures quickly aiming to minimize downtime. This includes managing incident documentation communication and postincident analysis.
  • Incident Postmortems and Continuous Improvement: Conduct root cause analysis (RCA) after incidents to identify what went wrong and how to prevent similar issues in the future. Implement fixes improvements and best practices based on learnings from postmortems to increase system reliability and reduce future incidents.

Your benefits at Digistore24

You will play a crucial role in shaping our cuttingedge projects in our collaborative work environment while enjoying flexibility in working time and location.

  • Work in our partners coworking spaces or in your home office as long as you can guarantee uninterrupted internet access
  • Regular further education
  • The stability of an extremely successful German hightech company that is funded by its successful product and not by investors
  • Outcome focused teams and a culture of direct feedback
  • Modern equipment: Thinkpad or MacBook
  • International collaborative team with strong cohesion
  • Spectacular team events in various European countries
  • Autonomy from day one
  • Contribution to the retirement scheme
  • Work in your team on a firstname basis without a dress code and at eye level
  • Flexible working hours from Mondays to Fridays (core working hours from 10AM to 4PM)

Requirements

Your superpower

  • Communication Mastery: You communicate precisely and in a recipientfriendly manner. You diffuse potential conflicts with sensitivity and a solutionoriented approach. You always strike the right tone with stakeholders developers and your team even under time pressure and can seamlessly switch from German to English if necessary.
  • Collaboration Wizardry: You collaborate with developers stakeholders and operations and bring everyone on the same page. You understand the challenges of different teams and find solutions that benefit the entire company.
  • Automation Sorcery: You promote automation as a way to save time and reduce errors and implement tools that improve productivity across the team.
  • ProblemSolving Genius: You dive deep into problems identify root causes and come up with solutions that prevent future incidents.
  • Selforganization: You thrive on autonomy and excel at organizing and structuring complex projects while working from home.
  • Tech stack:
    • Kubernetes / Container Technology (no description necessary)
    • CI/CD (Github Workflows Helm Kustomize)
    • Cloud Services (preferably Google but others are also okay)
    • Excellent spelling and grammar in German (no description necessary)
    • PHP language experience would be a plus

Your typical day at Digistore24

  • Morning video call to talk to your team about yesterdays progress and todays plans.
  • You like to work in a structured way and outline your daily routine and daily goals. Like every day you block out enough time to work on the continuous development of our SRE processes. You are not alone in this but can count on the support of your team.
  • Now its time for the daily call with your team. You report on your priorities and blockers and receive tangible tips on how to solve your challenges.
  • For the next few hours you allow yourself the luxury of turning off all messengers in order to develop focused ideas for improvements in autoscaling monitoring and alerting. You then test your ideas in practice. You make a note of these success principles so that you can present them to the Head of IT Operations in a oneonone call.
  • After your lunch break a developer needs help with a new CI/CD workflow. You discuss the requirements with him and provide him with an initial prototype.
  • You take the ticket to check the resource allocation of an application check the current utilization and adjust the deployment.
  • You find an endpoint that is not yet included in the monitoring. After creating a ticket for this you immediately write the code in the Terraform project to add it.

This position is NOT for you if

  • ... you do not identify with our values
  • you have less than 3 years of experience in IT operations
  • you cant take ownership and need to discuss every detail with your supervisor or colleagues
  • you have difficulty planning and prioritizing your tasks
  • you dont like to find solutions for complex problems
  • you are not confident speaking German AND English

Our values

Please take a REALLY close look at the values. Are you ready to live them

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.