drjobs SRE LEAD

SRE LEAD

صاحب العمل نشط

هذا المنشور غير متاح الآن! ربما يكون قد تم شغل الوظيفة.
drjobs

حالة تأهب وظيفة

سيتم تحديثك بأحدث تنبيهات الوظائف عبر البريد الإلكتروني
Valid email field required
أرسل الوظائف
drjobs
أرسل لي وظائف مشابهة
drjobs

حالة تأهب وظيفة

سيتم تحديثك بأحدث تنبيهات الوظائف عبر البريد الإلكتروني

Valid email field required
أرسل الوظائف
موقع الوظيفة drjobs

Of - تركيا

الراتب الشهري drjobs

لم يكشف

drjobs

لم يتم الكشف عن الراتب

الوصف الوظيفي

SRE resource requirements

Looking for an experienced SRE resource with proven experience in designing architecting and implementing robust and durable solutions especially in the cloud preferably at scale.

Roles and Responsibility:

  • Engage in and improve the whole life cycle of application and cloud servicesfrom inception and design through deployment operation and refinement.
  • Design develop ship and motivate the creation of software and systems to increase product reliability and organizational efficiency.
  • Lead development and tracking of SRE Error Budgets
  • Lead development of SRE dashboard.
  • Lead root cause investigations.
  • Proactively identify system anomalies
  • Recognize automation opportunities.
  • Plug into software release cycle. Work closely with developers to ensure software releases are well designed planned implemented released and monitored.
  • Automate timeconsuming and manual processes.
  • Assess current SRE solution and define the SRE approach for products.
  • Work with applications development teams on designing implementing and improving SRE practices.
  • Design and execute Scalability strategies that ensure the scalability and the elasticity of the infrastructure.

Educational Qualification

  • An Engineering/master or equivalent degree in computer science

Certifications

  • Associate level of certification in cloud.

Skills must have

  • Cloud Platform Expertise: Cloud platform experience with AWS Azure or GCP (Preferable) and handson experience with key cloud services including logging & monitoring Pub/Sub Function as a Service (FaaS) and Containers.
  • Strong Knowledge on IAC: Expertise in Infrastructure as Code (IAC) and strong command on Terraform for provisioning and managing cloud infrastructure.
  • Proficiency in Core SRE Principles: Expertise in essential SRE concepts such as CUJ SLO SLI and Error Budgeting based on NFRs and ability to apply these principles effectively to ensure service reliability meet business objectives and drive continuous improvement initiatives.
  • Experience of Reducing TOIL: Identifying manual and repetitive tasks within the Software Development Life Cycle (SDLC) or IT operations and implementing automation solutions to reduce the TOIL. Ability to streamline processes enhance productivity and free up resources for more strategic initiatives through automation and process improvement.
  • Comprehensive CI/CD Proficiency: Strong understanding of Continuous CI/CD practices with robust knowledge of Git GitHub Actions and GitHub Workflows. Familiarity with other tools such as Jenkins and similar would be advantageous.
  • Proficiency in Container Orchestration: Handson experience in creating and managing Docker images ensuring optimal performance and security. Proficiency in Kubernetes platform including the ability to effectively manage containerized applications scale resources as needed and troubleshoot issues in production environments.
  • Monitoring and Observability: Experience with monitoring tools such as Prometheus Grafana and ELK Stack and should be able to set up and configure monitoring solutions utilize metrics for performance optimization and troubleshoot issues effectively.
  • Proficiency in Scripting Languages: Demonstrate strong scripting skills particularly in languages such as Python and Shell Scripting.
  • Collaboration and Communication: Should excel in working within diverse teams conveying technical concepts clearly to nontechnical stakeholders and actively getting engaged in incident management and postincident reviews.
  • Continuous Learning and Adaptability: Demonstrate commitment to staying current with emerging technologies trends and best practices in cloud computing and SRE methodologies showing a willingness to adapt and learn as needed.

Skills nice to have

  • Detailed knowledge about networking
  • Experience on design cloud Infrastructure solution and application migration to cloud
  • Automation and Performance Management
  • Security Practices: Knowledge of security best practices tailored to cloud environments including IAM Network Security and Encryption Techniques.

Other

  • Interest level to learn new technologies and implement
  • Open to pick the different profile with in SRE/DevOps

Energy level and communication

Consider large/popular use cases during discussion like google SRE Netflix SRE etc

نوع التوظيف

دوام كامل

نبذة عن الشركة

الإبلاغ عن هذه الوظيفة
إخلاء المسؤولية: د.جوب هو مجرد منصة تربط بين الباحثين عن عمل وأصحاب العمل. ننصح المتقدمين بإجراء بحث مستقل خاص بهم في أوراق اعتماد صاحب العمل المحتمل. نحن نحرص على ألا يتم طلب أي مدفوعات مالية من قبل عملائنا، وبالتالي فإننا ننصح بعدم مشاركة أي معلومات شخصية أو متعلقة بالحسابات المصرفية مع أي طرف ثالث. إذا كنت تشك في وقوع أي احتيال أو سوء تصرف، فيرجى التواصل معنا من خلال تعبئة النموذج الموجود على الصفحة اتصل بنا