Site Reliability Engineer SRE with OpenShiftKubernetes Jobs in LussoTech LLC in Houston, TX - USA

Site Reliability Engineer SRE with OpenShiftKubernetes

LussoTech LLC

Posted on : 12-11-2024

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

Houston, TX - USA

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 12-11-2024

Job Description

Job Description
Client has a need for a Site Reliability Engineer (SRE) to become a part of our growing Digital IT team focused on building an OpenShift/Kubernetes capability. The SRE will support the reliability of Digital IT/OT critical applications. This transformative role involves automating IT infrastructure tasks and driving SRE best practices tools and processes. The ideal candidate should exhibit a growth mindset and proactively monitor and work with application developers to respond to incidents for optimal user experience.

The candidate must have senior level experience deploying OpenShift on premises and supporting applications in Kubernetes. The ideal candidate will have experience in both onprem OpenShift and Azure Kubernetes container platforms.

The successful candidate will possess strong infrastructure and developer background as well as interpersonal skills needed to communicate design requirements and objectives while providing thought leadership to peers and leadership.

Responsibilities:

Maintaining survivability and reliability of IT/OT critical resources.

Write and build CI/CD pipelines and build/release processes for IT/OT workflow applications.

Provide mentoring to the IT/OT Devops team in the best practices associated with CI/CD deployments using ADO and GIT.

Perform periodic load and scalability testing to establish baselines drift and capacity planning.

Conduct weekly operational state reviews covering performance trends anomalies errors and other availability events with SREs product owners and development teams.

Participate in quarterly business and operational reviews aligning on roadmaps development velocity efficiency growth trends patching etc.

Plan and execute periodic Disaster Recovery exercises including both tabletop and simulated failures (fault injection).

Required Qualifications

Candidates must have a bachelors degree and 12 years of IT experience.

Senior level experience with OpenShift and Kubernetes.

Familiarity with continuous integration/deployment processes and tools such as IDEs (Eclipse) Source Code management. (GIT/Stash) ADO Pipelines Maven Nexus artifacts etc.

Strong understanding of SRE practices: incident response change/release management capacity planning infrastructure automation elastic environments chaos engineering and blameless postmortems.

Expertise in application performance monitoring observability and proactive alert correlation including monitoring containers and failurebased alerting.

Scripting experience such as Python and Bash

Experienced in deploying applications in OpenShift in both public and private cloud.

Ability to work and interact with others in a structured/team environment.
Technology Stack

Experience with at least one technology in each of the tech stack categories below:

Monitoring and Logging Tools(s): AppDynamics Splunk ELK Stack DataDog Prometheus AWS CloudWatch/XRay Grafana

Programming: C# .NET PowerShell Python YAML

Containers: Docker Helm Chart

OS: Linux RHEL Ubuntu CentOS

Code Repos: Azure Repos GitHub GitLab

Infrastructure as code: Terraform Ansible

Automation Tools: AnsibleJenkins Chef Puppet

Agile: JIRA SAFe

Desired Qualifications

Experience in cloud/virtual technologies and management OpenShift VMware AWS Azure etc.

Familiarity with security best practices for containerized applications.

Knowledge of DevOps practices and tools.

Knowledge skills and abilities to automate the creation of Platform as a Services (PaaS) infrastructure using industry standard tools such as Ansible and Chef.

Familiarity with Industrial Control System (ICS) security architecture Purdue model.

Work Location:
OnSiteHouston or Bartlesville

Employment Type

Full Time

Company Industry

Key Skills

Apply Now

About Company

LussoTech LLC

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Free AI Resume Review

Get Hired 3x Faster with free, confidential review from Ai resume review service.

Order Now

Resume, LinkedIn, Cover Letter

Elevate your professional profile with expertly crafted documents including your resume, LinkedIn profile, cover letter.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Learn More

Reverse Recruiting

Never apply for a job again. We apply and track jobs for you to find your perfect match.

Site Reliability Engineer SRE with OpenShiftKubernetes

LussoTech LLC

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Manager Site Patient Engagement

Sr Data Engineer

Software Engineer

Civil Engineer

Software Engineer - Mid-Level

Overnight Maintenance Engineer

Overnight Maintenance Engineer

Engineer I - HVACR