Hello,
I hope you are doing great.
Please find the JD for your reference and let me know if you are interested in this role.
Role: Project Lead; Site Reliability Engineer
Location: California City, CA, US (Remote)
Florida City, FL, US (Remote)
Missouri City, MO, US (Remote)
Job description
This role requires a broad range of engineering skills and a strong desire to protect the customer experience. While we are on our journey to the cloud, we still operate and support our own colocation spaces and data centers. You will troubleshoot issues, collaborate with multiple application teams on performance and reliability, and drive observability into systems ranging from core infrastructure in data centers to services running in the cloud.
What you will be doing day to day
Participate in an on-call rotation
Participate in high-severity, customer-impacting issues
Use your breadth of expertise as an SRE to engage with engineering teams struggling to maintain reliable and performant services
Participate as an influential partner with the problem management team, helping set priorities and acting as a technical leader in post-incident reviews
Participate with IT and engineering teams to maintain a knowledge repository that will contain application technical mapping and previously identified issues and symptoms
Partner with IT and engineering teams to identify gaps in observability, either missing telemetry or alerting
Partner with the Reliability Architecture team on influencing the adoption of standards for items like proper health checks, application diagnostics, partner training opportunities, and additional diagnostic tooling
Education/Experience: Bachelor's degree in Computer Science, Information Systems Management, Engineering or related field or equivalent experience.
Required Technical Experience (5-7 years):
Experience with enterprise storage solutions (PowerMax, Isilon, NetApp ONTAP, Cisco MDS, Pure Storage, etc.)
Experience working with virtualization technologies (VMware)
Experience with automation languages such as PowerShell and Python
Experience troubleshooting operating systems (Linux, Windows, Unix)
Experience troubleshooting large-scale enterprise networks
Experience writing configuration as code (Terraform, ARM, etc.)
Experience with software engineering stacks (.NET, LAMP, etc.)
Experience reading and writing object-oriented code
Experience with languages such as Java, C# or other OO languages
Experience or familiarity with using observability platforms such as Dynatrace or Splunk
Desired Technical Experience:
Bachelor's degree in Computer Science, Information Systems Management, Engineering or related field or equivalent experience
Certificate in one or more cloud technologies (AWS preferred)
Experience with container technologies (Docker, Kubernetes)
Experience doing deep packet analysis
Full Time