Senior Dev Operations Engineer SRE CR260-A

SoftSol, Inc.

Posted on : 29-09-2024

Employer Active

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

Alexander City - USA

Salary

Not Disclosed

Salary Not Disclosed

Posted on : 29-09-2024

Job Description

Job Title: Senior Dev Operations Engineer SRE (CR260)

Location: Remote

Duration: Long Term

MUST HAVES

Experience setting up alerts / alarms / notifications in AWS cloud. CloudWatch / Dynatrace
Experience with AWS solutions using AWS services including Kafka ECS EKS.
Experience with IaC (Infrastructure as code) CDK or Terraform.

Objective:

The Site Reliability Engineer (SRE) will be a lead on the DevOps team and is responsible for system administration areas including monitoring installation configuration maintenance operations and architecture of AWS cloud environments and on premise environments. The candidate will work within a team in implementing and maintaining all production and preproduction environments by implementing tools and automation. Looking for a candidate with exceptional Site Reliability and DevOps skills and should have extensive knowledge and experience in implementing solutions and tools to maintain and grow all application environments. Most importantly the right individual will possess a positive cando attitude and a passion for delivering technical solutions in a fastpaced environment. In addition the individual will be dedicated independent and collaborate at a high level in ensure the stability and reliability of infrastructure and applications running in the AWS Cloud and on premise environments. Advanced experience working in AWS environments will be expected while leading the implementing of improvements and advancements.

Deliverables:

Monitoring sites environments and software by implementing tools and automation to achieve 99.9% uptime.
Measurement optimization and tuning of system performance and ensuring that systems will run reliably and are highly available in a 24/7 production environment.
Automate system and application monitoring using monitoring and automation tools
Anticipating potential problems before they occur and coming up with solutions.
Conducting postincident reviews and Root Cause Analysis.
Documenting your work to turn findings into repeatable actions.
Coding automation within a site infrastructure.
Implement production monitoring systems.
Utilize strong analytical and problemsolving skills.
Security assessments and addressing vulnerabilities.
Design and deploy AWS solutions using AWS services (i.e. EC2 S3 Glacier ELB RDS IAM Route 53 VPC Auto Scaling Cloud Watch Cloud Trail Cloud Formation Security Groups API Gateway SSM Route table Endpoint service etc.)
Provision management and daytoday operations of AWS environments
Implement alarms / alerts / notifications using AWS services (i.e. Cloud Watch)
Implement AWS Multi AZ accounts for HA and DR
Design AWS infrastructure that minimize operational costs through pushbutton deployment at scale with nearzero downtime.
Develop and maintain configuration management solutions.
Provide technical guidance knowledge transfers and mentorship to State Fund internal engineering
peers as required and lead technical staff responsibilities.
Server Maintenance based on updates system requirements data usage and antivirus requirements.
Responsible for the design implementation and support of large scale web farm infrastructure across multiple data centers supporting the Infrastructure as a Service (IaaS) offering.
Help engineering implement new technologies in development for future production deployment.
Working with team to analyze and design infrastructure witch includes virtualization clustering database disaster recovery and geographic redundancy.
Triage and provide technical solutions to environment related issues encountered by new and existing applications
Support developers with change requests uptime and performance related issues.
Documentation of work in regards to bug reports systems analysis application monitoring and common task reporting
Author internal documentation such as environment diagrams installation/configuration documents and release notes.
Assist in establishing and implementing configuration management program and policies.
Troubleshoot and debug environment and infrastructure problems found in the production and nonproduction environments.
Collaborating with software developers engineers and operations teams.
Provide 24 by 7 production support

TECHNICAL KNOWLEDGE AND SKILLS:

6 years of overall IT experience
4 years of AWS Cloud management experience with below skill set
AWS Certified DevOps and / or Solution Architect certification
Experience in AWS provisioning operations and management of AWS environments.
Experience setting up alerts / alarms / notifications in AWS cloud. CloudWatch / Dynatrace
Experience with AWS solutions using AWS services including Kafka ECS EKS.
Experience with IaC (Infrastructure as code) CDK or Terraform.
Experience setting up / maintaining multi AZ infrastructure including HA and DR in AWS.
Experience with code repositories Azure DevOps Server GIT GITLab SVN
Experience with continuous integration tools Jenkins Azure Pipelines
Excellent knowledge of Linux systems
Experience with system automation and configuration management tools including Ansible
Experience with Python scripting
Strong background in networking load balancing and firewalls
Highlevel understanding of networking standard protocols and components such as: HTTP DNS TCP/IP ICMP the OSI Model Subnetting and Load Balancing
Thorough understanding of and experience with managing web applications in a highly available environment
Experience in Software development is a plus
Familiarity with deploying and configuring Java and .Net applications.
Experience with Application Security Testing tools a plus (Coverity Tenable BlackDuck etc)
Understanding of SQL PL/SQL and TSQL command

Employment Type

Full Time

Company Industry

Key Skills

Apply Now

About Company

SoftSol, Inc.

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Free AI Resume Review

Get Hired 3x Faster with free, confidential review from Ai resume review service.

Order Now

Resume, LinkedIn, Cover Letter

Elevate your professional profile with expertly crafted documents including your resume, LinkedIn profile, cover letter.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Learn More

Reverse Recruiting

Never apply for a job again. We apply and track jobs for you to find your perfect match.

Senior Dev Operations Engineer SRE CR260-A

SoftSol, Inc.

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Senior Accountant

Sales Engineer

Manufacturing Engineer

Java Software Engineer

Quality Assurance Engineer

Local CDL-A Truck Driver

Local CDL-A Truck Driver

Staff Engineer - JavaAWS Zelle