drjobs Software Stack Engineer

Software Stack Engineer

Employer Active

drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Alexander City - USA

Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Job Description

Role: Software Stack Engineer

Location: 6 Month Contract Can be extended

Remote but some travel to Jersey City NJ or Charlotte NC

Job Description/Position Summary:

Design engineer build and deliver AI Infrastructure and Platform solution for Model Development and Model Deployment (Inference). This is not an Operational Support role but an Architecture and Engineering role with Docker and Container Expertise. Resource will collaborate with AI/ML teams to understand their requirements and translate them into scalable Kubernetesbased infrastructure solutions. Resource should understand desired outcomes of AI Platforms required to support the Data Scientist community.

Primary Skills:

  • Continuous Automation
  • Coding and Application Development
  • Work in collaborative environment across CTO and CIO teams
  • Excellent documentation and articulation skills
  • Architecture and Strategy
  • Analytical Thinking
  • Constraintdriven Decision Making
  • Data Management / Governance
  • Attention to detail

Continuous Automation Coding and Application Development Work in collaborative environment across CTO and CIO teams Excellent documentation and articulation skills Architecture and Strategy Analytical Thinking Constraintdriven Decision Making

Required Skills:

  1. Docker Containerization and Orchestration strong understanding of pods services and deployments
  2. Docker Operators and Helm charts
  3. Understanding of Kubernetes security best practices (e.g. RBAC network and pod security policies)
  4. Ability to set up monitoring logging and alerting for Kubernetes clusters using Prometheus/Grafana
  5. Optimize Kubernetes cluster performance resource utilization.
  6. Python JSON

Desired Skills:

Working level understanding of the following:

  1. AI Frameworks like PyTorch
  2. Linux Resource Management Tools SLURM
  3. Nvidia Software Stack CUDA TensorRT Triton Inference Server
  4. Jupyter Notebook

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.