Job Description:
As a Systems Analyst, you will provide technology consulting to external customers and internal project teams. You will be responsible for delivering technical support and leadership in creating and implementing technology solutions tailored to customers' business needs. This role requires a deep understanding of customers' operations to ensure satisfaction and effective solution delivery. You will contribute to the company's solutions portfolio by sharing technical knowledge and methodologies derived from customer projects, shaping technical direction and strategies both within the organization and for external clients. This position demands consistent and significant engagement, assisting in meeting or exceeding revenue and customer satisfaction goals and contributing to the organization's profitability by generating and cultivating new business opportunities.
Job Responsibilities:
- Provide onsite system administration and High-Performance Computing (HPC) application consulting services.
- Address and resolve top issues in the HPC environment.
- Maintain HPC systems availability for customers.
- Monitor system performance and recommend improvements.
- Collaborate with team members and stakeholders to deliver high-quality support and solutions.
- Create and document site procedures, system diagrams, and other configuration or support documents.
- Maintain system software and firmware revisions, including patches, updates, and OS upgrades.
- Troubleshoot system hardware, software, and third-party software issues, providing detailed analysis of problems and solutions.
- Gather data, perform analysis, and escalate problems to higher-level product support groups and management as necessary to ensure timely resolution of system or customer issues.
- Provide solutions and implement repairs or workarounds when possible, fully documenting steps taken.
Education and Experience Required:
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 4 years of HPC-related experience, ideally with large-scale HPC and parallel file system administration and support.
- Without a degree, three additional years of relevant professional experience (7 years in total).
Knowledge and Skills:
- Understanding of an HPC Data Center IT Operations environment.
- Expertise in HPC application consulting and support.
- Strong system administration skills, particularly in HPC environments.
- Extensive knowledge and experience with Linux operating systems (RHEL or SLES).
- Experience with job scheduling and resource management tools.
- Experience with various HPC hardware architectures and software stacks.
- Knowledge of parallel file systems (e.g., Lustre, GPFS).
- Familiarity with containerization technologies (e.g., Docker, Singularity).
- Experience with scripting and automation tools (e.g., Python, Bash, Ansible).
- Familiarity with cybersecurity best practices in HPC environments.
- Ability to lead and work effectively in a team environment.
- Direct experience and demonstrated proficiency with multiple programming and scripting languages (e.g., Perl, Python, C, FORTRAN) preferred.
- Ability to maintain system software utilizing debugging tools for problem isolation; will perform