High Performance Computing Engineer
Norconsult Telematics
Employer Active
Posted 8 hrs ago
Send me Jobs like this
Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
1 Vacancy
Job Description
Roles & Responsibilities
Job Description and Responsibilities
- Design, deploy, and maintain HPC clusters and GPU-enabled servers for compute-intensive workloads, ensuring high availability and performance.
- Administer and troubleshoot Red Hat Enterprise Linux systems in production environments to ensure stability and uptime.
- Develop and maintain automation scripts (e.g., Ansible, Bash, Python) for system provisioning, configuration management, and patching.
- Manage job scheduling systems (e.g., Slurm, PBS, LSF) and parallel file systems (e.g., Lustre, GPFS, BeeGFS) for efficient workload distribution.
- Configure and optimize GPU workloads using NVIDIA CUDA or ROCm in HPC/AI environments.
- Collaborate with researchers, data scientists, and engineering teams to fine-tune workloads and enhance system performance.
- Ensure system security, stability, and compliance with organizational policies through proactive measures.
- Monitor system health, analyze performance metrics, and conduct capacity planning to meet future demands.
- Provide technical support, comprehensive documentation, and user training for HPC/GPU systems to enable seamless user experiences.
Desired Candidate Profile
Qualifications & Skills
- Bachelor s degree in Computer Science, Information Technology, or a related field.
- Minimum of 5-7 years of experience in HPC systems administration or related roles, with expertise in Red Hat Enterprise Linux.
- Proficiency in automation scripting (Ansible, Bash, Python) and managing job schedulers (Slurm, PBS, LSF) and parallel file systems (Lustre, GPFS, BeeGFS).
- Hands-on experience with GPU workload configuration (NVIDIA CUDA, ROCm) in HPC/AI environments.
- Red Hat Certified Engineer (RHCE) or higher certification is preferred.
- Experience in academic, research, or enterprise-scale HPC environments, with exposure to cloud-based HPC platforms (e.g., AWS ParallelCluster, Azure CycleCloud).
- Familiarity with AI/ML workflows and tools (e.g., TensorFlow, PyTorch) in GPU environments is a plus.
- Strong problem-solving, collaboration, and communication skills to support cross-functional teams.
br>
Company Industry
- Telecom
- ISP
Department / Functional Area
- IT Software
Keywords
- High Performance Computing Engineer
Disclaimer: Naukrigulf.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@naukrigulf.com
Norconsult Telematics
https://ntww.mynexthire.com/employer/jobs/careers#?src=careers&page=careers