HPC/AI Infrastructure Engineer
K20s Kinetic Technologies
Posted 20 hrs ago
Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
1 Vacancy
Job Description
Roles & Responsibilities
Key Responsibilities:
- Deploy, configure, and manage NVIDIA Base Command Manager for orchestrating GPU workloads (critical).
- Implement and maintain NVIDIA AI Enterprise Suite to support enterprise-grade AI frameworks.
- Operate and optimize NVIDIA GPU and Network Operators within Kubernetes environments.
- Utilize NVIDIA NIMs and Blueprints to streamline AI model deployment and infrastructure automation.
- Administer and scale Slurm workload manager for HPC job scheduling (critical).
- Manage vanilla Kubernetes clusters, ensuring high availability and resource efficiency.
- Maintain and secure systems running on Canonical Ubuntu OS, including patching and performance tuning.
Required Skills & Qualifications:
- Strong expertise with NVIDIA GPU technologies and AI infrastructure.
- Hands-on experience with Slurm in HPC environments.
- Proficiency in Kubernetes cluster administration.
- Deep knowledge of Linux (Ubuntu) system administration.
- Familiarity with network operators and GPU scheduling in containerized environments.
- Ability to troubleshoot complex distributed systems.
Preferred Skills:
- Experience with automation tools (e.g., Ansible, Terraform).
- Knowledge of cloud-native architectures and hybrid HPC/AI deployments.
- Familiarity with observability tools (Prometheus, Grafana).
- Background in AI/ML workflows and performance optimization.
Desired Candidate Profile
Experience: 5+ years
Location: KSA - Saudi Arabia
Contract Duration: 1 year
Overview:
We are seeking a highly skilled HPC/AI Infrastructure Engineer to design, deploy, and manage advanced computing environments leveraging NVIDIA technologies, Kubernetes, and Linux systems. This role is critical to ensuring the performance, scalability, and reliability of AI workloads across GPU-accelerated clusters.
Company Industry
- IT - Software Services
Department / Functional Area
- IT Software
Keywords
- HPC/AI Infrastructure Engineer