HPC/AI Infrastructure Engineer

K20s Kinetic Technologies

Posted 20 hrs ago

Experience

5 - 7 Years

Job Location

Riyadh - Saudi Arabia

Education

Any Graduation

Nationality

Any Nationality

Gender

Not Mentioned

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

Key Responsibilities:

  • Deploy, configure, and manage NVIDIA Base Command Manager for orchestrating GPU workloads (critical).
  • Implement and maintain NVIDIA AI Enterprise Suite to support enterprise-grade AI frameworks.
  • Operate and optimize NVIDIA GPU and Network Operators within Kubernetes environments.
  • Use NVIDIA NIM microservices and Blueprints to streamline AI model deployment and infrastructure automation.
  • Administer and scale Slurm workload manager for HPC job scheduling (critical).
  • Manage vanilla Kubernetes clusters, ensuring high availability and resource efficiency.
  • Maintain and secure systems running on Canonical Ubuntu OS, including patching and performance tuning.

Required Skills & Qualifications:

  • Strong expertise with NVIDIA GPU technologies and AI infrastructure.
  • Hands-on experience with Slurm in HPC environments.
  • Proficiency in Kubernetes cluster administration.
  • Deep knowledge of Linux (Ubuntu) system administration.
  • Familiarity with network operators and GPU scheduling in containerized environments.
  • Ability to troubleshoot complex distributed systems.

Preferred Skills:

  • Experience with automation tools (e.g., Ansible, Terraform).
  • Knowledge of cloud-native architectures and hybrid HPC/AI deployments.
  • Familiarity with observability tools (Prometheus, Grafana).
  • Background in AI/ML workflows and performance optimization.

Desired Candidate Profile

Experience: 5+ years

Location: KSA - Saudi Arabia

Contract Duration: 1 year

Overview: We are seeking a highly skilled HPC/AI Infrastructure Engineer to design, deploy, and manage advanced computing environments leveraging NVIDIA technologies, Kubernetes, and Linux systems. This role is critical to ensuring the performance, scalability, and reliability of AI workloads across GPU-accelerated clusters.

Keywords

  • HPC/AI Infrastructure Engineer
