Senior MLOps Engineer
Institute Of Foundation Models
Posted 30+ days ago
Send me Jobs like this
Nationality
Any Nationality
Gender
Not Mentioned
Vacancy
1 Vacancy
Job Description
Roles & Responsibilities
- Design and manage scalable ML infrastructure on AWS using EKS, EC2, RDS, S3, and IAM-based access control.
- Build and maintain Kubernetes deployments for LLM and TTS inference using Helm, ArgoCD, and Prometheus/Grafana monitoring.
- Implement and optimize model serving pipelines using vLLM, SGLang, TensorRT, or similar frameworks for high-throughput inference.
- Develop CI/CD and MLOps automation for data versioning, model validation, and deployment (GitHub Actions, Jenkins, or AWS CodePipeline).
- Integrate OpenWebUI, Gradio, or similar UIs for user-facing model demos and internal evaluation tools.
- Collaborate with ML researchers to productize models including TTS (e.g., ElevenLabs API), ASR (Whisper), and LLM-based chat systems.
- Ensure observability, cost optimization, and reliability of cloud resources across multiple environments.
- Contribute to internal tools for dataset curation, model monitoring, and retraining pipelines.
- Maintain infrastructure-as-code using Terraform and Helm charts for reproducibility and governance.
- Support real-time multimodal workloads (voice, text, vision) across inference clusters.
- 4+ years of experience in MLOps, DevOps, or Cloud Infrastructure Engineering for ML systems.
- Strong proficiency in Kubernetes, Helm, and container orchestration.
- Experience deploying ML models via vLLM, SGLang, TensorRT, or Ray Serve.
- Proficiency with AWS services (EKS, EC2, S3, RDS, CloudWatch, IAM).
- Solid experience with Python, Docker, Git, and CI/CD pipelines.
- Strong understanding of model lifecycle management, data pipelines, and observability tools (Grafana, Prometheus, Loki).
- Excellent collaboration skills with ML researchers and software engineers.
- Extensive Experience with vLLM, K8s, Elevenlabs, Whisper, Gradio/OpenWebUI, or custom TTS/ASR model hosting.
- Familiarity with multi-GPU scheduling, NCCL optimization, and HPC cluster integration.
- Knowledge of security, cost management, and network policy in multi-tenant Kubernetes clusters and cloudflare systems.
- Prior work in LLM deployment, fine-tuning pipelines, or foundation model research.
- Exposure to data governance and responsible AI operations in research or enterprise settings.
Company Industry
- Education
- Training
- Teaching
- Academics
Department / Functional Area
- Engineering
Keywords
- Senior MLOps Engineer
Disclaimer: Naukrigulf.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@naukrigulf.com
Institute Of Foundation Models
https://jobs.lever.co/ifm-us/5f0feab3-1309-4c24-a703-ed1993c448b9
Similar Jobs
Devops Engineer
VELODATA GLOBAL PRIVATE LIMITED
- 5 - 10 Years
- Doha - Qatar
Senior Devops Engineer
Phars Films
- 4 - 9 Years
- Dubai , Abu Dhabi , Sharjah - United Arab Emirates (UAE)
Devops Engineer
INNOVATION DIRECT EMPLOYMENT SERVICES L.L.C
- 8 - 10 Years
- Abu Dhabi - United Arab Emirates (UAE)
Solution Architect (Data & AI)
Confidential Company
- 8 - 15 Years
- Abu Dhabi - United Arab Emirates (UAE)
Senior DevOps Engineer (Arabic Speakers)
OMNIX INTERNATIONAL Co. L.L.C.
- 7 - 10 Years
- Dubai - United Arab Emirates (UAE)