Send me Jobs like this
Experience
4 - 6 Years
Job Location
Education
Masters in Computer Application(Computers), Master of Technology/Engineering(Computers)
Nationality
Any Nationality
Gender
Any
Vacancy
1 Vacancy
Job Description
Roles & Responsibilities
Role Summary
We are seeking a Senior Al Engineer-a senior Al professional with people leadership experience to build and operate in-house Al microservices that power core business workflows (e.g., OCR/intelligent document processing, NLP, computer vision, LLM services). You will lead a small, high-impact team to deliver scalable, stable, and cost-efficient services from prototype to production, with clear SLAs and observability.
What You'll Do (Key Responsibilities)
Own the services roadmap for OCR/IDP, NLP/CV, embeddings, and LLM microservices; translate business problems into well-designed APIs.
Lead and grow a delivery squad (3-8 engineers/DS/ML) with agile rituals, clear goals, and hands-on guidance.
Design & implement microservices (Python/FastAPI preferred) with 12-factor practices, strong interfaces, and backward-compatible versioning.
Productionize models (selection, fine-tuning, evaluation) using ONNX Runtime/TensorRT, Triton Inference Server, vLLM/Helm/Serving stacks as appropriate.
Establish MLOps & LLMOps: data/versioning (DVC/LakeFS), CI/CD (GitHub Actions/Azure DevOps), feature store, model registry, canary/blue-green, A/B and shadow testing.
Implement observability and SRE: SLOS/SLAs, tracing/metrics/logs (OpenTelemetry, Prometheus, Grafana), error budgets, and automated rollback.
Optimize performance & cost throughput/latency benchmarking, autoscaling, GPU/CPU right-sizing, caching, batching, prompt/tooling cost controls.
Desired Candidate Profile
Must-Have Qualifications
- Master's degree in AI/ML, Computer Science, or related field.
- 4-6 years total software/ML engineering; 3+ years building production AI services.
- Team leadership experience (direct people management or tech-lead for 3-8 engineers).
- Track record delivering 1-2 large, end-to-end AI projects (from data to deployed service) with real usage.
- Strong in Python and microservice frameworks (FastAPI/Flask), Docker, Kubernetes, and Git.
- Experience with message/streaming (Kafka/RabbitMQ), Redis, PostgreSQL, and vector databases (e.g., FAISS, Milvus, pgvector).
- Proficient with PyTorch / Transformers, OpenCV, PaddleOCR/Tesseract; familiarity with LLMs and prompt/guardrail techniques.
- Solid grasp of testing (unit/integration), CI/CD, and inference optimization (ONNX/TensorRT/quantization/batching).
- Cloud experience (preferably Azure; AWS/GCP also fine) and infrastructure-as-code (Terraform/Bicep).
- Arabic Language NLP Experience.
Nice-to-Have
- Experience with RAG architectures, retrieval evaluation, and safety/guardrails.
- Knowledge of event-driven architectures, API gateways, and service mesh (e.g., Istio/Linkerd).
- Familiarity with data engineering (Airflow/Prefect, Delta/Lakehouse) and feature stores (Feast).
- Experience with cost monitoring (FinOps) and SLO-driven governance.
Tech Stack (Indicative)
Languages/Frameworks: Python (FastAPI), Node.js/.NET (nice-to-have)
ML/AI: PyTorch, Hugging Face, OpenCV, Tesseract/PaddleOCR, ONNX Runtime, TensorRT, Triton Inference Server, vLLM
Data/Messaging: Kafka/RabbitMQ, Redis, PostgreSQL, Vector DB (Milvus/FAISS/pgvector)
MLOps/DevOps: GitHub Actions/Azure DevOps, Docker, Kubernetes, Helm, DVC, MLflow/Weights & Biases, OpenTelemetry, Prometheus, Grafana
Cloud: Azure (AKS, ACR, Key Vault, Event Hubs, Cosmos/PG), or AWS/GCP equivalents
Is there anything else you'd like me to extract from an image or a different section of this job description?
�
Employment Type
- Full Time
Company Industry
Department / Functional Area
Sundus Consultancy
Rec Agency
Mr Sufyan - Consultant
Abu Dhabi JeddahStreet,AL–Jubail,KSA, United Arab Emirates (UAE)