JavaScript is disabled!
Please enable JavaScript in your web browser!

Senior AI Engineer Services Sundus Consultancy

Posted 1 hrs ago

Experience

4 - 6 Years

Education

Masters in Computer Application(Computers), Master of Technology/Engineering(Computers)

Nationality

Any Nationality

Gender

Any

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

Role Summary

We are seeking a Senior Al Engineer-a senior Al professional with people leadership experience to build and operate in-house Al microservices that power core business workflows (e.g., OCR/intelligent document processing, NLP, computer vision, LLM services). You will lead a small, high-impact team to deliver scalable, stable, and cost-efficient services from prototype to production, with clear SLAs and observability.

What You'll Do (Key Responsibilities)

Own the services roadmap for OCR/IDP, NLP/CV, embeddings, and LLM microservices; translate business problems into well-designed APIs.

Lead and grow a delivery squad (3-8 engineers/DS/ML) with agile rituals, clear goals, and hands-on guidance.

Design & implement microservices (Python/FastAPI preferred) with 12-factor practices, strong interfaces, and backward-compatible versioning.

Productionize models (selection, fine-tuning, evaluation) using ONNX Runtime/TensorRT, Triton Inference Server, vLLM/Helm/Serving stacks as appropriate.

Establish MLOps & LLMOps: data/versioning (DVC/LakeFS), CI/CD (GitHub Actions/Azure DevOps), feature store, model registry, canary/blue-green, A/B and shadow testing.

Implement observability and SRE: SLOS/SLAs, tracing/metrics/logs (OpenTelemetry, Prometheus, Grafana), error budgets, and automated rollback.

Optimize performance & cost throughput/latency benchmarking, autoscaling, GPU/CPU right-sizing, caching, batching, prompt/tooling cost controls.

Desired Candidate Profile

Must-Have Qualifications

  • Master's degree in AI/ML, Computer Science, or related field.
  • 4-6 years total software/ML engineering; 3+ years building production AI services.
  • Team leadership experience (direct people management or tech-lead for 3-8 engineers).
  • Track record delivering 1-2 large, end-to-end AI projects (from data to deployed service) with real usage.
  • Strong in Python and microservice frameworks (FastAPI/Flask), Docker, Kubernetes, and Git.
  • Experience with message/streaming (Kafka/RabbitMQ), Redis, PostgreSQL, and vector databases (e.g., FAISS, Milvus, pgvector).
  • Proficient with PyTorch / Transformers, OpenCV, PaddleOCR/Tesseract; familiarity with LLMs and prompt/guardrail techniques.
  • Solid grasp of testing (unit/integration), CI/CD, and inference optimization (ONNX/TensorRT/quantization/batching).
  • Cloud experience (preferably Azure; AWS/GCP also fine) and infrastructure-as-code (Terraform/Bicep).
  • Arabic Language NLP Experience.

Nice-to-Have

  • Experience with RAG architectures, retrieval evaluation, and safety/guardrails.
  • Knowledge of event-driven architectures, API gateways, and service mesh (e.g., Istio/Linkerd).
  • Familiarity with data engineering (Airflow/Prefect, Delta/Lakehouse) and feature stores (Feast).
  • Experience with cost monitoring (FinOps) and SLO-driven governance.

Tech Stack (Indicative)

Languages/Frameworks: Python (FastAPI), Node.js/.NET (nice-to-have)

ML/AI: PyTorch, Hugging Face, OpenCV, Tesseract/PaddleOCR, ONNX Runtime, TensorRT, Triton Inference Server, vLLM

Data/Messaging: Kafka/RabbitMQ, Redis, PostgreSQL, Vector DB (Milvus/FAISS/pgvector)

MLOps/DevOps: GitHub Actions/Azure DevOps, Docker, Kubernetes, Helm, DVC, MLflow/Weights & Biases, OpenTelemetry, Prometheus, Grafana

Cloud: Azure (AKS, ACR, Key Vault, Event Hubs, Cosmos/PG), or AWS/GCP equivalents

Is there anything else you'd like me to extract from an image or a different section of this job description?

Employment Type

    Full Time

Department / Functional Area

Sundus Consultancy

Rec Agency

Mr Sufyan - Consultant

Abu Dhabi JeddahStreet,AL–Jubail,KSA, United Arab Emirates (UAE)

https://www.sundusrecruitment.com