Senior Applied ML Engineer

Nile Bits

Posted on 20 May

Experience

5 - 10 Years

Job Location

Cairo - Egypt

Education

Any Graduation

Nationality

Any Nationality

Gender

Not Mentioned

Vacancy

1 Vacancy

Job Description

Roles & Responsibilities

Benchmark and evaluate TTS and ASR models using Arabic-specific test sets, measuring metrics such as Word Error Rate (WER), naturalness, and dialect coverage.

Fine-tune generative models for voice cloning, zero-shot speaker adaptation, and speech synthesis.

Build and maintain Arabic-focused data pipelines, including:

  • Audio collection and preprocessing
  • Diacritization (Tashkil)
  • Data cleaning and augmentation

Optimize model inference for production environments using:

  • Quantization
  • KV-cache tuning
  • Streaming inference techniques

Integrate and evaluate complete speech-to-speech conversational pipelines.

Conduct experiments based on recent research papers and convert findings into production-ready solutions.

Collaborate with engineering and product teams to deploy robust and scalable speech systems.

Desired Candidate Profile

Required Qualifications

  • 5+ years of experience in Machine Learning, Applied AI, or AI Research.
  • Strong programming skills in Python.
  • Extensive hands-on experience with PyTorch and the Hugging Face ecosystem.
  • Proven experience training and fine-tuning neural models for:
    • Text-to-Speech (TTS)
    • Automatic Speech Recognition (ASR)
    • Audio codecs
  • Deep understanding of modern speech architectures such as:
    • Whisper
    • Conformer
    • HiFi-GAN
    • Diffusion-based models
  • Experience with audio processing techniques including:
    • Voice Activity Detection (VAD)
    • Speaker Diarization
    • Neural Vocoders
  • Demonstrated ability to implement and adapt research papers into practical production experiments.
  • Strong understanding of Arabic language challenges, including:
    • Diacritization (Tashkil)
    • Dialectal variations
    • Code-switching
  • Experience with inference optimization techniques such as:
    • Quantization
    • Streaming inference
    • NVIDIA TensorRT

Preferred Qualifications

  • Experience developing custom NVIDIA CUDA kernels for high-performance model inference.
  • Familiarity with speculative decoding and other advanced acceleration techniques.
  • Experience deploying models at scale in cloud or GPU-based production environments.
  • Contributions to open-source speech or machine learning projects.

Company Industry

Department / Functional Area

Keywords

  • Senior Applied ML Engineer (Speech & Audio)

Disclaimer: Naukrigulf.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@naukrigulf.com

Nile Bits

Join a cutting-edge initiative focused on building advanced AI voice infrastructure for Arabic-speaking markets. The project involves developing state-of-the-art Arabic speech technologies, including:

  • Natural Text-to-Speech (TTS)
  • Real-Time Automatic Speech Recognition (ASR)
  • End-to-End Speech-to-Speech Conversational Systems

The solutions are tailored to regional Arabic dialects, including Egyptian, Gulf, Levantine, and others.

Read More

https://jobs.smartrecruiters.com/NileBits/744000127036999-senior-applied-ml-engineer-speech-audio-