logo

Data Scientist R&D - Speech & Voice AI (Low-Resource Languages)


Décryptage du poste par Postule AI

Introduction to the Position

As a Data Scientist – R&D (Speech-focused) at ToumAI, your mission will be to research, develop and optimize speech and voice intelligence models that power our Voice AI platforms, SDKs and on-device solutions. You will work on end-to-end speech pipelines, from data and annotation strategies to model training, evaluation and optimization, with strong constraints on accuracy, latency, robustness and footprint, particularly for low-resource languages and dialects such as Moroccan Darija.

Your work will directly impact production systems used in voicebots, QMS, VoC analytics, APIs and on-device SDKs, bridging applied research and real-world deployment.

Your Role

  • Research, design and train speech-related models (STT components, language identification, diarization, emotion recognition, speech segmentation, code-switching)
  • Improve transcription accuracy and robustness in noisy, conversational and real-world audio conditions
  • Work on multilingual and dialectal speech data with a focus on low-resource settings
  • Define and refine data collection, annotation strategies and quality control processes for speech datasets
  • Design evaluation protocols and metrics aligned with product and business requirements
  • Collaborate with platform and ML engineers to integrate models into real-time and batch pipelines
  • Optimize models for latency, memory footprint and inference efficiency (quantization, pruning, distillation)
  • Analyze model errors and failure modes, propose corrective strategies and iterate
  • Document experiments, architectures and learnings to support long-term R&D
  • Stay up to date with state-of-the-art research in speech processing and applied Voice AI

Your Qualifications

  • Strong curiosity for speech processing and applied AI research
  • Solid experience in machine learning and data science, with a focus on speech or audio-based models
  • Hands-on experience with PyTorch (or equivalent deep learning frameworks)
  • Understanding of speech tasks such as ASR/STT, diarization, VAD, language identification or emotion recognition
  • Experience working with audio data pipelines and annotation workflows
  • Familiarity with model evaluation, error analysis and benchmarking in speech systems
  • Interest in low-latency and on-device inference constraints is a strong plus
  • Experience with multilingual, low-resource or dialectal speech is highly valued
  • Familiarity with LLM-based speech pipelines or speech-to-text post-processing is a plus
  • Strong analytical mindset and ability to collaborate with engineering teams
  • Proficiency in French or Arabic is a plus; English required for technical work

Benefits

At ToumAI, you will work at the intersection of applied research and production Voice AI. You will:

  • Contribute to core R&D on speech and voice intelligence
  • Work on real datasets and deployed systems
  • Influence model choices, architectures and optimization strategies
  • Help shape the future of Voice AI for low-resource languages

If you enjoy pushing speech models from research to real-world deployment under tight constraints, this role is for you.

Recruitment Process

  1. First conversation with the HR team to get to know you better and introduce you to ToumAI
  2. A technical test or applied research case (speech-focused)
  3. Role-specific interview with your future manager / R&D lead
  4. Final meeting with top management (if needed)

Cette description d'emploi a pu être reformatée par Postule pour améliorer sa lisibilité et sa présentation. Le contenu et les informations restent fidèles à l'offre d'emploi originale. .