Data Scientist R&D - Speech & Voice AI (Low-Resource Languages)

Décryptage du poste par Postule AI

Résumé du rôle

ToumAI is seeking a Data Scientist specializing in speech and voice AI R&D to research, develop, and optimize speech models for production Voice AI platforms. The role focuses on end-to-end speech pipelines including STT, diarization, emotion recognition, and language identification, with emphasis on low-resource languages like Moroccan Darija. You'll work on real-world deployment challenges including accuracy, latency, and on-device optimization, bridging applied research with production systems used in voicebots, APIs, and SDKs.

Exigences clés (estimation)

Research, design and train speech models (STT, language ID, diarization, emotion recognition, code-switching)
Improve transcription accuracy in noisy, conversational conditions and work on multilingual/dialectal speech data
Define data collection, annotation strategies and quality control processes for speech datasets
Collaborate with ML engineers to integrate models into real-time pipelines and optimize for latency and efficiency
Analyze model errors, design evaluation protocols, and document experiments to support long-term R&D

Compétences

Solid experience in machine learning and data science with focus on speech or audio-based modelsHands-on experience with PyTorch or equivalent deep learning frameworksUnderstanding of speech tasks: ASR/STT, diarization, VAD, language identification, emotion recognitionExperience with audio data pipelines and annotation workflowsFamiliarity with model evaluation, error analysis and benchmarking in speech systemsStrong analytical mindset and ability to collaborate with engineering teamsEnglish proficiency required; French or Arabic is a plus

Niveau estimé

This role requires solid experience in machine learning and data science with a focus on speech/audio models, suggesting 3-5 years of relevant experience. The position demands hands-on expertise with deep learning frameworks and speech tasks, positioning it beyond junior level but not requiring extensive senior leadership.

Attributs détectés par l’IA (si absents de l’offre)

Type de poste: CDI
Mode de travail: Sur site
Niveau d'expérience: 3–5 ans
Fonction: Data / BI
Secteur: Informatique / IT
Niveau d'études: Master / Bac+5
Langues: Anglais, Français, Arabe

Cette analyse a été générée automatiquement par Postule AI à partir de l'offre.

Introduction to the Position

As a Data Scientist – R&D (Speech-focused) at ToumAI, your mission will be to research, develop and optimize speech and voice intelligence models that power our Voice AI platforms, SDKs and on-device solutions. You will work on end-to-end speech pipelines, from data and annotation strategies to model training, evaluation and optimization, with strong constraints on accuracy, latency, robustness and footprint, particularly for low-resource languages and dialects such as Moroccan Darija.

Your work will directly impact production systems used in voicebots, QMS, VoC analytics, APIs and on-device SDKs, bridging applied research and real-world deployment.

Your Role

Research, design and train speech-related models (STT components, language identification, diarization, emotion recognition, speech segmentation, code-switching)
Improve transcription accuracy and robustness in noisy, conversational and real-world audio conditions
Work on multilingual and dialectal speech data with a focus on low-resource settings
Define and refine data collection, annotation strategies and quality control processes for speech datasets
Design evaluation protocols and metrics aligned with product and business requirements
Collaborate with platform and ML engineers to integrate models into real-time and batch pipelines
Optimize models for latency, memory footprint and inference efficiency (quantization, pruning, distillation)
Analyze model errors and failure modes, propose corrective strategies and iterate
Document experiments, architectures and learnings to support long-term R&D
Stay up to date with state-of-the-art research in speech processing and applied Voice AI

Your Qualifications

Strong curiosity for speech processing and applied AI research
Solid experience in machine learning and data science, with a focus on speech or audio-based models
Hands-on experience with PyTorch (or equivalent deep learning frameworks)
Understanding of speech tasks such as ASR/STT, diarization, VAD, language identification or emotion recognition
Experience working with audio data pipelines and annotation workflows
Familiarity with model evaluation, error analysis and benchmarking in speech systems
Interest in low-latency and on-device inference constraints is a strong plus
Experience with multilingual, low-resource or dialectal speech is highly valued
Familiarity with LLM-based speech pipelines or speech-to-text post-processing is a plus
Strong analytical mindset and ability to collaborate with engineering teams
Proficiency in French or Arabic is a plus; English required for technical work

Benefits

At ToumAI, you will work at the intersection of applied research and production Voice AI. You will:

Contribute to core R&D on speech and voice intelligence
Work on real datasets and deployed systems
Influence model choices, architectures and optimization strategies
Help shape the future of Voice AI for low-resource languages

If you enjoy pushing speech models from research to real-world deployment under tight constraints, this role is for you.

Recruitment Process

First conversation with the HR team to get to know you better and introduce you to ToumAI
A technical test or applied research case (speech-focused)
Role-specific interview with your future manager / R&D lead
Final meeting with top management (if needed)

Cette description d'emploi a pu être reformatée par Postule pour améliorer sa lisibilité et sa présentation. Le contenu et les informations restent fidèles à l'offre d'emploi originale. .