WavLM Large is an open-source embedding model developed by Microsoft, pretrained on 94,000 hours of speech audio, including Libri-Light, GigaSpeech, and VoxPopuli. It is best suited to generating embeddings for speech processing tasks, with a focus on both spoken content modeling and speaker identity preservation.
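As a rough illustration of how such embeddings are typically used downstream, the sketch below mean-pools frame-level vectors into one utterance-level vector and compares utterances by cosine similarity, a common pattern for speaker comparison. It does not call the model: the 4-dimensional frame vectors are made-up stand-ins for model output, and `mean_pool` and `cosine_similarity` are hypothetical helper names, not part of any WavLM API.

```python
import math


def mean_pool(frames):
    """Average a list of frame-level vectors into one utterance-level vector."""
    if not frames:
        raise ValueError("need at least one frame")
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Assumption: toy frame embeddings standing in for real model output.
utt_a = [[1.0, 0.0, 2.0, 0.0], [3.0, 0.0, 2.0, 0.0]]
utt_b = [[2.0, 0.0, 2.0, 0.0]]
utt_c = [[0.0, 1.0, 0.0, 3.0], [0.0, 3.0, 0.0, 1.0]]

emb_a = mean_pool(utt_a)
emb_b = mean_pool(utt_b)
emb_c = mean_pool(utt_c)

same = cosine_similarity(emb_a, emb_b)  # high: the pooled vectors point the same way
diff = cosine_similarity(emb_a, emb_c)  # low: the pooled vectors are orthogonal
```

With real model output, the same two steps would apply to much longer frame sequences and higher-dimensional vectors. Mean pooling is only one simple choice, and a task-specific pooling or scoring layer may work better.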
Input
Output
Context: -
Max Output: -
Parameters: -
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026