Audio Processing
Speech recognition, text-to-speech, and audio generation models
Last Updated
Nov 9, 2025
Total Votes
85,830
Total Models
10
Single Column
| Rank (UB) | Model | Score | Win% [?] | Votes | Organization | License | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 🥇 | Whisper Large v3 | 97 | 93 | 100,425 | OpenAI | Proprietary | 🥇 Whisper Large v3OpenAI Score97 Win%93 Votes100,425 | ||||||||
| 🥈 | ElevenLabs v2 | 95 | 90 | 99,982 | ElevenLabs | Proprietary | 🥈 ElevenLabs v2ElevenLabs Score95 Win%90 Votes99,982 | ||||||||
| 🥉 | MusicGen | 89 | 87 | 95,642 | Meta | Proprietary | 🥉 MusicGenMeta Score89 Win%87 Votes95,642 | ||||||||
| 4 | AudioCraft | 88 | 87 | 93,316 | Meta | Proprietary | 4 AudioCraftMeta Score88 Win%87 Votes93,316 | ||||||||
| 5 | Bark | 86 | 71 | 86,120 | Suno AI | Proprietary | 5 BarkSuno AI Score86 Win%71 Votes86,120 | ||||||||
| 6 | Voicebox | 84 | 84 | 93,009 | Meta | Proprietary | 6 VoiceboxMeta Score84 Win%84 Votes93,009 | ||||||||
| 7 | AudioLM | 83 | 82 | 89,163 | Proprietary | 7 AudioLMScore83 Win%82 Votes89,163 | |||||||||
| 8 | Vall-E | 81 | 67 | 86,500 | Microsoft | Proprietary | 8 Vall-EMicrosoft Score81 Win%67 Votes86,500 | ||||||||
| 9 | Tortoise TTS | 79 | 75 | 84,440 | Open Source | Proprietary | 9 Tortoise TTSOpen Source Score79 Win%75 Votes84,440 | ||||||||
| 10 | Coqui TTS | 76 | 74 | 86,096 | Coqui | Proprietary | 10 Coqui TTSCoqui Score76 Win%74 Votes86,096 | ||||||||