FunAudioLLM/CosyVoice
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Open-Source Frontier Voice AI
A generative speech model for daily dialogue.
Transcribe on your own!
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.