RVC-Boss/GPT-SoVITS
As little as 1 minute of voice data can be used to train a good TTS model (few-shot voice cloning).
SoftVC VITS Singing Voice Conversion
Generative Models by Stability AI
Focus on prompting and generating
A generative speech model for daily dialogue.
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"