2noise/ChatTTS
A generative speech model for daily dialogue.
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
A generative speech model for daily dialogue.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Open-Source Frontier Voice AI
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
1 capture since 2026-05-27