pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Data manipulation and transformation for audio signal processing, powered by PyTorch
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
The most powerful local music generation model that outperforms almost all commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
Generative models for conditional audio generation
A PyTorch-based Speech Toolkit
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
2 captures since 2026-05-25