pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Auralisation of learned features in CNN (for audio)
Data manipulation and transformation for audio signal processing, powered by PyTorch
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
A generative speech model for daily dialogue.
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
3 captures since 2026-05-22