NVIDIA-NeMo/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Repository profile
Repository updates
Tracked growth, recent movement, and commit velocity from stored repository snapshots.
Latest capture 2026-06-28 03:13
1 capture since 2026-06-28
Stars from baseline 0
All tracked data
Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.
Scanned 2026-06-28 03:13
pyproject.toml: python ecosystem, 0 dependencies
setup.py: python ecosystem, 0 dependencies
uv.lock: python ecosystem, 0 dependencies
scripts/tts_comparison_report/requirements.txt: python ecosystem, 0 dependencies
tools/ctc_segmentation/requirements.txt: python ecosystem, 0 dependencies
tools/nemo_forced_aligner/requirements.txt: python ecosystem, 0 dependencies
tools/speech_data_explorer/requirements.txt: python ecosystem, 0 dependencies
examples/voice_agent/client/package.json: javascript ecosystem, 0 dependencies
Searchable topics, generated tags, and stack labels that explain where this repository fits.
Agent instructions and tool configuration paths found in the repository tree.
AI agent config detected
Showing the first 24 paths. 1 more detected.
Nearest indexed repositories by embedding similarity.
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
A PyTorch-based Speech Toolkit
A generative speech model for daily dialogue.
SOTA Open Source TTS
No description.
https://docs.nvidia.com/nemo/speech/nightly/index.html