NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Build high-performance AI models with modular building blocks
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
Fast, small, secure, local-first personal AI assistant infrastructure: one Rust binary for tools, memory, channels, providers, and sandboxed autonomy.
Go ahead and axolotl questions
Geometric Computer Vision Library for Spatial AI
No description.