jd-opensource/xllm
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
Fast, Flexible and Portable Structured Generation
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
Universal LLM Deployment Engine with ML Compilation
SGLang is a high-performance serving framework for large language models and multimodal models.
Democratizing Reinforcement Learning for LLMs
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
slime is an LLM post-training framework for RL Scaling.
1 capture since 2026-05-25