jd-opensource/xllm
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
A high-performance inference engine for LLM, VLM, DiT and REC models, optimized for diverse AI accelerators.
A high-throughput and memory-efficient inference and serving engine for LLMs
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Achieve state of the art inference performance with modern accelerators on Kubernetes
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
2 captures since 2026-05-25
docs/requirements.txt
· python · 21 dependencies
3rdparty/protobuf/composer.json
· php · 1 dependencies
3rdparty/trt_fused_multihead_attention/CMakeLists.txt
· c-cpp · 0 dependencies
docs/index/package.json
· javascript · 7 dependencies
rtp_llm/flexlb/pom.xml
· java · 68 dependencies
3rdparty/protobuf/global.json
· dotnet · 0 dependencies
3rdparty/protobuf/cmake/CMakeLists.txt
· c-cpp · 2 dependencies
3rdparty/protobuf/examples/CMakeLists.txt
· c-cpp · 1 dependencies
3rdparty/protobuf/examples/pubspec.yaml
· dart · 1 dependencies
3rdparty/protobuf/java/pom.xml
· java · 12 dependencies
AI agent config detected
Key config paths
rtp_llm/flexlb/CLAUDE.md
rtp_llm/flexlb/flexlb-sync/CLAUDE.md