bentoml/OpenLLM
Run any open-source LLM, such as DeepSeek and Llama, as an OpenAI-compatible API endpoint in the cloud.
A language for constraint-guided and efficient LLM programming.
Fast Multimodal LLM on Mobile Devices
A framework for few-shot evaluation of language models.
LLM training code for Databricks foundation models
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Running Llama 2 and other open-source LLMs locally with CPU inference for document Q&A