meta-llama/llama
Inference code for Llama models
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.
LLM inference in C/C++
Utilities intended for use with Llama models.
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
llama.cpp fork with additional SOTA quants and improved performance