ggml-org/llama.cpp
LLM inference in C/C++
llama.cpp fork with additional SOTA quants and improved performance
LLM inference in C/C++
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Fast Multimodal LLM on Mobile Devices
Distribute and run LLMs with a single file.
Inference code for Llama models
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.