ggml-org/llama.cpp
LLM inference in C/C++
llama.go is like llama.cpp in pure Golang!
LLM inference in C/C++
Utilities intended for use with Llama models.
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Inference code for Llama models
Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
llama.cpp fork with additional SOTA quants and improved performance
2 captures since 2026-05-27