meta-llama/llama
Inference code for Llama models
Distribute and run LLMs with a single file.
Inference code for Llama models
Utilities intended for use with Llama models.
llama.cpp fork with additional SOTA quants and improved performance
The official Meta Llama 3 GitHub site
Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
1 capture since 2026-05-25
AI agent config detected
Key config paths
docs/AGENTS.md