ggml-org/llama.cpp
LLM inference in C/C++
Small self-contained pure-Go web server with Lua, Teal, Markdown, Ollama, HTTP/2, QUIC, Redis, TypeScript, SQLite and PostgreSQL support ++
LLM inference in C/C++
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
llama.cpp fork with additional SOTA quants and improved performance
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.
Putting a brain behind `cat`🐈‍⬛ Integrating language models in the Unix commands ecosystem through text streams.
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
AI agent config detected
Key config paths
vendor/github.com/alecthomas/chroma/v2/AGENTS.md