ikawrakow/ik_llama.cpp
llama.cpp fork with additional SOTA quants and improved performance
Vim plugin for LLM-assisted code/text completion
Distribute and run LLMs with a single file.
LLM inference in C/C++
💃 Dance with Intelligence in Your Code. Minuet offers code completion as-you-type from popular LLMs including OpenAI, Gemini, Claude, Ollama, Llama.cpp, Codestral, and more.
A fast inference library for running LLMs locally on modern consumer-class GPUs
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It allows users to chat with LLMs, execute structured function calls, and get structured output. It also works with models not fine-tuned for JSON output and function calls.