ggml-org/llama.cpp
LLM inference in C/C++
Python bindings for llama.cpp
LLM inference in C/C++
Access large language models from the command-line
llama.cpp fork with additional SOTA quants and improved performance
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Inference code for Llama models
Utilities intended for use with Llama models.