meta-llama/llama
Inference code for Llama models
Running Llama 2 and other open-source LLMs locally on CPU for document Q&A
LLM inference in C/C++
A school for camelids
LangChain and prompt engineering tutorials on using Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, building CSV agents, and using retrieval QA chains to query the custom data. Projects that use a private LLM (Llama 2) for chatting with PDF files and for tweet sentiment analysis.
A fast inference library for running LLMs locally on modern consumer-class GPUs
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!