antimatter15/alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
ChatLLaMA 📢 Open-source implementation of a LLaMA-based ChatGPT, runnable on a single GPU. 15x faster training process than ChatGPT.
Large Language Model (LLM) Inference API and Chatbot
A school for camelids
Inference code for Llama models
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
Run LLaMA (and Stanford-Alpaca) inference on Apple Silicon GPUs.