Sign in
← Back to search

turboderp-org/exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Stars
4,534
Forks
336
Commits
1459
Language
Python
Awesome lists
1

Similar repositories

meta-llama/llama

Inference code for Llama models

59434 stars
Python 1 awesome list

hiyouga/LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

71580 stars
Python 1 awesome list

getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

10961 stars
TypeScript 3 awesome lists

ikawrakow/ik_llama.cpp

llama.cpp fork with additional SOTA quants and improved performance

2579 stars
C++ 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:20

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2023-08-30
  • First commit: —
  • Last pushed: 2026-03-04
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.