Sign in
← Back to search

FMInference/FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Stars
9,367
Forks
590
Commits
Language
Python
Awesome lists
1

Similar repositories

bobazooba/xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

408 stars
Python 1 awesome list

Tiiny-AI/PowerInfer

High-speed Large Language Model Serving for Local Deployment

9494 stars
C++ 1 awesome list

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

41386 stars
Python 4 awesome lists

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python 2 awesome lists

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:45

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2023-02-15
  • First commit: —
  • Last pushed: 2024-10-28
  • Archived: yes
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.