google-ai-edge/LiteRT-LM

No description.

  • Stars: 5,187
  • Forks: 516
  • Commits: 1,645
  • Language: C++
  • Awesome lists: 1

Similar repositories

NVIDIA/TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

13,725 stars
Python · 2 awesome lists

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

524 stars
Go · 2 awesome lists

Lightning-AI/LitServe

A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.

3,883 stars
Python · 1 awesome list

google/gemma.cpp

Lightweight, standalone C++ inference engine for Google's Gemma models.

6,908 stars
C++ · 2 awesome lists

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

28,241 stars
Python · 4 awesome lists

langroid/langroid

Harness LLMs with Multi-Agent Programming

4,026 stars
Python · 5 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 20:58

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

  • Created: 2025-04-14
  • First commit: —
  • Last pushed: 2026-05-25
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.