microsoft/LLMLingua

[EMNLP'23, ACL'24] LLMLingua compresses the prompt and KV cache to speed up LLM inference and improve the model's perception of key information, achieving up to 20x compression with minimal performance loss.

  • Stars: 6,228
  • Forks: 385
  • Commits: 85
  • Language: Python
  • Awesome lists: 1

Similar repositories

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

28,241 stars, Python, 4 awesome lists

ikawrakow/ik_llama.cpp

llama.cpp fork with additional SOTA quants and improved performance

2,579 stars, C++, 1 awesome list

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

81,308 stars, Python, 3 awesome lists

curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain

LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.

1,241 stars, Jupyter Notebook, 2 awesome lists

langgptai/LangGPT

LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 Originator of the Structured Prompt 📌 Initiator of the Meta-Prompt 📌 The most popular paradigm for putting prompts into practice | Language of GPT: the pioneering framework for structured & meta-prompt design. 10,000+ ⭐ | Battle-tested by thousands of users worldwide. Created by 云中江树

12,116 stars, Jupyter Notebook, 3 awesome lists

Tracked growth

2 captures since 2026-05-23

Latest capture 2026-05-31 03:04

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

  • Created: 2023-07-07
  • First commit: 2023-07-07
  • Last pushed: 2026-04-08
  • Website: https://llmlingua.com/
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.