jackaduma/Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune the Vicuna LLM with LoRA and RLHF on consumer hardware. Implements RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture: essentially ChatGPT, but built on Vicuna.

  • Stars: 221
  • Forks: 18
  • Commits: —
  • Language: Python
  • Awesome lists: 1

Similar repositories

jianzhnie/LLamaTuner

Easy and efficient finetuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.

620 stars
Python 1 awesome list

lm-sys/FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

39480 stars
Python 3 awesome lists

OptimalScale/LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

8486 stars
Python 2 awesome lists

artidoro/qlora

QLoRA: Efficient Finetuning of Quantized LLMs

10914 stars
Jupyter Notebook 1 awesome list

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:42

Stars history (chart of total stars)

Commits history (chart of default branch commits)

Metadata

  • Created: 2023-04-22
  • First commit: —
  • Last pushed: 2024-05-20
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.

Appears in