FranxYao/chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Building DeepSeek R1 from Scratch
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Practical course about Large Language Models.
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Simple RL training for reasoning
No description.
🦖 X—LLM: Cutting Edge & Easy LLM Finetuning
1 capture since 2026-05-27