Sign in
← Back to search

lucidrains/vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Stars
25,205
Forks
3,487
Commits
398
Language
Python
Awesome lists
1

Similar repositories

huggingface/transformers.js

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

16033 stars
JavaScript 1 awesome list

NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

3799 stars
Python 1 awesome list

BlinkDL/RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

14538 stars
Python 3 awesome lists

FoundationVision/VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

8690 stars
Jupyter Notebook 1 awesome list

huggingface/transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

160951 stars
Python 5 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:05

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2020-10-03
  • First commit: —
  • Last pushed: 2026-05-19
  • Archived: no
  • Stack detected: —
  • License: MIT

AI development signals

No AI development config files detected.