  • Stars: 2,927
  • Forks: 175
  • Commits: —
  • Language: Python
  • Awesome lists: 1

Similar repositories

NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

3,799 stars
Python · 1 awesome list

QwenLM/Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

19,238 stars
Jupyter Notebook · 1 awesome list

zai-org/CogVideo

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

12,755 stars
Python · 1 awesome list

showlab/Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

1,933 stars
Python · 1 awesome list

microsoft/Magma

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

1,927 stars
Python · 1 awesome list

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

24,838 stars
Python · 1 awesome list

Tracked growth

1 capture since 2026-05-27

Latest capture 2026-05-27 12:42

Stars history (chart: total stars)

Commits history (chart: default branch commits)

Metadata

  • Created: 2023-09-26
  • First commit: —
  • Last pushed: 2025-05-26
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.

Appears in