Sign in
โ† Back to search

showlab/Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Stars
1,933
Forks
91
Commits
357
Language
Python
Awesome lists
1

Similar repositories

NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

3799 stars
Python 1 awesome list

PKU-YuanGroup/Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

12159 stars
Python 1 awesome list

Gen-Verse/MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models (dLLMs with block diffusion, mixed-CoT, unified RL)

1642 stars
Python 1 awesome list

guoyww/AnimateDiff

Official implementation of AnimateDiff.

12119 stars
Python 2 awesome lists

datajuicer/data-juicer

Data processing for and with foundation models! ๐ŸŽ ๐Ÿ‹ ๐ŸŒฝ โžก๏ธ โžก๏ธ๐Ÿธ ๐Ÿน ๐Ÿท

6474 stars
Python 1 awesome list

antgroup/echomimic

[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

4242 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:17

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2024-08-09
  • First commit: โ€”
  • Last pushed: 2026-01-08
  • Website: https://arxiv.org/abs/2408.12528
  • Archived: no
  • Stack detected: โ€”
  • License: Apache-2.0

AI development signals

No AI development config files detected.