NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

  • Stars: 3,799
  • Forks: 322
  • Commits: 134
  • Language: Python
  • Awesome lists: 1

Similar repositories

haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

24,838 stars · Python · 1 awesome list

lm-sys/FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

39,480 stars · Python · 3 awesome lists

EvolvingLMMs-Lab/lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

4,157 stars · Python · 1 awesome list

showlab/Show-o

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

1,933 stars · Python · 1 awesome list

salesforce/LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

11,227 stars · Jupyter Notebook · 1 awesome list

open-compass/VLMEvalKit

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

4,151 stars · Python · 2 awesome lists

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:11

[Chart: Stars history (total stars)]

[Chart: Commits history (default branch commits)]

Metadata

  • Created: 2024-02-23
  • First commit: —
  • Last pushed: 2026-03-12
  • Archived: no
  • Stack detected: —
  • License: Apache-2.0

AI development signals

No AI development config files detected.