NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
LAVIS - A One-stop Library for Language-Vision Intelligence
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
A high-throughput and memory-efficient inference and serving engine for LLMs
OpenLens AI: A Fully Autonomous Multimodal Research Agent| OpenLens AI:全自主多模态科研智能体
Low-code framework for building custom LLMs, neural networks, and other AI models
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
3 captures since 2026-05-22