NVlabs/VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Repository profile
[IEEE OJSP'2021] "RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content", Zhengzhong Tu, Xiangxu Yu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Tracked growth, recent movement, and commit velocity from stored repository snapshots.
Latest capture 2026-06-17 10:31
1 capture since 2026-06-17
Stars from baseline 0
Frameworks, package managers, ecosystems, and dependency manifests found during catalog scans.
Scanned 2026-06-17 10:31
Searchable topics, generated tags, and stack labels that explain where this repository fits.
Agent instructions and tool configuration paths found in the repository tree.
Nearest indexed repositories by embedding similarity.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
No description.
Open source reference implementation of ITU-T P.1204.3
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
VQMT: Video Quality Measurement Tool. Fast implementations of the following objective image quality metrics: PSNR, SSIM, MS-SSIM, VIFp, PSNR-HVS and PSNR-HVS-M.
An introduction to volumetric video streaming using MPEG V-PCC, which includes tutorials, code samples, and links to relevant resources.