NVlabs/VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Language: Python
Stars: 2546
Forks: 203

Visit Website

NVlabs/VILA

GitHub Trending

https://github.com/trending/?since=daily&spoken_language_code=