NVlabs/VILAVILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud. Language: Python Stars: 2546 Forks: 203