Molmo: Open Source Multimodal Vision-Language Models Outperform Gemini 1.5 and Claude 3.5
26 September 2024
Molmo: Open Source Multimodal Vision-Language Models Outperform Gemini 1.5 and Claude 3.5
Molmo is a new series of multimodal vision-language models (VLMs) created by researchers at the Allen Institute for AI and the University of Washington. The Molmo family outperforms many state-of-the-art…