LLaVA-1.6, an open-source multimodal model, has been released with improvements in reasoning, OCR, and world knowledge. It outperforms Gemini Pro on several benchmarks, supports higher-resolution inputs, and maintains the data efficiency of LLaVA-1.5, with training taking approximately one day on 32 A100s. Compared to LLaVA-1.5, it adds higher-resolution input, better OCR capability, and stronger conversational ability, and it integrates SGLang for efficient inference and deployment with Vicuna 1.5 as the base language model. LLaVA-1.6 has received positive feedback and is seen as a significant advancement in open-source models.
Welcome to the era of open-source multimodal models, indeed! This is using: @ollama v0.1.23 (https://t.co/XZ1f5okUVg) Ollama Web UI v1.0.0-alpha.61 (https://t.co/IaG96VAhkZ) LLaVA 1.6 (https://t.co/4uO7Qx3FtI) Running on my Ubuntu "Server" with an old GTX 1650 Nice work… https://t.co/taiFuRReFR
The open-source LLaVA project just released LLaVA-1.6-34B. As claimed, LLaVA-1.6 even surpasses Gemini Pro on several benchmarks. Here is the complete breakdown (plus 2 live tests) 🧵 https://t.co/xOUg1oKS5Q
Very cool… I love how easy @ollama makes it to post images to these multimodal models https://t.co/kwcn9hxTh5
LLaVA 1.6 from @imhaotian has been released with improved resolution support, visual reasoning, and OCR capabilities, all while maintaining minimalist design and data efficiency. https://t.co/396xMviPP9
LLaVA v1.6 is out, pushing the limits of open multimodal models! We're glad to see two of our projects contribute to LLaVA: - SGLang for efficient inference and deployment - Vicuna 1.5 as the base language model Check out the demo at https://t.co/YmhKjWmOTi, served with SGLang!… https://t.co/FvughNOl88
LLaVA-1.6 is out. Open-source models are making great progress https://t.co/DpqQ8gGvDp
Congrats to @imhaotian + @yong_jae_lee and team!!🥳 LLaVA-1.6 (an open source model!) beats Gemini Pro and comes close to GPT-4V on several benchmarks. https://t.co/mxFiSQHDcC
Boom! LLaVA-1.6, with improved reasoning, OCR, and world knowledge. It supports higher-res inputs, more tasks, and exceeds Gemini Pro on several benchmarks! It is trained in ~1 day on 32 A100s. Model: https://t.co/pqUjcp3A92
LLaVA 1.6 is out! 🥳 - Outperforms Gemini PRO on some benchmarks - Higher resolution than LLaVA 1.5 (up to 4x more pixels!) - Better OCR capability and instruction-following - More conversational Models: https://t.co/200Qffi6fM Blog: https://t.co/nh5TaTHH3W https://t.co/kYrE7O2V1O
🚀We are thrilled to release LLaVA-1.6, with improved reasoning, OCR, and world knowledge. It supports higher-res inputs, more tasks, and exceeds Gemini Pro on several benchmarks! 🤯 It maintains the data efficiency of LLaVA-1.5, and LLaVA-1.6-34B is trained ~1 day with 32 A100s.… https://t.co/nGRpLX8FQv