Google has open-sourced PaliGemma, a Vision-Language Model (VLM) that grew out of the long-running PaLI research effort at Google Research. Part of the Gemma family, PaliGemma builds on PaLI's work on scalability, pre-training methods, and data mixtures. The release includes PaliGemma-3B, which pairs a SigLIP image encoder with a Gemma decoder and is available at 224, 448, and 896 px input resolutions. It can be fine-tuned on platforms such as Google Colab using the Hugging Face Transformers library.
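To illustrate what using the model looks like, here is a minimal inference sketch with the Transformers API; the checkpoint name and example image URL are illustrative assumptions rather than details from the announcement.

```python
# Minimal PaliGemma inference sketch with Hugging Face Transformers (>= 4.41).
import requests
from PIL import Image
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

model_id = "google/paligemma-3b-mix-224"  # 224 px variant; 448/896 variants also exist
model = PaliGemmaForConditionalGeneration.from_pretrained(model_id)
processor = AutoProcessor.from_pretrained(model_id)

# Example image URL is an assumption; substitute any local or remote image.
url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/tasks/car.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# PaliGemma is prompted with short task prefixes such as "caption en".
prompt = "caption en"
inputs = processor(text=prompt, images=image, return_tensors="pt")
input_len = inputs["input_ids"].shape[-1]

output = model.generate(**inputs, max_new_tokens=30)
# Decode only the newly generated tokens, skipping the prompt.
print(processor.decode(output[0][input_len:], skip_special_tokens=True))
```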
PaliGemma is out and you can finetune it on Google Colab in a matter of minutes. https://t.co/lBmwYsPXwF
Google's newly announced PaliGemma in @huggingface https://t.co/LrdfyPA1A8
A very very nice release by @Google : PaliGemma! It's a pure VLM, SigLip encoder + Gemma decoder, 3b, comes in 3 resolutions including 224, 448 and 896 px. And it's in @huggingface transformers to try it: https://t.co/Ou2NQuRfgW
PaLI is finally open-sourced as part of the Gemma family! PaLI has been our long-running VLM research project with colleagues in Google Research, where we explored scalability, pre-training methods, data mixtures, and more. See [1,2,3] for background. Super cool to see it go… https://t.co/dvbQWaXnyN
We just released PaliGemma-3B, a very capable Vision-Language Model. Do not waste any time, finetune it for your task: Code: https://t.co/V9wQU7jtmv Colab: https://t.co/aDGJd7Iz8z Kaggle: https://t.co/A5ZrnjDZni HF: https://t.co/Du52eHcXNh Vertex AI: https://t.co/qxK9Irgera
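For context on what such a fine-tune looks like in Transformers, here is a minimal sketch. The checkpoint name, the frozen vision tower, the learning rate, and the single-example training step are illustrative assumptions, not the official recipe from the linked Colab.

```python
# Minimal PaliGemma fine-tuning sketch with Hugging Face Transformers.
# Freezing the SigLIP vision tower is a common lightweight setup (assumption).
import torch
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

model_id = "google/paligemma-3b-pt-224"  # "pt" = pretrained base intended for fine-tuning
processor = AutoProcessor.from_pretrained(model_id)
model = PaliGemmaForConditionalGeneration.from_pretrained(model_id)

# Freeze the SigLIP vision tower; train only the remaining parameters.
for param in model.vision_tower.parameters():
    param.requires_grad = False

optimizer = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=2e-5
)

def train_step(image, prefix, suffix):
    """One gradient step on a single (image, prompt, target) example.

    The processor's `suffix` argument appends the target text and builds
    `labels` that mask out the prompt, so the loss is computed only on
    the target tokens.
    """
    inputs = processor(text=prefix, images=image, suffix=suffix,
                       return_tensors="pt")
    loss = model(**inputs).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    return loss.item()
```

In practice one would batch examples with a collate function and move model and inputs to an accelerator, but the core mechanics, prefix plus suffix in, loss over the target tokens out, are as sketched above.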