Microsoft has released Florence-2, a novel vision foundation model capable of a wide variety of tasks, including OCR and visual question answering. The model is open-source and available for fine-tuning on custom datasets.
Yes, the Florence-2 vision is very good. When I tested it, I thought it accessed the image's metadata because the description was so close to the original prompt used for its creation. But as I continued testing, I realized it's just that good! Experiment with the settings, tho… https://t.co/awgADF6tpf
New blog post regarding how to fine-tune Florence-2, the small and powerful VLM by @Microsoft, on a custom dataset (DocVQA) in plain @PyTorch: https://t.co/V7mwr53CYv
🚀 Fine-tune Florence-2 on any task! We are releasing fine-tuning scripts for Microsoft's Florence-2, along with a walkthrough blog post, a Space demo, and a Colab notebook. @mervenoyann @skalskip92 🧵 https://t.co/iZH86DekSE
Fine-tune Florence-2 on any task 🔥 Today we release a notebook and a walkthrough blog on fine-tuning Florence-2 on the DocVQA dataset @andi_marafioti @skalskip92 Keep reading ⇓ https://t.co/vv28Efaf4g
Microsoft's small Florence-2 models are excellent for Visual Question Answering (VQA): on par with or beating all LLaVA-1.6 variants. While Florence-2 isn't SOTA in object detection, it's remarkably good at VQA and Referring Expression Comprehension… https://t.co/FZmEXsLskR
Microsoft Releases Florence-2: A Novel Vision Foundation Model with a Unified, Prompt-based Representation for a Variety of Computer Vision and Vision-Language Tasks #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision https://t.co/5cxQH65QjZ
Florence-2 is out and is now available on the Clarifai Platform! 🔥 Florence-2 is the new lightweight vision-language model open-sourced by Microsoft. Here are the key takeaways of the model: • Handles a variety of vision and vision-language tasks through a prompt-based… https://t.co/KuNdDOMUpo
Finally, a good handwriting recognition tool? I'm impressed by Microsoft's latest vision model, Florence-2. The results are really good, boasting a remarkably low error rate, as you can see with this letter from George W. Bush to Bill Clinton! 👉 Try it out here:… https://t.co/aiFGIIzJa4
Stunning results from the @Microsoft Florence-2 open model. You can try your own text recognition and other vision tasks as well! https://t.co/LP2xWvoeY7
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks - a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks. https://t.co/IlPBw3z9na
Run Florence-2 on Your Local Machine: Florence-2 is a new vision model from Microsoft that excels at all kinds of vision tasks (text recognition, object detection, ...). I ported this Hugging Face Gradio app to run on all machines (Mac, Linux, Windows) and wrote a one-click launcher. https://t.co/oKpHIDpRsG https://t.co/hHmFbH9A6Q
Florence works surprisingly well out of the box for OCR on challenging text (old French ads with unusual typography). Will get even better with fine-tuning. https://t.co/Grb6PhRJ9p https://t.co/ujOD1veoY8
looks like Florence-2 is really good at OCR… it’s so cool to have models like this under MIT license https://t.co/aXV1rpfjyu
Consume-Florence2 is on the way. Will push to my GitHub in the next few hours. Here is the Hugging Face repo if interested: https://t.co/76osohtjG7 #airesearch
Florence-2 is finally out! 1 model; 10+ computer vision tasks! ↓ key takeaways are listed below. see my blog post for details. link: https://t.co/X03LCsjSOH https://t.co/alYTIKnhYT
Florence-2 is a new vision foundation model by MSFT capable of a wide variety of tasks 🤯 Let's unpack! 🧶 Demo, models and more on the next one 🐣 https://t.co/Frf0blc99M
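Several of the posts above describe Florence-2 as prompt-based: one model, with the task selected by a special token in the text prompt. The sketch below shows roughly what that looks like with the Hugging Face transformers library. It follows the usage pattern published on the model card at release; the helper names (`TASK_PROMPTS`, `build_prompt`, `run_florence2`) and the choice of the `microsoft/Florence-2-base` checkpoint are illustrative assumptions, not anything taken from the posts. Loading the model downloads weights and needs `trust_remote_code=True`, and the interface may differ in later library versions.

```python
# Hypothetical helper names; the task tokens are those listed on the model card.
TASK_PROMPTS = {
    "caption": "<CAPTION>",
    "detailed_caption": "<DETAILED_CAPTION>",
    "ocr": "<OCR>",
    "object_detection": "<OD>",
}


def build_prompt(task: str) -> str:
    """Map a human-readable task name to its Florence-2 task token."""
    if task not in TASK_PROMPTS:
        raise ValueError(f"Unknown task {task!r}; expected one of {sorted(TASK_PROMPTS)}")
    return TASK_PROMPTS[task]


def run_florence2(image_path: str, task: str, model_id: str = "microsoft/Florence-2-base"):
    """Run one task on one image and return the parsed result.

    Imports are done lazily so that build_prompt can be used without
    installing the heavy dependencies.
    """
    from PIL import Image
    from transformers import AutoModelForCausalLM, AutoProcessor

    # Assumption: the checkpoint ships custom code, hence trust_remote_code=True.
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

    image = Image.open(image_path).convert("RGB")
    prompt = build_prompt(task)

    inputs = processor(text=prompt, images=image, return_tensors="pt")
    generated_ids = model.generate(
        input_ids=inputs["input_ids"],
        pixel_values=inputs["pixel_values"],
        max_new_tokens=1024,
        num_beams=3,
    )
    # Keep special tokens: the post-processor uses them to parse boxes and labels.
    generated_text = processor.batch_decode(generated_ids, skip_special_tokens=False)[0]
    return processor.post_process_generation(
        generated_text, task=prompt, image_size=(image.width, image.height)
    )
```

Calling `run_florence2("letter.jpg", "ocr")` on a local image (the file name is a placeholder) would return a dictionary keyed by the task token; switching to another task only changes the prompt, not the model.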