Microsoft Releases Florence-2 Vision Model with Zero-S

Florence-2 WebGPU: The vision foundation model from @Microsoft - running locally in your browser w/ Transformers.js https://t.co/Z7L5Dp5PAs

Rohan Paul@rohanpaul_ai

4 d

Florence-2 WebGPU: vision foundation model running locally in your browser w/ Transformers.js https://t.co/NrP3S0VSsP

Luis C@lucataco93

4 d

Florence-2: a new vision foundation model by Microsoft It supports tasks like image captioning, optical character recognition, object detection and more Try it out on @replicate 👇 https://t.co/4wMiLNOkYD

SkalskiP@skalskip92

4 d

OMG. I was waiting for this to happen. Florence-2 is running in browser. @xenovacom can I run any fine-tuned model or only pre-trained ones? https://t.co/7PSc6MHhvI

Luis C@lucataco93

4 d

Florence-2: a new vision foundation model by Microsoft It supports tasks like image captioning, optical character recognition, object detection, and more! Try it out on @replicate 👇 https://t.co/08HmFL6Lfr

Xenova@xenovacom

4 d

Florence-2, the new vision foundation model by Microsoft, can now run 100% locally in your browser on WebGPU, thanks to Transformers.js! 🤗🤯 It supports tasks like image captioning, optical character recognition, object detection, and many more! 😍 WOW! Demo (+ source code) 👇 https://t.co/TyKnp8XUdN

Boris Dayma 🖍️@borisdayma

4 d

Nice progress on the training of CapPa 🥳 The model now performs nice OCR! The prediction is often better than the actual caption 🤯 Really excited about the results!!! https://t.co/IYOGrwRKke

SkalskiP@skalskip92

4 d

fine-tune Florence-2 on custom object detection dataset (super technical blog post; video tutorial coming this week) - format dataset - configure LoRA for optimized training - train and benchmark fine-tuned model link: https://t.co/p6VUROUw6t ↓ key takeaways https://t.co/uBlXyQhuWK

Clarifai@clarifai

5 d

Object Detection Using Florence-2 🔥 The recently released Florence-2 model demonstrates strong zero-shot capabilities across tasks such as captioning, object detection, grounding, and segmentation. Below is an example of using the model for an object detection task and getting… https://t.co/PROC1h7T3M

Wolfram Ravenwolf 🐺🐦‍⬛@WolframRvnwlf

6 d

Yes, the Florence-2 vision is very good. When I tested it, I thought it accessed the image's metadata because the description was so close to the original prompt used for its creation. But as I continued testing, I realized it's just that good! Experiment with the settings, tho… https://t.co/awgADF6tpf

Niels Rogge@NielsRogge

6 d

New blog post regarding how to fine-tune Florence-2, the small and powerful VLM by @Microsoft, on a custom dataset (DocVQA) in plain @PyTorch: https://t.co/V7mwr53CYv

Andi Marafioti@andi_marafioti

6 d

🚀 Fine-tune Florence-2 on any task! We are releasing fine tuning scripts for microsoft's Florence-2, alongside with a walkthrough blogspot, a space demo, and a Colab notebook. @mervenoyann @skalskip92 🧵 https://t.co/iZH86DekSE

merve@mervenoyann

6 d

Fine-tune Florence-2 on any task 🔥 Today we release a notebook and a walkthrough blog on fine-tuning Florence-2 on DocVQA dataset @andi_marafioti @skalskip92 Keep reading ⇓ https://t.co/vv28Efaf4g

Rohan Paul@rohanpaul_ai

7 d

Microsoft's small Florence-2 models are excellent for Visual Question Answering (VQA): On-par and beating all LLaVA-1.6 variants. While Florence-2 isn't SOTA in object detection, it's remarkably good in Visual Question Answering (VQA) and Referring Expression Comprehension… https://t.co/FZmEXsLskR

Deep_In_Depth@Deep_In_Depth

8 d

Microsoft Releases Florence-2: A Novel Vision Foundation Model with a Unified, Prompt-based Representation for a Variety of Computer Vision and Vision-Language Tasks #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision https://t.co/5cxQH65QjZ

Similar Stories

Microsoft Releases Florence-2 Vision Model with Zero-Shot Capabilities, WebGPU Support

Similar Stories

Sources

Microsoft Releases Florence-2 Vision Model with Zero-Shot Capabilities, WebGPU Support