OpenAI has achieved a significant milestone with the introduction of real-time in-browser speech recognition using its Whisper model. This breakthrough, known as Whisper WebGPU, allows for speech-to-text functionality that operates entirely on-device within a web browser. The technology leverages Transformers.js and ONNX Runtime Web to provide multilingual transcription across 100 different languages. Developed by a Hugging Face Engineer, known by the nickname ‘Xenova’, this development is seen as a game changer, particularly due to its ability to run locally without the need for cloud-based processing.
Whisper WebGPU: Real-Time in-Browser Speech Recognition with OpenAI Whisper https://t.co/PevJo9zWns #WhisperWebGPU #SpeechRecognition #AIinBrowser #OpenAIWhisper #RealTimeAI #ai #news #llm #ml #research #ainews #innovation #artificialintelligence #machinelearning #technology … https://t.co/MExVkYxB8T
Whisper WebGPU: Real-Time in-Browser 🎙️ Speech Recognition with OpenAI Whisper Achieving real-time speech recognition directly within a web browser has long been a sought-after milestone. Whisper WebGPU by a Hugging Face Engineer (nickname ‘Xenova’) is a groundbreaking… https://t.co/0YLZiHRbkp
🚨🚨 Real-time in-browser speech recognition with OpenAI Whisper! 🤯 fully on-device using Transformers.js and ONNX Runtime Web Plus multilingual transcription across 100 different languages! 🔥 Demo (+ source code)! 👇 https://t.co/SYMz6FV3GH
real time whisper completely in the browser 😮 https://t.co/319zOVVrFI
This is just 🤯 - Real-time speech-to-text... - entirely running locally in your browser... - multilingual WebGPU is game changing https://t.co/YSe0uK7BOh
It's finally possible: real-time in-browser speech recognition with OpenAI Whisper! 🤯 The model runs fully on-device using Transformers.js and ONNX Runtime Web, and supports multilingual transcription across 100 different languages! 🔥 Check out the demo (+ source code)! 👇 https://t.co/W9CSM9zPwB