OpenAI Updates Voice Engine: Human-Like Audio from Tex

Insider Tech@TechInsider

18 d

OpenAI seems awfully defensive about its AI voice engine https://t.co/qsempjVxQ4

Smoke-away@SmokeAwayyy

19 d

OpenAI: "starting last summer we showed global policymakers at the highest levels [Voice Engine's] potential and discussed the associated risks with them." Voice Engine is not GPT-4o Voice. It's a TTS model for generating human-like audio using text and 15 sec of sample speech.

Smoke-away@SmokeAwayyy

19 d

New OpenAI Voice Engine Blog Some highlights: - "The voice capability is powered by a text-to-speech (TTS) model, capable of generating human-like audio from just text and 15-seconds of sample speech." - "it employs a diffusion process, starting with random noise and… https://t.co/jnDJ8B8iaf

andrew gao@itsandrewgao

19 d

OpenAI had voice engine in 2022, here's their statement on it https://t.co/uXcZlW7qDd https://t.co/SPanwANvAW

Simon Willison@simonw

19 d

Surprising detail in this new article about OpenAI's Voice Engine: apparently their existing collection of TTS voices, built using professional voice actors, still only use a 15 second sample from each of those professionals to define each voice! https://t.co/xXpykjVFRB

bitlauncher@bitlauncherai

20 d

🔥 Unleash the power of #LargeLanguageModels to transform text into spoken magic! 🎙️ #NaturalLanguageProcessing #SpeechRecognition #AI🧵👇 https://t.co/73aDgIN0r1

Tibor Blaho@btibor91

20 d

OpenAI published an update on how Voice Engine works and their safety research, likely to address concerns related to the Sky voice - The Voice Engine generates human-like audio from text using a 15-second sample of speech and employs a diffusion process to match the speaker's… https://t.co/wm1SvOSTYK https://t.co/FiBaZa4NfJ

Paul Triolo@pstAsiatech

20 d

The, Um, Psychology of, Like, AI-Generated Voices; Effective Altruists Strike Back at OpenAI — The Information https://t.co/vo4tWP57PC

Similar Stories

OpenAI Updates Voice Engine: Human-Like Audio from Text Using 15-Second Sample Amid Safety Concerns

Similar Stories

Sources

OpenAI Updates Voice Engine: Human-Like Audio from Text Using 15-Second Sample Amid Safety Concerns