ChatGPT Creator OpenAI’s Voice Cloning Technology Is So Good That Even They Find It Too Scary for Public Release

CryptoGlobe Writer

30 Mar 2024
/
In #AI, #OpenAI

OpenAI, a prominent artificial intelligence research organization, published a blog post on March 29, discussing their latest development: Voice Engine. This model, created in late 2022, can generate natural-sounding speech that closely resembles the original speaker using only a 15-second audio sample and text input. While the technology is impressive, OpenAI is cautious about its broader release due to the potential for misuse.

Voice Engine has already been used in various applications, such as powering preset voices in OpenAI’s text-to-speech API and enhancing ChatGPT Voice and Read Aloud features. To better understand the real-world applications of Voice Engine, OpenAI has been working with a select group of trusted partners since late 2022.

These collaborations have yielded interesting results, with companies like Age of Learning using Voice Engine for personalized educational content, HeyGen leveraging it for video translation, and Dimagi utilizing it to provide interactive feedback to community health workers. The technology has even been piloted in healthcare, with the Norman Prince Neurosciences Institute at Lifespan using it to restore the voices of patients with speech impairments.

However, OpenAI is well aware of the risks associated with generating speech that closely mimics people’s voices, particularly in an election year. To address these concerns, the company has implemented safety measures and usage policies for their partners, such as prohibiting impersonation without consent, requiring explicit permission from the original speaker, and using watermarking to trace the origin of generated audio.

As synthetic speech technology advances, OpenAI is advocating for proactive measures to ensure its responsible deployment. This includes phasing out voice-based authentication for sensitive information, educating the public on the capabilities and limitations of AI, and developing techniques to track the origin of audiovisual content.

In line with their commitment to AI safety, OpenAI has decided to preview Voice Engine but not release it widely at this time. By sharing these insights, the company aims to initiate a conversation about the future of synthetic voices and the necessary steps to harness their potential while mitigating the risks of misuse.

Here are a few reactions to OpenAI’s announcement:

Voice AI is by far the most dangerous modality.

Superhuman, persuasive voice is something we have minimal defences to.

Figuring out what to do about this should be one of our top priorities.

(We had sota models but didn’t release for this reason eg https://t.co/vjY99uCdTl) https://t.co/fKIZrVQCml
— Emad acc/acc (@EMostaque) March 29, 2024

If you haven't disabled voice authentication for your bank account and had a conversation with your family about AI voice impersonation yet, now would be a good time. https://t.co/TkpdGUfr76
— Noam Brown (@polynoamial) March 29, 2024

OpenAI has had wild speech tech for a while now.

We're still unsure whether/how we want to make them widely available ourselves (which ofc raises a bunch of issues), but it's just a matter of time before someone does, and more should be done to prepare: https://t.co/8F2jTqbrLO
— Miles Brundage (@Miles_Brundage) March 29, 2024

Featured Image via Pixabay

Disclaimer

The views and opinions expressed by the author, or any people mentioned in this article, are for informational purposes only, and they do not constitute financial, investment, or other advice. Investing in or trading cryptoassets comes with a risk of financial loss.

ChatGPT Creator OpenAI’s Voice Cloning Technology Is So Good That Even They Find It Too Scary for Public Release

Disclaimer

Related Articles

Dogecoin Poised for Major Breakout, Predicts Analyst Who Called Bitcoin’s 2021 Crash

Bitcoin Accumulation Addresses Now Hold $194 Billion in BTC as Long-Term Investors Stack Sats at Unprecedented Pace

$473 Billion Debt Increase in Just Three Weeks Pushes U.S. National Debt to Unprecedented Levels

You Might Like

Most Read

$473 Billion Debt Increase in Just Three Weeks Pushes U.S. National Debt to Unprecedented Levels

VanEck Solana ETN Now Offers Staking Rewards for Investors Across the European Union

Bitcoin’s Rising Value Enriches Early Adopters at the Expense of Everyone Else: ECB Paper

VanEck’s $ETH 2030 Price Target Could Drop by Two-Thirds if Ethereum’s Economic Model Does Not Change

Record Crypto Activity in 2024, Led by Solana’s 100M Active Addresses: Report