AI Called VALL-E Needs 3 Seconds To Imitate Anyone's Voice

Microsoft showed AI that can imitate any human voice. It’s called VALL-E, just like the previous DALL-E algorithm. If you know, the latter creates an image based on a text.

VALL-E can imitate the timbre and manner of speech by listening to a real person’s voice in just three seconds. Although the sound sounds a little like the voice of a robot, the result is still impressive.

Microsoft called it a “neural codec language model.” VALL-E was built on the basis of EnCodec (an audio codec using machine learning techniques), developed by Meta a year ago, in 2022.

Join GizChina on Telegram

VALL-E imitates anyone’s voice

Other text-to-speech methods take into account waveforms. But VALL-E generates separate audio codecs from text and audio. In effect, it analyzes how a person sounds. Then, it breaks that info down into separate parts (called “tokens”) via EnCodec. And in the end, it uses training data to match what it “knows” about how that voice would sound if it spoke other phrases outside of the three-second sample.

VALL-E was taught using a special library. The latter contains 60,000 hours of English speech from more than 7,000 people. The developers suggest that the method could be used for high-quality text-to-speech applications. For instance, you can use it for editing speech recordings where human words are allowed to be changed. As a result, you can create audio content (such as voiceovers for audiobooks), and more.

Of course, such a tech can also carry a certain danger. Sooner or later, “one-eyed” users will make it a blackmail tool. Say, they can use AI to prove that famous people have said something that they didn’t. There have already been such cases with deepfakes in video format.

We guess you have watched the video featuring Elon Musk, who promises huge returns to invest in a dodgy cryptocurrency.

Disclaimer: We may be compensated by some of the companies whose products we talk about, but our articles and reviews are always our honest opinions. For more details, you can check out our editorial guidelines and learn about how we use affiliate links.

Source/VIA :

VALL-E

Official Galaxy S25 Wallpapers Now Available for Download

Samsung Reveals the Ultra-Thin Galaxy S25 Edge

Samsung Galaxy S25 series India pricing revealed

Goodbye Bixby: Gemini is the new personal assistant on the Galaxy S25

Scykei, the US Brand Designed for Z Generation, Will Make Its Debut at CES 2025

OnePlus Watch 3 Pro to Launch in 2025 alongside the Watch 3

Essential Tips Before Purchasing Your First Smart Ring

Apple Watch Series 10: Bigger Screen, Thinner Design, More Power

AGM PAD T2 Review: A Tablet for Every Outdoor Adventure and More

Honor MagicPad 2 Review: A Stunning Display with Unmatched VFM!

AGM Pad P2 Active Review: Robust Tablet in a Practical Case

Redmi Pad SE 8.7 Leaked Ahead of Launch

NOVOO 100W USB C Charger Review: Compact Power with GaN III Technology

Honor Magic 7 Lite: A “Budget Flagship” That Redefines Value

Honor Magic7 Pro Review: A Robust Flagship Packed with Innovation and AI

What Makes vivo X200 Pro the Ultimate Flagship?

Microsoft’s AI Called VALL-E Needs 3 Seconds To Imitate Anyone’s Voice

VALL-E imitates anyone’s voice

Previous Galaxy S23 Ultra Seems to Be a Clear Winner Against iPhone 15

Next Samsung Galaxy Z Fold 5 Will Have Extra Features

Argam Artashyan

Apple Intelligence: Tim Cook Puts the Fee Rumors to Rest

Amazon’s Shift: The End of Free Alexa Services

Google May Have to Sale Android, AdWords and Chrome

Samsung’s Galaxy AI Features to Reach 200 Million Devices This Year

Snapdragon 8 Elite for Galaxy: The Fastest Mobile Chip with Satellite Connectivity

Official Galaxy S25 Wallpapers Now Available for Download

Samsung Reveals the Ultra-Thin Galaxy S25 Edge

Samsung Galaxy S25 series India pricing revealed

MENU

VALL-E imitates anyone’s voice

Previous Galaxy S23 Ultra Seems to Be a Clear Winner Against iPhone 15

Next Samsung Galaxy Z Fold 5 Will Have Extra Features

Argam Artashyan

Related Posts

MENU