• ThePrompt
  • Posts
  • VALL-E by Microsoft breaks text-to-speech barriers ๐Ÿ—ฃ

VALL-E by Microsoft breaks text-to-speech barriers ๐Ÿ—ฃ

With VALL-E  you can create your own personalized audio files using just a 3-second recording of your voice and a written text prompt. 

Its new architecture outperforms older models because it doesn't need a lot of training data to make the speech sound natural.

Plus, it also preserves the same emotions and background sounds.

That's crazy.

3-second audio to whatever you want to do with your voice. ๐Ÿช„

The use cases are endless:

  • Read text messages from a sender in their own voice;

  • Create personalized audiobooks with the author's voice

They have a few examples that you can try on their project page, but unfortunately, their code is not open-sourced yet.

๐Ÿ“š Educational corner

๐Ÿ›ผ Makers corner

๐Ÿค“ Tech corner

  • HyperReel: create a video that you can move around and look at from different angles

  • RoDynRF - rendering 3D scenes in real-time from videos

  • PACO: Detecting parts + attributes of common objects in a photo

๐ŸŽจ Creativity corner

๐Ÿงฑ What are you building?

If you're using AI in your work or projects, I'd love to hear about it! Please reply to this email and let us know what you're up to. We may feature your work in our newsletter.

โค๏ธ If you like The Prompt, and want to support my work:

  • Share The Prompt with a friend, and invite them to subscribe here.

  • Book an ad in The Prompt ( reply to this email if youโ€™re interested)

Thank you for reading! โœŒ๐Ÿผ