VALL-E by Microsoft breaks text-to-speech barriers
With VALL-E, you can generate personalized speech from just a 3-second recording of your voice and a written text prompt.
Unlike older text-to-speech models, it doesn't need a lot of recordings of the target speaker to make the speech sound natural: a short sample is enough.
It also preserves the emotion and background sounds of the original recording.
That's crazy.
Three seconds of audio, and you can make your voice say whatever you want.
The use cases are endless:
Read text messages aloud in the sender's own voice
Create personalized audiobooks in the author's voice
You can listen to a few examples on their project page, but unfortunately the code is not open-sourced yet.
Educational corner
Makers corner
arXiv Xplorer - semantic search engine for arXiv papers
Tech corner
HyperReel: create a video that you can move around and look at from different angles
RoDynRF - rendering 3D scenes in real-time from videos
PACO: Detecting parts + attributes of common objects in a photo
Creativity corner
What are you building?
If you're using AI in your work or projects, I'd love to hear about it! Please reply to this email and let us know what you're up to. We may feature your work in our newsletter.
If you like The Prompt and want to support my work:
Share The Prompt with a friend, and invite them to subscribe here.
Book an ad in The Prompt (reply to this email if you're interested)
Thank you for reading!