NVIDIA's new AI model Fugatto can create audio from text prompts
SMRTR summary
NVIDIA has introduced Fugatto, an experimental generative AI model that creates or modifies audio, music, and voices from text prompts. Potential applications include music production, language learning, and video game sound design. The model can render speech in multiple accents and languages, combine sound elements, and create audio scenes that evolve over time. NVIDIA hasn't announced public access, but similar text-to-sound AI tools are available from companies like Meta and Google.
SMRTR provides this summary for quick context. The original article belongs to Engadget.