Chinese researchers developed DiffRhythm, a diffusion-based model that generates complete songs up to 4 minutes 45 seconds long, including both vocals and accompaniment. The model uses a variational autoencoder to compress audio into latent representations, which a diffusion transformer, conditioned on lyrics and style prompts, then generates. DiffRhythm can produce a high-quality 4-minute song in just 10 seconds, significantly faster than previous language-model-based approaches. The researchers released their model and code under a noncommercial license.
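To make the pipeline concrete, here is a minimal, purely illustrative sketch of the architecture described above: a VAE compresses audio into latent sequences, and a diffusion transformer conditioned on a lyric/style embedding iteratively refines random latents, which the VAE then decodes back to audio frames. All names (`TinyVAE`, `TinyDiT`, `generate`), shapes, and the simplified refinement loop are assumptions for illustration, not DiffRhythm's actual code or API.

```python
# Hypothetical sketch of a latent-diffusion song pipeline; not DiffRhythm's API.
import torch
import torch.nn as nn

class TinyVAE(nn.Module):
    """Stand-in VAE: maps audio frames to compact latents and back."""
    def __init__(self, frame_dim=512, latent_dim=64):
        super().__init__()
        self.encode = nn.Linear(frame_dim, latent_dim)
        self.decode = nn.Linear(latent_dim, frame_dim)

class TinyDiT(nn.Module):
    """Stand-in diffusion transformer: refines latents given conditioning."""
    def __init__(self, latent_dim=64, cond_dim=64, n_heads=4):
        super().__init__()
        self.cond_proj = nn.Linear(cond_dim, latent_dim)
        layer = nn.TransformerEncoderLayer(latent_dim, n_heads, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, num_layers=2)
        self.out = nn.Linear(latent_dim, latent_dim)

    def forward(self, noisy_latents, cond, t):
        # Inject lyric/style conditioning and a scalar timestep signal.
        h = noisy_latents + self.cond_proj(cond) + t
        return self.out(self.backbone(h))

@torch.no_grad()
def generate(dit, vae, cond, seq_len=256, steps=20):
    """Toy iterative-refinement loop (a stand-in for a real diffusion sampler):
    start from noise, repeatedly move latents toward the model's prediction,
    then decode the final latents with the VAE."""
    x = torch.randn(1, seq_len, 64)               # latent sequence for the whole song
    for step in reversed(range(steps)):
        t = torch.full((1, 1, 1), step / steps)   # normalized timestep, broadcast over latents
        pred = dit(x, cond, t)                    # model's estimate of the clean latents
        x = x + (pred - x) / (step + 1)           # simplified update toward the estimate
    return vae.decode(x)                          # back to audio-frame space

cond = torch.randn(1, 256, 64)  # placeholder lyric/style embedding
audio = generate(TinyDiT(), TinyVAE(), cond)
print(audio.shape)  # torch.Size([1, 256, 512])
```

Because the whole latent sequence is denoised in parallel rather than token by token, a handful of refinement steps covers the full song, which is what lets this family of models generate minutes of audio in seconds.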
GitHub and arXiv