- Where Culture Fades: Revealing the Cultural Gap in Text-to-Image Generation
  Paper • 2511.17282 • Published • 14
- DreamingComics: A Story Visualization Pipeline via Subject and Layout Customized Generation using Video Models
  Paper • 2512.01686 • Published • 1
- Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval: a Survey
  Paper • 2402.17467 • Published • 2
- ComposerX: Multi-Agent Symbolic Music Composition with LLMs
  Paper • 2404.18081 • Published • 2
Collections including paper arxiv:2312.08723
- Musical Form Generation
  Paper • 2310.19842 • Published • 1
- StemGen: A music generation model that listens
  Paper • 2312.08723 • Published • 49
- Long-form music generation with latent diffusion
  Paper • 2404.10301 • Published • 27
- SongCreator: Lyrics-based Universal Song Generation
  Paper • 2409.06029 • Published • 22

- aMUSEd: An Open MUSE Reproduction
  Paper • 2401.01808 • Published • 31
- From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
  Paper • 2401.01885 • Published • 28
- SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
  Paper • 2401.00604 • Published • 6
- LARP: Language-Agent Role Play for Open-World Games
  Paper • 2312.17653 • Published • 33