AudioX-Turbo: A Unified Framework for Efficient Anything-to-Audio...
Audio and music generation based on flexible multimodal control signals is a widely applicable topic, with the following key challenges: 1) a unified multimodal modeling framework, 2) large-scale,...
arxiv.org