|
Yuxuan Jiang (江宇轩)
I am a first-year PhD student at
Tsinghua University.
My research interest includes generative models such as Generative Models, Audio Generation and Multimodal Learning.
Email  / 
Google Scholar
|
|
|
ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Generation via Progressive Diffusion Modeling
Yuxuan Jiang,
Zehua Chen,
Zeqian Ju,
Yusheng Dai,
Weibei Dou,
Jun Zhu†
ACL 2026 Main (Oral)
Paper / 
Website / 
|
|
|
Omni2Sound: Towards Unified Video-Text-to-Audio Generation
Yusheng Dai,
Zehua Chen,
Yuxuan Jiang,
Hongke Qiu,
Jian Fei,
Jun Zhu†
CVPR 2026 (Highlight)
Paper / 
Website / 
|
|
|
FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation
Yuxuan Jiang,
Zehua Chen,
Zeqian Ju,
Chang Li,
Weibei Dou,
Jun Zhu†
ACM MM 2025 (Oral)
Paper / 
Website / 
|
website template
|