arxiv:2510.10396
Yu Zhang
AaronZ345
AI & ML interests
Multi-Modal Generative AI (Spatial Audio/Music/Singing/Speech).
Recent Activity
new activity about 11 hours ago
GTSinger/GTSinger:Annotation quality is very low, not usable for training authored a paper 6 months ago
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with
Refined Annotations authored a paper 7 months ago
ASAudio: A Survey of Advanced Spatial Audio Research