Folgen
Yi Yuan
Yi Yuan
University of Surrey, Centre for Vision, Speech, and Signal processing (CVSSP)
Bestätigte E-Mail-Adresse bei surrey.ac.uk
Titel
Zitiert von
Zitiert von
Jahr
Audioldm: Text-to-audio generation with latent diffusion models
H Liu, Z Chen, Y Yuan, X Mei, X Liu, D Mandic, W Wang, MD Plumbley
arXiv preprint arXiv:2301.12503, 2023
5292023
Audioldm 2: Learning holistic audio generation with self-supervised pretraining
H Liu, Y Yuan, X Liu, X Mei, Q Kong, Q Tian, Y Wang, W Wang, Y Wang, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
217*2024
Separate anything you describe
X Liu, Q Kong, Y Zhao, H Liu, Y Yuan, Y Liu, R Xia, Y Wang, MD Plumbley, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
382024
Retrieval-augmented text-to-audio generation
Y Yuan, H Liu, X Liu, Q Huang, MD Plumbley, W Wang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
212024
Wavjourney: Compositional audio creation with large language models
X Liu, Z Zhu, H Liu, Y Yuan, M Cui, Q Huang, J Liang, Y Cao, Q Kong, ...
arXiv preprint arXiv:2307.14335, 2023
182023
SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound
H Liu, X Xu, Y Yuan, M Wu, W Wang, MD Plumbley
arXiv preprint arXiv:2405.00233, 2024
162024
Latent diffusion model based foley sound generation system for dcase challenge 2023 task 7
Y Yuan, H Liu, X Liu, X Kang, MD Plumbley, W Wang
arXiv preprint arXiv:2305.15905, 2023
122023
Mlops spanning whole machine learning life cycle: A survey
F Zhengxin, Y Yi, Z Jingyu, L Yue, M Yuechen, L Qinghua, X Xiwei, W Jeff, ...
arXiv preprint arXiv:2304.07296, 2023
122023
Leveraging pre-trained AudioLDM for sound generation: A benchmark study
Y Yuan, H Liu, J Liang, X Liu, MD Plumbley, W Wang
2023 31st European Signal Processing Conference (EUSIPCO), 765-769, 2023
102023
Text-driven foley sound generation with latent diffusion model
Y Yuan, H Liu, X Liu, X Kang, P Wu, MD Plumbley, W Wang
arXiv preprint arXiv:2306.10359, 2023
102023
Improving audio generation with visual enhanced caption
Y Yuan, D Jia, X Zhuang, Y Chen, Z Liu, Z Chen, Y Wang, Y Wang, X Liu, ...
arXiv e-prints, arXiv: 2407.04416, 2024
62024
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining
Y Yuan, Z Chen, X Liu, H Liu, X Xu, D Jia, Y Chen, MD Plumbley, W Wang
arXiv preprint arXiv:2404.17806, 2024
32024
DIFFUSION BASED SOUND SCENE SYNTHESIS FOR DCASE CHALLENGE 2024 TASK 7
Y Yuan, H Liu, X Liu, MD Plumbley, W Wang
22024
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder
J Zhao, X Liu, J Zhao, Y Yuan, Q Kong, MD Plumbley, W Wang
2024 32nd European Signal Processing Conference (EUSIPCO), 1-5, 2024
12024
PLDISET: Probabilistic localization and detection of independent sound events with transformers
P Wu, J Zhao, Y Chen, D Berghi, Y Yuan, C Zhu, Y Cao, Y Liu, ...
Detection and Classification of Acoustic Scenes and Events 2023, 2023
12023
Sound-VECaps: Improving Audio Generation With Visual Enhanced Captions
Y Yuan, D Jia, X Zhuang, Y Chen, Z Liu, Z Chen, Y Wang, Y Wang, X Liu, ...
Audio Imagination: NeurIPS 2024 Workshop AI-Driven Speech, Music, and Sound …, 2024
2024
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching
Y Yuan, X Liu, H Liu, MD Plumbley, W Wang
arXiv preprint arXiv:2409.07614, 2024
2024
HFM++: An Enhanced Holographic Factorization Machine for Recommendation
Z Fang, M Qu, S Zhang, J Zhang, Y Yuan, L Yao, S Chen
Australasian Conference on Data Mining, 72-85, 2021
2021
AudioMorphix: Training-free audio editing with diffusion probabilistic models
J Liang, Y Yuan, D Jia, X Zhuang, Z Liu, Y Chen, Z Chen, Y Wang, ...
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–19