Yu Wang

Publication Activity (10 Years)

Years Active: 2013-2024
Publications (10 Years): 48

Top Topics

Speech Recognition

Top Venues

IEEE ACM Trans. Audio Speech Lang. Process.

Publications

Hongcheng Liu, Pingjie Wang, Zhiyuan Zhu, Yanfeng Wang, Yu Wang
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation. LREC/COLING (2024)
Yusheng Liao, Shuyang Jiang, Yanfeng Wang, Yu Wang
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation. CoRR (2024)
Yusheng Liao, Yanfeng Wang, Yu Wang
Leveraging Diverse Modeling Contexts With Collaborating Learning for Neural Machine Translation. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Yusheng Liao, Yutong Meng, Yuhao Wang, Hongcheng Liu, Yanfeng Wang, Yu Wang
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator. CoRR (2024)
Hongcheng Liu, Pingjie Wang, Yu Wang, Yanfeng Wang
M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation. CoRR (2024)
Hongcheng Liu, Zhe Chen, Hui Li, Pingjie Wang, Yanfeng Wang, Yu Wang
MSG-BART: Multi-Granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-Grounded Dialogue Generation. ICASSP (2024)
Yusheng Liao, Yanfeng Wang, Yu Wang
Leveraging Diverse Modeling Contexts with Collaborating Learning for Neural Machine Translation. CoRR (2024)
Shuyang Jiang, Yusheng Liao, Ya Zhang, Yu Wang, Yanfeng Wang
TAIA: Large Language Models are Out-of-Distribution Data Learners. CoRR (2024)
Heyang Liu, Yu Wang, Yanfeng Wang
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview. CoRR (2024)
Zhe Chen, Hongcheng Liu, Yu Wang
DialogMCF: Multimodal Context Flow for Audio Visual Scene-Aware Dialog. IEEE ACM Trans. Audio Speech Lang. Process. 32 (2024)
Zhe Chen, Heyang Liu, Wenyi Yu, Guangzhi Sun, Hongcheng Liu, Ji Wu, Chao Zhang, Yu Wang, Yanfeng Wang
AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset. CoRR (2024)
Yuchen Yang, Yu Wang, Yanfeng Wang
SDA: Semantic Discrepancy Alignment for Text-conditioned Image Retrieval. ACL (Findings) (2024)
Yusheng Liao, Shuyang Jiang, Yu Wang, Yanfeng Wang
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts. CoRR (2024)
Jinxiang Liu, Chen Ju, Chaofan Ma, Yanfeng Wang, Yu Wang, Ya Zhang
Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation. CoRR (2023)
Chaoqin Huang, Qinwei Xu, Yanfeng Wang, Yu Wang, Ya Zhang
Self-Supervised Masking for Unsupervised Anomaly Detection and Localization. IEEE Trans. Multim. 25 (2023)
Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition. INTERSPEECH (2023)
Yusheng Liao, Yutong Meng, Hongcheng Liu, Yanfeng Wang, Yu Wang
An Automatic Evaluation Framework for Multi-turn Medical Consultations Capabilities of Large Language Models. CoRR (2023)
Ziyang Ma, Zhisheng Zheng, Guanrou Yang, Yu Wang, Chao Zhang, Xie Chen
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation. INTERSPEECH (2023)
Zhisheng Zheng, Ziyang Ma, Yu Wang, Xie Chen
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition. CoRR (2023)
Ziyang Ma, Zhisheng Zheng, Guanrou Yang, Yu Wang, Chao Zhang, Xie Chen
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation. CoRR (2023)
Zhiyuan Zhu, Yusheng Liao, Yu Wang, Yunfeng Guan
Contrastive Learning Based ASR Robust Knowledge Selection For Spoken Dialogue System. INTERSPEECH (2023)
Zihan Zhao, Yiyang Jiang, Heyang Liu, Yanfeng Wang, Yu Wang
LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework. CoRR (2023)
Jinxiang Liu, Yu Wang, Chen Ju, Chaofan Ma, Ya Zhang, Weidi Xie
Annotation-free Audio-Visual Segmentation. CoRR (2023)
Chenyu Yang, Mengxi Chen, Yanfeng Wang, Yu Wang
Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings. ACM Multimedia (2023)
Yiting Lu, Yu Wang, Mark J. F. Gales
Efficient Use of End-to-End Data in Spoken Language Processing. ICASSP (2021)
Yiting Lu, Mark J. F. Gales, Yu Wang
Spoken Language 'Grammatical Error Correction'. INTERSPEECH (2020)
Kate M. Knill, Linlin Wang, Yu Wang, Xixin Wu, Mark J. F. Gales
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems. INTERSPEECH (2020)
Jeremy Heng Meng Wong, Mark J. F. Gales, Yu Wang
Learning Between Different Teacher and Student Models in ASR. ASRU (2019)
Xie Chen, Xunying Liu, Yu Wang, Anton Ragni, Jeremy Heng Meng Wong, Mark J. F. Gales
Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27 (9) (2019)
Yiting Lu, Mark J. F. Gales, Kate M. Knill, P. P. Manakul, Linlin Wang, Yu Wang
Impact of ASR Performance on Spoken Grammatical Error Detection. INTERSPEECH (2019)
Yiting Lu, Mark J. F. Gales, Katherine Knill, Potsawee Manakul, Yu Wang
Disfluency Detection for Spoken Learner English. SLaTE (2019)
Linlin Wang, Yu Wang, Mark J. F. Gales
Non-native Speaker Verification for Spoken Language Assessment. CoRR (2019)
Dushyant Sharma, Aidan O. T. Hogg, Yu Wang, Amr H. Nour-Eldin, Patrick A. Naylor
Non-Intrusive POLQA Estimation of Speech Quality using Recurrent Neural Networks. EUSIPCO (2019)
Jeremy Heng Meng Wong, Mark John Francis Gales, Yu Wang
General Sequence Teacher-Student Learning. IEEE ACM Trans. Audio Speech Lang. Process. 27 (11) (2019)
Yu Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems. INTERSPEECH (2018)
Anton Ragni, Qiujia Li, Mark J. F. Gales, Yu Wang
Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks. CoRR (2018)
Yu Wang, Xie Chen, M. J. F. Gales, Anton Ragni, Jeremy Heng Meng Wong
Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription. ICASSP (2018)
Kate Knill, Mark J. F. Gales, Konstantinos Kyriakopoulos, Andrey Malinin, Anton Ragni, Yu Wang, Andrew Caines
Impact of ASR Performance on Free Speaking Language Assessment. INTERSPEECH (2018)
Yu Wang, Xie Chen, Mark J. F. Gales, Anton Ragni, Jeremy Heng Meng Wong
Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription. CoRR (2018)
Yu Wang, Mike Brookes
Model-Based Speech Enhancement in the Modulation Domain. IEEE ACM Trans. Audio Speech Lang. Process. 26 (3) (2018)
Yu Wang, M. J. F. Gales, Kate M. Knill, Konstantinos Kyriakopoulos, Andrey Malinin, Rogier C. van Dalen, M. Rashid
Towards automatic assessment of spontaneous spoken English. Speech Commun. 104 (2018)
Andrey Malinin, Kate Knill, Anton Ragni, Yu Wang, Mark J. F. Gales
An attention based model for off-topic spontaneous spoken response detection: An Initial Study. SLaTE (2017)
Yu Wang, Mike Brookes
Model-Based Speech Enhancement in the Modulation Domain. CoRR (2017)
Kate M. Knill, Mark J. F. Gales, Konstantinos Kyriakopoulos, Anton Ragni, Yu Wang
Use of Graphemic Lexicons for Spoken Language Assessment. INTERSPEECH (2017)
Xie Chen, Xunying Liu, Anton Ragni, Yu Wang, Mark J. F. Gales
Future Word Contexts in Neural Network Language Models. CoRR (2017)
Andrey Malinin, Rogier C. van Dalen, Kate Knill, Yu Wang, Mark J. F. Gales
Off-topic Response Detection for Spontaneous Spoken English Assessment. ACL (1) (2016)
Dushyant Sharma, Yu Wang, Patrick A. Naylor, Mike Brookes
A data-driven non-intrusive measure of speech quality and intelligibility. Speech Commun. 80 (2016)
Yu Wang, Mike Brookes
Speech enhancement using an MMSE spectral amplitude estimator based on a modulation domain Kalman filter with a Gamma prior. ICASSP (2016)
Yu Wang, Mike Brookes
Speech enhancement usinga modulation domain Kalman filter post-processor with a Gaussian Mixture noise model. ICASSP (2014)
Yu Wang, Mike Brookes
A subspace method for speech enhancement in the modulation domain. EUSIPCO (2013)
Yu Wang, Mike Brookes
Speech enhancement using a robust Kalman filter post-processor in the modulation domain. ICASSP (2013)