​
Login / Signup
Yu Wang
ORCID
Publication Activity (10 Years)
Years Active: 2013-2024
Publications (10 Years): 48
Top Topics
Speech Recognition
Language Model
Neural Network
N Gram
Top Venues
CoRR
INTERSPEECH
ICASSP
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Hongcheng Liu
,
Pingjie Wang
,
Zhiyuan Zhu
,
Yanfeng Wang
,
Yu Wang
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation.
LREC/COLING
(2024)
Yusheng Liao
,
Shuyang Jiang
,
Yanfeng Wang
,
Yu Wang
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation.
CoRR
(2024)
Yusheng Liao
,
Yanfeng Wang
,
Yu Wang
Leveraging Diverse Modeling Contexts With Collaborating Learning for Neural Machine Translation.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Yusheng Liao
,
Yutong Meng
,
Yuhao Wang
,
Hongcheng Liu
,
Yanfeng Wang
,
Yu Wang
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator.
CoRR
(2024)
Hongcheng Liu
,
Pingjie Wang
,
Yu Wang
,
Yanfeng Wang
M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation.
CoRR
(2024)
Hongcheng Liu
,
Zhe Chen
,
Hui Li
,
Pingjie Wang
,
Yanfeng Wang
,
Yu Wang
MSG-BART: Multi-Granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-Grounded Dialogue Generation.
ICASSP
(2024)
Yusheng Liao
,
Yanfeng Wang
,
Yu Wang
Leveraging Diverse Modeling Contexts with Collaborating Learning for Neural Machine Translation.
CoRR
(2024)
Shuyang Jiang
,
Yusheng Liao
,
Ya Zhang
,
Yu Wang
,
Yanfeng Wang
TAIA: Large Language Models are Out-of-Distribution Data Learners.
CoRR
(2024)
Heyang Liu
,
Yu Wang
,
Yanfeng Wang
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview.
CoRR
(2024)
Zhe Chen
,
Hongcheng Liu
,
Yu Wang
DialogMCF: Multimodal Context Flow for Audio Visual Scene-Aware Dialog.
IEEE ACM Trans. Audio Speech Lang. Process.
32 (2024)
Zhe Chen
,
Heyang Liu
,
Wenyi Yu
,
Guangzhi Sun
,
Hongcheng Liu
,
Ji Wu
,
Chao Zhang
,
Yu Wang
,
Yanfeng Wang
AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset.
CoRR
(2024)
Yuchen Yang
,
Yu Wang
,
Yanfeng Wang
SDA: Semantic Discrepancy Alignment for Text-conditioned Image Retrieval.
ACL (Findings)
(2024)
Yusheng Liao
,
Shuyang Jiang
,
Yu Wang
,
Yanfeng Wang
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts.
CoRR
(2024)
Jinxiang Liu
,
Chen Ju
,
Chaofan Ma
,
Yanfeng Wang
,
Yu Wang
,
Ya Zhang
Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation.
CoRR
(2023)
Chaoqin Huang
,
Qinwei Xu
,
Yanfeng Wang
,
Yu Wang
,
Ya Zhang
Self-Supervised Masking for Unsupervised Anomaly Detection and Localization.
IEEE Trans. Multim.
25 (2023)
Zhisheng Zheng
,
Ziyang Ma
,
Yu Wang
,
Xie Chen
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition.
INTERSPEECH
(2023)
Yusheng Liao
,
Yutong Meng
,
Hongcheng Liu
,
Yanfeng Wang
,
Yu Wang
An Automatic Evaluation Framework for Multi-turn Medical Consultations Capabilities of Large Language Models.
CoRR
(2023)
Ziyang Ma
,
Zhisheng Zheng
,
Guanrou Yang
,
Yu Wang
,
Chao Zhang
,
Xie Chen
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation.
INTERSPEECH
(2023)
Zhisheng Zheng
,
Ziyang Ma
,
Yu Wang
,
Xie Chen
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition.
CoRR
(2023)
Ziyang Ma
,
Zhisheng Zheng
,
Guanrou Yang
,
Yu Wang
,
Chao Zhang
,
Xie Chen
Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation.
CoRR
(2023)
Zhiyuan Zhu
,
Yusheng Liao
,
Yu Wang
,
Yunfeng Guan
Contrastive Learning Based ASR Robust Knowledge Selection For Spoken Dialogue System.
INTERSPEECH
(2023)
Zihan Zhao
,
Yiyang Jiang
,
Heyang Liu
,
Yanfeng Wang
,
Yu Wang
LibriSQA: Advancing Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework.
CoRR
(2023)
Jinxiang Liu
,
Yu Wang
,
Chen Ju
,
Chaofan Ma
,
Ya Zhang
,
Weidi Xie
Annotation-free Audio-Visual Segmentation.
CoRR
(2023)
Chenyu Yang
,
Mengxi Chen
,
Yanfeng Wang
,
Yu Wang
Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings.
ACM Multimedia
(2023)
Yiting Lu
,
Yu Wang
,
Mark J. F. Gales
Efficient Use of End-to-End Data in Spoken Language Processing.
ICASSP
(2021)
Yiting Lu
,
Mark J. F. Gales
,
Yu Wang
Spoken Language 'Grammatical Error Correction'.
INTERSPEECH
(2020)
Kate M. Knill
,
Linlin Wang
,
Yu Wang
,
Xixin Wu
,
Mark J. F. Gales
Non-Native Children's Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems.
INTERSPEECH
(2020)
Jeremy Heng Meng Wong
,
Mark J. F. Gales
,
Yu Wang
Learning Between Different Teacher and Student Models in ASR.
ASRU
(2019)
Xie Chen
,
Xunying Liu
,
Yu Wang
,
Anton Ragni
,
Jeremy Heng Meng Wong
,
Mark J. F. Gales
Exploiting Future Word Contexts in Neural Network Language Models for Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (9) (2019)
Yiting Lu
,
Mark J. F. Gales
,
Kate M. Knill
,
P. P. Manakul
,
Linlin Wang
,
Yu Wang
Impact of ASR Performance on Spoken Grammatical Error Detection.
INTERSPEECH
(2019)
Yiting Lu
,
Mark J. F. Gales
,
Katherine Knill
,
Potsawee Manakul
,
Yu Wang
Disfluency Detection for Spoken Learner English.
SLaTE
(2019)
Linlin Wang
,
Yu Wang
,
Mark J. F. Gales
Non-native Speaker Verification for Spoken Language Assessment.
CoRR
(2019)
Dushyant Sharma
,
Aidan O. T. Hogg
,
Yu Wang
,
Amr H. Nour-Eldin
,
Patrick A. Naylor
Non-Intrusive POLQA Estimation of Speech Quality using Recurrent Neural Networks.
EUSIPCO
(2019)
Jeremy Heng Meng Wong
,
Mark John Francis Gales
,
Yu Wang
General Sequence Teacher-Student Learning.
IEEE ACM Trans. Audio Speech Lang. Process.
27 (11) (2019)
Yu Wang
,
Chao Zhang
,
Mark J. F. Gales
,
Philip C. Woodland
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems.
INTERSPEECH
(2018)
Anton Ragni
,
Qiujia Li
,
Mark J. F. Gales
,
Yu Wang
Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks.
CoRR
(2018)
Yu Wang
,
Xie Chen
,
M. J. F. Gales
,
Anton Ragni
,
Jeremy Heng Meng Wong
Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription.
ICASSP
(2018)
Kate Knill
,
Mark J. F. Gales
,
Konstantinos Kyriakopoulos
,
Andrey Malinin
,
Anton Ragni
,
Yu Wang
,
Andrew Caines
Impact of ASR Performance on Free Speaking Language Assessment.
INTERSPEECH
(2018)
Yu Wang
,
Xie Chen
,
Mark J. F. Gales
,
Anton Ragni
,
Jeremy Heng Meng Wong
Phonetic and Graphemic Systems for Multi-Genre Broadcast Transcription.
CoRR
(2018)
Yu Wang
,
Mike Brookes
Model-Based Speech Enhancement in the Modulation Domain.
IEEE ACM Trans. Audio Speech Lang. Process.
26 (3) (2018)
Yu Wang
,
M. J. F. Gales
,
Kate M. Knill
,
Konstantinos Kyriakopoulos
,
Andrey Malinin
,
Rogier C. van Dalen
,
M. Rashid
Towards automatic assessment of spontaneous spoken English.
Speech Commun.
104 (2018)
Andrey Malinin
,
Kate Knill
,
Anton Ragni
,
Yu Wang
,
Mark J. F. Gales
An attention based model for off-topic spontaneous spoken response detection: An Initial Study.
SLaTE
(2017)
Yu Wang
,
Mike Brookes
Model-Based Speech Enhancement in the Modulation Domain.
CoRR
(2017)
Kate M. Knill
,
Mark J. F. Gales
,
Konstantinos Kyriakopoulos
,
Anton Ragni
,
Yu Wang
Use of Graphemic Lexicons for Spoken Language Assessment.
INTERSPEECH
(2017)
Xie Chen
,
Xunying Liu
,
Anton Ragni
,
Yu Wang
,
Mark J. F. Gales
Future Word Contexts in Neural Network Language Models.
CoRR
(2017)
Andrey Malinin
,
Rogier C. van Dalen
,
Kate Knill
,
Yu Wang
,
Mark J. F. Gales
Off-topic Response Detection for Spontaneous Spoken English Assessment.
ACL (1)
(2016)
Dushyant Sharma
,
Yu Wang
,
Patrick A. Naylor
,
Mike Brookes
A data-driven non-intrusive measure of speech quality and intelligibility.
Speech Commun.
80 (2016)
Yu Wang
,
Mike Brookes
Speech enhancement using an MMSE spectral amplitude estimator based on a modulation domain Kalman filter with a Gamma prior.
ICASSP
(2016)
Yu Wang
,
Mike Brookes
Speech enhancement usinga modulation domain Kalman filter post-processor with a Gaussian Mixture noise model.
ICASSP
(2014)
Yu Wang
,
Mike Brookes
A subspace method for speech enhancement in the modulation domain.
EUSIPCO
(2013)
Yu Wang
,
Mike Brookes
Speech enhancement using a robust Kalman filter post-processor in the modulation domain.
ICASSP
(2013)