​
Login / Signup
Rao Ma
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 32
Top Topics
Speech Recognition
Natural Language Understanding
Language Model
Multiple Input
Top Venues
CoRR
ICASSP
INTERSPEECH
IEEE ACM Trans. Audio Speech Lang. Process.
</>
Publications
</>
Stefano BannĂ²
,
Rao Ma
,
Mengjie Qian
,
Kate M. Knill
,
Mark J. F. Gales
Towards End-to-End Spoken Grammatical Error Correction.
ICASSP
(2024)
Rao Ma
,
Adian Liusie
,
Mark J. F. Gales
,
Kate M. Knill
Investigating the Emergent Audio Classification Ability of ASR Foundation Models.
NAACL-HLT
(2024)
Vyas Raina
,
Rao Ma
,
Charles McGhee
,
Kate Knill
,
Mark J. F. Gales
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models.
CoRR
(2024)
Mengjie Qian
,
Siyuan Tang
,
Rao Ma
,
Kate M. Knill
,
Mark J. F. Gales
Learn and Don't Forget: Adding a New Language to ASR Foundation Models.
CoRR
(2024)
Rao Ma
,
Yassir Fathullah
,
Mengjie Qian
,
Siyuan Tang
,
Mark J. F. Gales
,
Kate M. Knill
Cross-Lingual Transfer Learning for Speech Translation.
CoRR
(2024)
Rao Ma
,
Mark J. F. Gales
,
Kate M. Knill
,
Mengjie Qian
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space.
INTERSPEECH
(2023)
Rao Ma
,
Mark J. F. Gales
,
Kate Knill
,
Mengjie Qian
N-best T5: Robust ASR Error Correction using Multiple Input Hypotheses and Constrained Decoding Space.
CoRR
(2023)
Rao Ma
,
Mengjie Qian
,
Mark J. F. Gales
,
Kate M. Knill
Adapting an Unadaptable ASR System.
CoRR
(2023)
Rao Ma
,
Mengjie Qian
,
Potsawee Manakul
,
Mark J. F. Gales
,
Kate Knill
Can Generative Large Language Models Perform ASR Error Correction?
CoRR
(2023)
Rao Ma
,
Mengjie Qian
,
Mark J. F. Gales
,
Kate M. Knill
Adapting an Unadaptable ASR System.
INTERSPEECH
(2023)
Stefano BannĂ²
,
Rao Ma
,
Mengjie Qian
,
Kate M. Knill
,
Mark J. F. Gales
Towards End-to-End Spoken Grammatical Error Correction.
CoRR
(2023)
Rao Ma
,
Mengjie Qian
,
Mark J. F. Gales
,
Kate M. Knill
Adapting an ASR Foundation Model for Spoken Language Assessment.
CoRR
(2023)
Mengjie Qian
,
Rao Ma
,
Adian Liusie
,
Erfan Loweimi
,
Kate M. Knill
,
Mark J. F. Gales
Zero-shot Audio Topic Reranking using Large Language Models.
CoRR
(2023)
Rao Ma
,
Mengjie Qian
,
Mark J. F. Gales
,
Katherine M. Knill
Adapting an ASR Foundation Model for Spoken Language Assessment.
SLaTE
(2023)
Rao Ma
,
Adian Liusie
,
Mark J. F. Gales
,
Kate M. Knill
Investigating the Emergent Audio Classification Ability of ASR Foundation Models.
CoRR
(2023)
Rao Ma
,
Xiaobo Wu
,
Jin Qiu
,
Yanan Qin
,
Haihua Xu
,
Peihao Wu
,
Zejun Ma
Internal Language Model Estimation Based Adaptive Language Model Fusion for Domain Adaptation.
ICASSP
(2023)
Rao Ma
,
Xiaobo Wu
,
Jin Qiu
,
Yanan Qin
,
Haihua Xu
,
Peihao Wu
,
Zejun Ma
Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation.
CoRR
(2022)
Yufei Liu
,
Rao Ma
,
Haihua Xu
,
Yi He
,
Zejun Ma
,
Weibin Zhang
Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR.
CoRR
(2022)
Yufei Liu
,
Rao Ma
,
Haihua Xu
,
Yi He
,
Zejun Ma
,
Weibin Zhang
Internal Language Model Estimation Through Explicit Context Vector Learning for Attention-based Encoder-decoder ASR.
INTERSPEECH
(2022)
Houjun Huang
,
Xu Xiang
,
Yexin Yang
,
Rao Ma
,
Yanmin Qian
AISpeech-SJTU Accent Identification System for the Accented English Speech Recognition Challenge.
ICASSP
(2021)
Houjun Huang
,
Xu Xiang
,
Yexin Yang
,
Rao Ma
,
Yanmin Qian
AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge.
CoRR
(2021)
Tian Tan
,
Yizhou Lu
,
Rao Ma
,
Sen Zhu
,
Jiaqi Guo
,
Yanmin Qian
AISpeech-SJTU ASR System for the Accented English Speech Recognition Challenge.
ICASSP
(2021)
Su Zhu
,
Zijian Zhao
,
Rao Ma
,
Kai Yu
Prior Knowledge Driven Label Embedding for Slot Filling in Natural Language Understanding.
IEEE ACM Trans. Audio Speech Lang. Process.
28 (2020)
Su Zhu
,
Zijian Zhao
,
Rao Ma
,
Kai Yu
Prior Knowledge Driven Label Embedding for Slot Filling in Natural Language Understanding.
CoRR
(2020)
Ruisheng Cao
,
Su Zhu
,
Chenyu Yang
,
Chen Liu
,
Rao Ma
,
Yanbin Zhao
,
Lu Chen
,
Kai Yu
Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing.
CoRR
(2020)
Kai Yu
,
Rao Ma
,
Kaiyu Shi
,
Qi Liu
Neural Network Language Model Compression With Product Quantization and Soft Binarization.
IEEE ACM Trans. Audio Speech Lang. Process.
28 (2020)
Rao Ma
,
Lesheng Jin
,
Qi Liu
,
Lu Chen
,
Kai Yu
Addressing the Polysemy Problem in Language Modeling with Attentional Multi-Sense Embeddings.
ICASSP
(2020)
Zihan Zhao
,
Yuncong Liu
,
Lu Chen
,
Qi Liu
,
Rao Ma
,
Kai Yu
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models.
CoRR
(2020)
Ruisheng Cao
,
Su Zhu
,
Chenyu Yang
,
Chen Liu
,
Rao Ma
,
Yanbin Zhao
,
Lu Chen
,
Kai Yu
Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing.
ACL
(2020)
Rao Ma
,
Hao Li
,
Qi Liu
,
Lu Chen
,
Kai Yu
Neural Lattice Search for Speech Recognition.
ICASSP
(2020)
Zihan Zhao
,
Yuncong Liu
,
Lu Chen
,
Qi Liu
,
Rao Ma
,
Kai Yu
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models.
NLPCC (1)
(2020)
Rao Ma
,
Qi Liu
,
Kai Yu
Highly Efficient Neural Network Language Model Compression Using Soft Binarization Training.
ASRU
(2019)