Login / Signup
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio.
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
Published in:
ASRU (2021)
Keyphrases
</>
audio visual
automatic speech recognition
multimedia
visual information
strengths and weaknesses
data mining
machine learning
bayesian networks
machine learning algorithms
speech recognition