Login / Signup
A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio.
Naoyuki Kanda
Xiong Xiao
Jian Wu
Tianyan Zhou
Yashesh Gaur
Xiaofei Wang
Zhong Meng
Zhuo Chen
Takuya Yoshioka
Published in:
CoRR (2021)
Keyphrases
</>
speech recognition
automatic speech recognition
multimedia
audio visual
information retrieval
decision trees
speaker identification