A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio.

Naoyuki Kanda, Xiong Xiao, Jian Wu, Tianyan Zhou, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Takuya Yoshioka
Published in: ASRU (2021)
Keyphrases
  • audio visual
  • automatic speech recognition
  • multimedia
  • visual information
  • strengths and weaknesses
  • data mining
  • machine learning
  • bayesian networks
  • machine learning algorithms
  • speech recognition