Sign in

4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders.

Yui SudoMuhammad ShakeelBrian YanJiatong ShiShinji Watanabe
Published in: CoRR (2022)
Keyphrases
  • search engine
  • mobile robot
  • visual attention
  • data mining
  • decision making
  • multiresolution
  • multi class
  • speech recognition
  • automatic speech recognition
  • modeling method