Login / Signup

State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention with Dilated 1D Convolutions.

Kyu J. HanRamon PrietoTao Ma
Published in: ASRU (2019)
Keyphrases
  • multi stream
  • audio visual
  • audio visual speech recognition
  • hidden markov models
  • multi modal
  • grey level
  • emotion recognition
  • multimedia
  • image retrieval
  • image features
  • speech recognition
  • visual information