Recurrent Neural Network Transducer for Audio-Visual Speech Recognition.
Takaki Makino, Hank Liao, Yannis M. Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan
Published in: CoRR (2019)
Keyphrases
- recurrent neural networks
- audio visual speech recognition
- multi stream
- audio visual
- neural network
- feed forward
- complex valued
- reservoir computing
- hidden layer
- recurrent networks
- artificial neural networks
- neural model
- echo state networks
- speech recognition
- multi modal
- noisy environments
- image retrieval
- neural network structure
- pattern recognition
- video sequences