Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems.
Hung LeDoyen SahooNancy F. ChenSteven C. H. HoiPublished in: CoRR (2019)
Keyphrases
- end to end
- dialogue system
- high bandwidth
- internet protocol
- scalable video
- wireless ad hoc networks
- multimedia
- video sequences
- differentiated services
- video streams
- video content
- admission control
- congestion control
- spoken dialogue systems
- video data
- natural language
- real time
- human users
- tutorial dialogue
- ad hoc networks
- video frames
- content delivery
- multiple description coding