Multimodal Audio-textual Architecture for Robust Spoken Language Understanding.
Anderson R. Avila, Mehdi Rezagholizadeh, Chao Xing
Published in: CoRR (2023)
Keyphrases
- language understanding
- natural language
- natural language understanding
- multimedia
- dialogue management
- language processing
- dialogue system
- semantic interpretation
- audio visual
- general knowledge
- cognitive psychology
- spoken dialogue systems
- knowledge representation
- keywords
- semantic analysis
- visual information
- multi agent