Dialogue Situation Recognition in Everyday Conversation From Audio, Visual, and Linguistic Information.
Yuya Chiba, Ryuichiro Higashinaka
Published in: IEEE Access (2023)
Keyphrases
- audio visual
- linguistic information
- multi modal
- natural language
- visual information
- multimedia
- object recognition
- semantic information
- linguistic features
- structural information
- visual data
- domain knowledge
- part of speech
- action recognition
- dialogue system
- search engine
- visual features
- knn
- probabilistic model
- computer vision