Hierarchical & multimodal video captioning: Discovering and transferring multimodal knowledge for vision to language.
An-An LiuNing XuYongkang WongJunnan LiYuting SuMohan S. KankanhalliPublished in: Comput. Vis. Image Underst. (2017)
Keyphrases
- multimedia
- multi modal
- multimodal interfaces
- video sequences
- real time
- multimodal interaction
- knowledge acquisition
- language learning
- video data
- multimodal information
- audio visual
- knowledge sharing
- computer vision
- programming language
- domain knowledge
- knowledge base
- natural language
- prior knowledge
- knowledge transfer
- knowledge management
- story segmentation
- image sequences
- transfer learning
- space time
- vision system
- moving objects
- conceptual model
- video frames
- concept hierarchy
- conceptual graphs
- multiple modalities
- human computer interaction