Gaining Extra Supervision via Multi-task learning for Multi-Modal Video Question Answering.
Junyeong Kim, Minuk Ma, Kyungsu Kim, Sungjin Kim, Chang D. Yoo
Published in: IJCNN (2019)
Keyphrases
- multi modal
- question answering
- multi task learning
- multi task
- video search
- semantic concepts
- learning tasks
- information retrieval
- gaussian processes
- information extraction
- natural language
- video data
- high order
- natural language processing
- video sequences
- transfer learning
- multimedia
- learning algorithm
- video content
- video frames
- image annotation
- video retrieval
- key frames
- audio visual
- high dimensional
- active learning
- metric learning
- machine learning