ITA: Image-Text Alignments for Multi-Modal Named Entity Recognition.
Xinyu WangMin GuiYong JiangZixia JiaNguyen BachTao WangZhongqiang HuangFei HuangKewei TuPublished in: CoRR (2021)
Keyphrases
- multi modal
- named entity recognition
- text summarization
- information extraction
- multiple modalities
- image content
- named entities
- input image
- image classification
- uni modal
- video search
- natural language processing
- image representation
- multi modality
- segmentation method
- conditional random fields
- relation extraction
- single modality
- image collections
- maximum entropy
- text mining
- semi supervised
- image retrieval
- named entity recognizer
- information retrieval
- low level
- semantic concepts
- question answering
- segmentation algorithm
- object recognition