Wukong-CMNER: A Large-Scale Chinese Multimodal NER Dataset with Images Modality.
Xigang BaoShouhui WangPengnian QiBiao QinPublished in: DASFAA (3) (2023)
Keyphrases
- image dataset
- image data
- image database
- multi modal
- ground truth
- input image
- million images
- text summarization
- edge detection
- three dimensional
- multiple modalities
- information extraction
- image set
- image matching
- feature points
- test images
- image registration
- image collections
- nus wide
- image regions
- image features
- conditional random fields
- street view
- similarity measure
- image analysis
- named entity recognition
- image annotation
- natural language processing
- image classification
- automatic annotation
- photo collections
- image retrieval
- multiscale
- segmentation method
- named entities
- multimodal image registration
- image segmentation