VEC-MNER: Hybrid Transformer with Visual-Enhanced Cross-Modal Multi-level Interaction for Multimodal NER.
Pengfei Wei
Hongjun Ouyang
Qintai Hu
Bi Zeng
Guang Feng
Qingpeng Wen
Published in:
ICMR (2024)
Keyphrases
cross modal
multi modal
named entity recognition
multimedia retrieval
information extraction
perceptual information
multimedia databases
visual similarity
image retrieval
natural language processing
human computer interaction
visual recognition
visual data
named entities
machine learning