Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction.
Meishan Zhang, Hao Fei, Bin Wang, Shengqiong Wu, Yixin Cao, Fei Li, Min Zhang
Published in: CoRR (2024)
Keyphrases
- information extraction
- multimodal fusion
- multiple modalities
- multi-modal
- cross-modal
- single modality
- multimodal biometrics
- precision and recall
- text mining
- free text
- natural language processing
- named entities
- structured data
- semi-structured
- information retrieval
- question answering
- textual data
- named entity recognition
- web mining
- machine translation
- relation extraction
- text documents
- multimodal information
- machine learning
- conditional random fields
- multimodal interfaces
- multimodal data
- text processing
- relational learning
- open domain
- multimodal interaction
- semantic tagging
- natural language text
- brain image analysis
- databases
- audio-visual
- visual data
- web documents
- multimedia
- automatic recognition
- co-occurrence
- ontology-based information extraction
- data sets
- neural network
- search engine
- natural language
- information extraction systems
- knowledge representation
- graphical models
- WordNet
- high robustness
- multimedia data
- text summarization