GMN: Generative Multi-modal Network for Practical Document Information Extraction.
Haoyu CaoJiefeng MaAntai GuoYiqing HuHao LiuDeqiang JiangYinsong LiuBo RenPublished in: NAACL-HLT (2022)
Keyphrases
- multi modal
- information extraction
- information retrieval
- web documents
- text documents
- multi modality
- audio visual
- text mining
- text summarization
- image annotation
- cross modal
- machine learning
- document collections
- high dimensional
- natural language processing
- information retrieval systems
- probabilistic model
- generative model
- retrieval systems
- feature extraction
- semantic concepts
- video search