GMN: Generative Multi-modal Network for Practical Document Information Extraction.
Haoyu Cao, Jiefeng Ma, Antai Guo, Yiqing Hu, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren
Published in: CoRR (2022)
Keyphrases
- multi modal
- information extraction
- web documents
- multi modality
- audio visual
- information retrieval
- generative model
- cross modal
- image annotation
- text documents
- document collections
- high dimensional
- text mining
- information retrieval systems
- uni modal
- fusing multiple
- x ray
- medical images
- natural language processing
- probabilistic model
- natural language
- video sequences
- keywords
- image processing