mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding.
Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Yuhao Dan, Chenlin Zhao, Guohai Xu, Chenliang Li, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang
Published in: CoRR (2023)
Keyphrases
- language model
- document understanding
- designing effective
- automatic summarization
- document clustering
- automatic text summarization
- language modeling
- multi-document summarization
- n-gram
- probabilistic model
- document retrieval
- retrieval model
- information retrieval
- test collection
- query expansion
- language independent
- vector space model
- text mining
- relevance model
- document representation
- cross-lingual
- text summarization
- pseudo relevance feedback
- query terms
- bag of words
- translation model
- document ranking
- text categorization
- clustering algorithm
- web search
- feature selection