Donut: Document Understanding Transformer without OCR.
Geewook KimTeakgyu HongMoonbin YimJinyoung ParkJinyeong YimWonseok HwangSangdoo YunDongyoon HanSeunghyun ParkPublished in: CoRR (2021)
Keyphrases
- document understanding
- designing effective
- automatic summarization
- optical character recognition
- document clustering
- automatic text summarization
- multi document summarization
- document images
- document summarization
- clustering algorithm
- text mining
- lexical chains
- graphical models
- clustering method
- language independent