FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction.
Chen-Yu LeeChun-Liang LiTimothy DozatVincent PerotGuolong SuNan HuaJoshua AinslieRenshen WangYasuhisa FujiiTomas PfisterPublished in: CoRR (2022)
Keyphrases
- information extraction
- web documents
- text documents
- information retrieval
- document collections
- free text
- information retrieval systems
- question answering
- document images
- structural information
- document clustering
- natural language processing
- text mining
- machine learning
- named entities
- keywords
- database
- structured data
- document retrieval
- semi structured
- query processing
- modeling method
- structural analysis