Advancements in Financial Document Structure Extraction: Insights from Five Years of FinTOC (2019-2023).
Juyeon KangMauli Mehulkumar PatelAnushka AgrawalSimhadri SevithaR. SrinivasaSandra BellatoM. Anand KumarNgawang Dempa TsangMo El-HajPublished in: BigData (2023)
Keyphrases
- structure extraction
- document structure
- document layout
- inex book track
- xml documents
- relevant documents
- document collections
- document representation
- information retrieval systems
- document images
- web documents
- information retrieval
- data mining
- database
- retrieval systems
- document retrieval
- structured documents
- digital libraries
- keywords
- semantic information
- query terms
- query language
- xml elements