Business Document Information Extraction: Towards Practical Benchmarks.
Matyás Skalický, Stepán Simsa, Michal Uricár, Milan Sulc
Published in: CoRR (2022)
Keyphrases
- information extraction
- web documents
- text documents
- information retrieval
- natural language processing
- unstructured documents
- text mining
- precision and recall
- real world
- data mining
- business processes
- document classification
- free text
- machine learning
- retrieval systems
- relation extraction
- document retrieval
- extracting meaningful
- electronic commerce
- business intelligence
- structured data
- semantic information
- information systems
- document collections
- text summarization
- information retrieval systems
- semi structured
- document clustering
- business models
- textual data
- database
- case study
- document analysis
- return on investment
- question answering
- business rules
- vector space model
- knowledge management
- text categorization
- web mining
- machine translation
- test collection
- business process