Layout and Language: Exploring Text Block Discovery in Tables Using Linguistic Resources.
Matthew HurstPublished in: ICDAR (2001)
Keyphrases
- linguistic resources
- domain dependent
- text collections
- domain independent
- information retrieval systems
- cross language information retrieval
- text documents
- domain specific
- database
- information retrieval
- text categorization
- document collections
- text retrieval
- knowledge discovery
- free text
- cross lingual
- natural language
- text clustering
- text corpora
- text mining
- textual data
- semantic content
- data structure