Table Recognition in Heterogeneous Documents Using Machine Learning.
Sheikh Faisal Rashid, Abdullah Akmal, Muhammad Adnan, Ali Adnan Aslam, Andreas Dengel
Published in: ICDAR (2017)
Keyphrases
- machine learning
- pattern recognition
- document analysis
- automatic text categorization
- word spotting
- object recognition
- recognition accuracy
- database
- handwritten text
- information retrieval
- document collections
- web documents
- information retrieval systems
- recognition rate
- heterogeneous collections
- automatic recognition
- document clustering
- data mining
- machine learning methods
- text documents
- information extraction
- natural language processing
- feature extraction
- handwriting recognition
- xml documents
- recognition process
- recognition algorithm
- data analysis
- free text
- computer vision
- metadata
- keywords
- printed documents
- vector space model
- document classification
- character recognition
- inductive logic programming
- knowledge discovery
- text mining
- text classification
- ranked list
- natural language
- knowledge representation
- activity recognition
- document retrieval
- relevant documents