Structure recognition and information extraction from tabular documents.
Surekha Chandran, Sanjay Balasubramanian, Tarak Gandhi, Arathi Prasad, Rangachar Kasturi, Atul K. Chhabra
Published in: Int. J. Imaging Syst. Technol. (1996)
Keyphrases
- information extraction
- free text
- document analysis
- information retrieval
- text documents
- recognition rate
- web documents
- pattern recognition
- object recognition
- feature extraction
- recognition accuracy
- document collections
- information retrieval systems
- unstructured documents
- keywords
- metadata
- content and structure
- structured data
- document structure
- precision and recall
- document classification
- textual data
- machine learning
- co-occurrence
- semantic structure
- structured information
- search engine
- structural analysis
- natural language text
- structural information
- semi-structured
- digital libraries
- retrieval systems
- XML documents
- natural language processing