Table of Contents Recognition in OCR Documents using Image-based Machine Learning.
Sai KosarajuNelson Zange TsakuPritesh PatelTanju BayramogluGirish ModgilMingon KangPublished in: ACM Southeast Regional Conference (2019)
Keyphrases
- table of contents
- printed documents
- document analysis
- machine learning
- ocr systems
- character recognition
- optical character recognition
- document images
- document processing
- text lines
- database marketing
- handwriting recognition
- scanned documents
- scanned images
- information retrieval
- xml documents
- web documents
- page layout
- oracle database
- handwritten documents
- text mining
- preprocessing
- website
- metadata
- distributed databases
- databases
- database