Separation of Text and Non-text in Document Layout Analysis using a Recursive Filter.
Tuan Anh TranIn Seop NaSoo-Hyung KimPublished in: KSII Trans. Internet Inf. Syst. (2015)
Keyphrases
- information retrieval
- text documents
- web documents
- text collections
- keywords
- digital documents
- text mining
- document processing
- text content
- document analysis
- free text
- document categorization
- multimedia documents
- text clustering
- textual data
- document images
- information retrieval systems
- latent semantic analysis
- database
- document clustering
- textual content
- textual documents
- text corpus
- semantic information
- document content
- information extraction
- scientific papers
- automatic text summarization
- extractive summarization
- technical papers