Document image segmentation and text area ordering.
Takashi SaitohMichiyoshi TachikawaToshifumi YamaaiPublished in: ICDAR (1993)
Keyphrases
- image segmentation
- text documents
- digital documents
- keywords
- document analysis
- information retrieval
- web documents
- document processing
- textual content
- text content
- printed documents
- multimedia documents
- text clustering
- document content
- scientific documents
- database
- document structure
- multiscale
- semantic information
- text corpus
- latent semantic analysis
- document categorization
- document corpus
- text representation
- text collections
- scientific papers
- document collections
- text summarization
- document images
- technical papers
- page layout analysis
- keyword extraction
- document classification
- free text
- text mining
- document level
- graph cuts
- textual documents
- automatic text summarization
- electronic documents
- related documents
- active contours
- markov random field
- retrieval systems
- document set
- extractive summarization
- pdf files
- text retrieval
- document representation
- handwritten text
- structured documents
- information extraction
- computer vision
- text classifiers
- text lines
- retrieval engine
- information retrieval systems
- image processing
- topic models
- content and structure