Separation of Textual and Non-textual Information within Mixed-Mode Documents.
Frank HönesRainer ZimmerPublished in: MVA (1992)
Keyphrases
- textual information
- mixed mode
- text documents
- web documents
- keywords
- textual data
- visual information
- visual content
- contextual information
- low level features
- financial reports
- text classification
- document collections
- topic models
- news stories
- visual features
- text mining
- bag of words
- multimedia content
- code generation
- information extraction