Two-part segmentation of text documents.
P. DeepakKarthik VisweswariahNirmalie WiratungaSadiq SaniPublished in: CIKM (2012)
Keyphrases
- text documents
- text mining
- information extraction
- text categorization
- text classification
- keywords
- topic models
- document classification
- image segmentation
- news articles
- text data
- wordnet
- document clustering
- named entities
- textual information
- tf idf
- bag of words
- text collections
- multiscale
- text corpus
- real world
- information retrieval
- image processing
- question answering
- natural language processing
- probabilistic model
- expert systems
- data sets
- information extraction systems