Mining Relevant Text from Unlabelled Documents.
Daniel BarbaráCarlotta DomeniconiNing KangPublished in: ICDM (2003)
Keyphrases
- text mining
- text documents
- scientific literature
- information retrieval
- web documents
- free text
- digital documents
- text analysis
- text data
- text analytics
- text content
- document analysis
- keywords
- document content
- textual content
- newspaper articles
- text retrieval
- text collections
- document classification
- document collections
- automatic categorization
- textual data
- document processing
- text information
- document clustering
- printed documents
- document categorization
- latent semantic analysis
- retrieve documents
- text classification
- document structure
- natural language text
- highly relevant
- electronic documents
- textual information
- text corpus
- plagiarism detection
- text corpora
- text categorization
- page layout
- data mining
- document set
- knowledge discovery
- document retrieval
- web mining
- information retrieval systems
- automatic summarization
- text classifiers
- semantically related
- journal articles
- topic segmentation
- handwritten text
- semantic information
- search engine
- medical literature
- natural language processing
- metadata
- document level
- multimedia documents
- key concepts
- digital libraries
- semi supervised learning
- related documents
- text lines
- news stories
- ranked list
- relevant documents
- retrieval systems
- user queries
- information extraction
- wordnet
- relevance feedback