Machine learning approach for text and document mining.
Vishwanath BijalwanPinki KumariJordán Pascual EspadaVijay Bhaskar SemwalPublished in: CoRR (2014)
Keyphrases
- text mining
- text documents
- machine learning
- textual documents
- text clustering
- document clustering
- document classification
- information retrieval
- keywords
- text classification
- data mining
- document processing
- digital documents
- knowledge discovery
- document analysis
- information extraction
- text data
- document content
- supervised machine learning
- text collections
- text content
- natural language processing
- web mining
- textual content
- scientific papers
- textual data
- text retrieval
- web documents
- retrieval systems
- database
- named entities
- multimedia documents
- document corpus
- document representation
- latent semantic analysis
- noun phrases
- page layout analysis
- extractive summarization
- text corpus
- text processing
- text classifiers
- news articles
- scientific documents
- text summarization
- document collections
- information retrieval systems
- printed documents
- free text
- handwritten text
- keyword extraction
- electronic documents
- related documents
- document structure
- retrieval engine
- itemsets
- text categorization
- decision trees
- automatic text summarization
- vector space model
- document retrieval
- topic models
- document images