Automated Mining of Relevant N-grams in Relation to Predominant Topics of Text Documents.
Jan ZizkaFrantisek DarenaPublished in: TSD (2015)
Keyphrases
- knowledge base
- text documents
- n gram
- text classification
- text mining
- bag of words
- wordnet
- part of speech
- topic models
- relevant concepts
- news articles
- text data
- language model
- text categorization
- variable length
- text collections
- document clustering
- document representation
- topic modeling
- expert systems
- tf idf
- keywords
- text corpora
- information retrieval
- knowledge discovery
- information extraction
- machine learning
- term frequency
- image classification
- knn
- labeled data
- natural language processing
- named entities
- semi supervised learning
- k nearest neighbor
- association rules
- real world