Revealing Prevailing Semantic Contents of Clusters Generated from Untagged Freely Written Text Documents in Natural Languages.
Jan ZizkaFrantisek DarenaPublished in: TSD (2013)
Keyphrases
- text documents
- natural language
- document clustering
- text mining
- information extraction
- text representation
- wordnet
- text classification
- text categorization
- semantic interpretation
- natural language understanding
- keywords
- semantic features
- named entities
- topic models
- semantic information
- bag of words
- semantic representation
- clustering algorithm
- semantic similarity
- semantic network
- semantic analysis
- natural language processing
- question answering
- text collections
- expert systems
- data points
- text corpus
- automatic text categorization
- computer vision
- n gram
- document collections
- image representation
- knowledge representation
- data analysis
- relevant concepts
- real world