Tolerance Rough Set-Based Bag-of-Words Model for Document Representation.
Dong QiuHaihuan JiangRuiteng YanPublished in: Int. J. Comput. Intell. Syst. (2020)
Keyphrases
- document representation
- bag of words
- document clustering
- document collections
- vector space model
- web documents
- data fusion
- language model
- vector space
- text documents
- vector representation
- document categorization
- semantic information
- document content
- text data
- background knowledge
- image classification
- text classification
- image representation
- document retrieval
- information retrieval
- data mining
- k nearest neighbor
- text mining
- information extraction
- feature extraction
- high level
- machine learning