Cross-Lingual Document Representation and Semantic Similarity Measure: A Fuzzy Set and Rough Set Based Approach.
Hsun-Hui Huang, Yau-Hwang Kuo
Published in: IEEE Trans. Fuzzy Syst. (2010)
Keyphrases
- rough sets
- cross-lingual
- document representation
- fuzzy sets
- document clustering
- rough set theory
- machine translation
- vector space model
- text classification
- fuzzy logic
- data analysis
- data mining
- text documents
- pattern recognition
- language model
- bag of words
- semantic similarity
- document collections
- web documents
- control system
- k-means
- co-occurrence
- information extraction
- vector space
- training set
- clustering algorithm
- databases