Using Fuzzy-Word Correlation Factors to Compute Document Similarity Based on Phrase Matching.
Jun won Lee, Yiu-Kai Ng
Published in: FSKD (2) (2007)
Keyphrases
- noun phrases
- keywords
- word level
- short list
- document images
- multiword
- document level
- syntactic analysis
- fuzzy sets
- word frequency
- training corpus
- web documents
- matching algorithm
- text corpus
- latent topics
- information retrieval systems
- co-occurrence
- n-gram
- pattern matching
- fuzzy logic
- search engine
- string matching
- term weighting
- term frequency
- document clustering
- fuzzy rules
- document retrieval
- text documents
- compound words
- document collections
- tf-idf
- word co-occurrence
- information retrieval
- similarity estimation
- document content
- document image retrieval
- word clouds
- keyword extraction
- automatic summarization
- related words
- word recognition
- sentiment classification
- relevant documents
- retrieval systems
- user queries