Information content measures of semantic similarity between documents based on Hadoop system.
Marouane BirjaliAbderrahim Beni HssaneMohammed ErritaliYouness MadaniPublished in: WINCOM (2016)
Keyphrases
- semantic similarity
- information content
- vector space model
- semantic relationships
- document similarity
- semantic information
- related documents
- sentence similarity
- cosine similarity
- co occurrence
- semantic content
- word pairs
- similarity measure
- semantic relatedness
- wordnet
- word similarity
- lexical database
- human judgments
- document collections
- semantically similar
- semantic similarity computation
- text documents
- information retrieval systems
- keywords
- document representation
- information retrieval
- average precision
- relevant documents
- visual similarity
- web documents
- metadata
- document clustering
- computer vision
- database
- high level
- query language
- image registration
- language model
- tf idf