Test collection recycling for semantic text similarity.
Faisal RahutomoTeruaki KitasukaMasayoshi AritsugiPublished in: iiWAS (2012)
Keyphrases
- test collection
- sentence similarity
- information retrieval
- semantic similarity
- text representation
- newspaper articles
- retrieval model
- semantic information
- retrieval effectiveness
- document collections
- search tasks
- relevant documents
- text retrieval
- document set
- anchor text
- language model
- retrieval systems
- similarity measure
- average precision
- vector space model
- word pairs
- natural language
- relevance assessments
- relevance judgments
- evaluation methodology
- relevance judgements
- ir evaluation
- trec web
- text mining
- keywords
- chinese web
- spoken document retrieval
- text classification
- trec web track
- evaluation of information retrieval systems
- text summarization
- document clustering
- text documents
- web documents
- co occurrence
- document content
- information seeking
- ranking functions
- information retrieval systems
- multimedia
- search engine