Review of Copy Detection Techniques for Monolingual Natural-Language Documents.
Ruoyun Xie, Lijun Zhu, Yonghong Cheng
Published in: WI (2018)
Keyphrases
- copy detection
- natural language
- question answering
- information retrieval
- document retrieval
- passage retrieval
- natural language questions
- machine translation
- query expansion
- web retrieval
- ad hoc retrieval
- document collections
- natural language text
- linguistic analysis
- relevant documents
- information retrieval systems
- question answering systems
- natural language processing
- multilingual information retrieval
- written in natural language
- cross language
- tasks in natural language processing
- web documents
- text retrieval
- document classification
- metadata
- retrieved documents
- information extraction
- parallel corpus
- source language
- cross lingual
- cross language information retrieval
- document clustering
- keywords
- text documents
- machine learning
- language model
- semantic information
- parallel corpora
- retrieval systems
- machine translation system
- statistical machine translation
- query terms
- pseudo relevance feedback
- language independent