Candidate document retrieval for Arabic-based text reuse detection on the web.
Leena Lulu, Boumediene Belkhouche, Saad Harous
Published in: IIT (2016)
Keyphrases
- document retrieval
- text retrieval
- information retrieval
- web documents
- document indexing
- language model
- document collections
- retrieval model
- Arabic text
- Arabic language
- relevance feedback
- relevant documents
- document ranking
- web pages
- cross language
- inverted index
- database
- passage retrieval
- document level
- information access
- query terms
- XML retrieval
- pseudo relevance feedback
- web images
- search engine
- document image retrieval
- text mining
- databases
- question answering systems
- machine learning
- document analysis
- retrieved documents
- free text
- semi structured
- information extraction
- text documents
- test collection
- natural language processing
- semantic web