Building a Heterogeneous Information Retrieval Collection of Printed Arabic Documents.
Abdelrahim AbdelsaporNoha AdlyKareem DarwishOssama EmamWalid MagdyMagdi NagiPublished in: LREC (2006)
Keyphrases
- information retrieval
- document collections
- information retrieval systems
- search engine
- document retrieval
- information access
- arabic documents
- language modeling
- test collection
- query expansion
- text mining
- information extraction
- retrieval systems
- relevant documents
- relevance feedback
- tf idf
- optical character recognition