Indexing and weighting of multilingual and mixed documents.
Mohammed MustafaIzzedin OsmanHussein SulemanPublished in: SAICSIT (2011)
Keyphrases
- information retrieval
- document indexing
- index terms
- document processing
- word spotting
- document analysis
- multilingual documents
- document collections
- text retrieval
- retrieval engine
- database
- information retrieval systems
- heterogeneous collections
- digital libraries
- term weighting
- tf idf
- text documents
- weighting schemes
- parallel corpus
- chinese text retrieval
- retrieval process
- document retrieval
- document clustering
- vector space model
- metadata
- web documents
- retrieval strategies
- language independent
- content based retrieval
- multimedia databases
- effective retrieval
- controlled vocabulary
- cross lingual
- multilingual search
- cross language
- similarity measure
- text mining
- multilingual information retrieval
- keywords
- search engine
- retrieved documents
- document space
- indian languages
- co occurrence
- document representation
- document images
- inverted index