Automatic Identification of Document Translations in Large Multilingual Document Collections
Bruno PouliquenRalf SteinbergerCamelia IgnatPublished in: CoRR (2006)
Keyphrases
- document collections
- automatic identification
- cross language
- digital libraries
- information retrieval systems
- document retrieval
- document clustering
- query translation
- text retrieval
- document representation
- information retrieval
- relevant documents
- test collection
- document clusters
- similar documents
- index terms
- cross language information retrieval
- text collections
- language independent
- ad hoc retrieval
- topic detection
- cross lingual
- machine translation
- bilingual dictionaries
- scatter gather
- document archives
- query terms
- document set
- barcode
- parallel corpora
- retrieval systems
- text mining
- active learning
- document space