BAD LUC@WMT 2016: a Bilingual Document Alignment Platform Based on Lucene.
Laurent Jakubina, Philippe Langlais
Published in: WMT (2016)
Keyphrases
- document retrieval
- word alignment
- information retrieval systems
- document ranking
- document collections
- cross language
- document classification
- inverted index
- search engine
- word level
- open source
- sentence pairs
- relevant documents
- machine translation
- query expansion
- information retrieval
- keywords
- retrieval systems
- parallel texts
- document images
- cross lingual
- document clustering
- passage retrieval
- text documents
- user queries
- retrieval model
- image alignment
- retrieved documents
- cross language information retrieval
- statistical machine translation
- source language
- clustering algorithm
- parallel corpora
- tf-idf
- n-gram
- vector space model