Automated layout preservation in cross language translation of document: an integrated approach and implementation.
Vivek YadavChandrashekar RamanathanPublished in: COMPUTE (2014)
Keyphrases
- cross language
- document collections
- document retrieval
- cross language information retrieval
- text retrieval
- translation probabilities
- spoken document retrieval
- question answering
- cross lingual
- query translation
- cross language retrieval
- language independent
- source language
- cf loadingtexthtml
- text categorization
- information access
- information retrieval
- language model
- plagiarism detection
- retrieval systems
- multilingual retrieval
- information retrieval systems
- query terms
- parallel corpora
- relevant documents
- document clustering
- digital libraries
- bilingual lexicon
- test collection
- retrieval model
- machine translation
- text documents
- document images
- translation model
- comparable corpora
- co occurrence
- information extraction
- character n grams
- multimedia
- machine learning