Processing Comparable Corpora With Bilingual Suffix Trees.
Dragos Stefan Munteanu, Daniel Marcu
Published in: EMNLP (2002)
Keyphrases
- comparable corpora
- suffix tree
- cross language information retrieval
- parallel corpora
- bilingual lexicon
- news articles
- language modeling
- terminology extraction
- machine translation
- data structure
- cross lingual
- word pairs
- bilingual dictionaries
- pattern matching
- text corpora
- cross language
- query translation
- parallel corpus
- database
- language model
- linguistic resources
- translation model
- digital libraries
- high dimensional
- multi dimensional
- text documents