An Iterative Approach for Mining Parallel Sentences in a Comparable Corpus.
Lise ReboutPhilippe LanglaisPublished in: LREC (2014)
Keyphrases
- sentence level
- lexical features
- semantic roles
- training corpus
- mining algorithm
- natural language
- text corpus
- parallel processing
- sequential patterns
- data mining
- linguistic features
- knowledge discovery
- hand crafted
- inter annotator agreement
- shared memory
- parallel implementation
- parallel computing
- multiword
- annotated corpus
- plain text
- text mining
- link grammar
- tree bank
- data streams
- machine translation system
- document level
- itemsets
- noun phrases
- wordnet
- association rule mining