PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora.
Radu Ion
Published in: LREC (2012)
Keyphrases
- mining algorithm
- comparable corpora
- pattern mining
- cross-language information retrieval
- association rules
- association rule mining
- tree structure
- frequent patterns
- itemsets
- sequential patterns
- data mining algorithms
- frequent itemsets
- parallel corpora
- news articles
- machine translation
- natural language
- sentence level
- language modeling
- feature extraction
- n-gram
- part of speech
- knowledge discovery