Topic Based Creation of a Persian-English Comparable Corpus.
Zahra Rahimi, Azadeh Shakery
Published in: AIRS (2011)
Keyphrases
- link grammar
- statistical machine translation
- person names
- parallel corpus
- english words
- open domain
- wide coverage
- cross lingual
- broad coverage
- topic segmentation
- training corpus
- sentence pairs
- word sense
- text classification
- mono lingual
- english language
- word sense disambiguation
- penn treebank
- machine translation
- text retrieval
- parallel corpora
- semantic roles
- linguistic features
- hand crafted
- text corpora
- multiword
- conversational speech
- chinese english
- document level
- stop words
- news articles
- document corpus
- topic tracking
- cross language
- wikipedia articles
- machine translation system
- text mining
- natural language processing
- topic models
- topic detection and tracking
- news stories
- unknown words
- word level
- natural language