Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: a Bayesian Non-Parametric Approach.
Benjamin Snyder, Tahira Naseem, Jacob Eisenstein, Regina Barzilay
Published in: HLT-NAACL (2009)
Keyphrases
- language independent
- cross lingual
- multi lingual
- multilingual information retrieval
- language specific
- multilingual documents
- data driven
- pos tagging
- language resources
- machine translation
- gaussian processes
- semi supervised
- cross lingual information retrieval
- word segmentation
- syntactic parsing
- unsupervised learning
- language identification
- bayesian networks
- grammar induction
- maximum likelihood
- n gram
- posterior probability
- information access
- supervised learning
- expressive power
- chinese word segmentation
- morphological analysis
- language modeling
- gaussian process
- indian languages
- target language
- pos taggers
- natural language processing
- comparable corpora
- named entity recognition
- bayesian inference
- cross language
- information retrieval systems
- part of speech
- linguistic resources
- machine translation system
- statistical machine translation
- dependency parsing
- text summarization