A Preliminary Study on Probabilistic Models for Chinese Abbreviations.
Jing-Shin ChangYu-Tso LaiPublished in: SIGHAN@ACL (2004)
Keyphrases
- probabilistic model
- word segmentation
- graphical models
- language modeling
- generative model
- expectation maximization
- language model
- mixture model
- bayesian inference
- hidden variables
- latent variables
- bayesian networks
- chinese text
- conditional random fields
- chinese language
- n gram
- natural language interface to databases
- text processing
- web corpora
- prior knowledge
- language learning
- topic models
- information retrieval systems
- information systems
- information retrieval
- data sets