Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices.
Jahn HeymannOliver WalterReinhold Haeb-UmbachBhiksha RajPublished in: ICASSP (2014)
Keyphrases
- word segmentation
- pos tagging
- chinese text retrieval
- word recognition
- n gram
- handwriting recognition
- language independent
- chinese text
- bayesian networks
- semi supervised
- speech recognition
- text classification
- document analysis
- unsupervised learning
- chinese word segmentation
- knowledge discovery
- cross lingual
- automatic speech recognition
- part of speech
- data mining
- information retrieval
- speech signal
- language modeling
- machine translation
- information extraction