Unsupervised Word Segmentation and Lexicon Discovery Using Acoustic Word Embeddings.
Herman KamperAren JansenSharon GoldwaterPublished in: IEEE ACM Trans. Audio Speech Lang. Process. (2016)
Keyphrases
- word segmentation
- pos tagging
- chinese text retrieval
- word recognition
- chinese word segmentation
- n gram
- handwritten words
- chinese text
- text classification
- word level
- unknown words
- language modeling
- knowledge discovery
- vector space
- language independent
- handwriting recognition
- semi supervised
- lexical semantics
- sparse data
- cross lingual
- document analysis
- supervised learning
- natural language
- data mining
- information retrieval systems
- dimensionality reduction
- pattern recognition
- information retrieval
- machine learning