Exploiting Unlabeled Text to Extract New Words of Different Semantic Transparency for Chinese Word Segmentation.
Richard Tzong-Han Tsai, Hsi-Chuan Hung
Published in: IJCNLP (2008)
Keyphrases
- Chinese word segmentation
- topic tracking
- natural language understanding
- word segmentation
- word pairs
- semantic information
- POS tagging
- text classification
- news stories
- syntactic analysis
- information filtering
- topic models
- topic detection and tracking
- labeled data
- natural language text
- natural language
- text documents
- language specific
- news video
- semantic analysis
- video search
- keywords
- text mining
- unlabeled data
- news events
- knowledge representation
- document analysis
- semi-supervised learning
- multiword
- natural language processing
- part of speech
- n-gram
- news articles
- text retrieval
- word sense disambiguation
- dependency parsing
- machine learning
- active learning
- semantic features