Part-of-speech Sequences and Distribution in a Learner Corpus of English.
Rebecca H. Shih, John Y. Chiang, F. Tien
Published in: ROCLING (2000)
Keyphrases
- part of speech
- pos tagging
- training corpus
- penn treebank
- unknown words
- linguistic features
- multiword
- tree bank
- word sense
- n gram
- parse tree
- lexical information
- natural language processing
- dependency parsing
- machine translation
- linguistic information
- pos taggers
- word sense disambiguation
- statistical machine translation
- chinese word segmentation
- text documents
- noun phrases
- syntactic features
- dependency parser
- syntactic categories
- natural language
- information retrieval systems
- word segmentation
- information retrieval
- domain adaptation
- source language
- text classification
- target language
- information extraction
- named entity recognition