Part of Speech Annotation of a Turkish-German Code-Switching Corpus.
Özlem ÇetinogluÇagri ÇöltekinPublished in: LAW@ACL (2016)
Keyphrases
- part of speech
- pos tagging
- training corpus
- linguistic features
- multiword
- noun phrases
- n gram
- penn treebank
- unknown words
- linguistic information
- natural language processing
- tree bank
- unsupervised grammar induction
- syntactic features
- word sense
- chinese word segmentation
- metadata
- domain adaptation
- word sense disambiguation
- dependency parsing
- syntactic categories
- active learning
- pos taggers
- ambiguous words
- cross language
- parse tree
- similarity measure
- artificial intelligence
- tf idf
- feature selection
- image retrieval
- knowledge discovery
- information extraction
- wordnet
- machine translation