ANTILLES: An Open French Linguistically Enriched Part-of-Speech Corpus.
Yanis LabrakRichard DufourPublished in: TSD (2022)
Keyphrases
- part of speech
- pos tagging
- training corpus
- noun phrases
- linguistic features
- multiword
- n gram
- linguistic information
- syntactic features
- unknown words
- penn treebank
- natural language processing
- word sense disambiguation
- pos taggers
- unsupervised grammar induction
- chinese word segmentation
- tree bank
- word sense
- syntactic categories
- text documents
- linguistic knowledge
- word segmentation
- parse tree
- domain adaptation
- ambiguous words
- world knowledge
- dependency parsing
- machine translation
- named entity recognition