Increasing Quality of the Corpus of Frequency Dictionary of Contemporary Polish for Morphosyntactic Tagging of the Polish Language.
Marcin KutaPawel ChrzaszczJacek KitowskiPublished in: Comput. Informatics (2009)
Keyphrases
- parallel corpus
- word forms
- part of speech
- multiword
- training corpus
- spanish language
- improve quality
- high quality
- natural language
- language learning
- text corpus
- news corpus
- language independent
- query translation
- sparse representation
- data quality
- linguistic knowledge
- human judgments
- natural language processing
- bilingual dictionaries
- knowledge representation
- pos tagging