SYN2020: A New Corpus of Czech with an Innovated Annotation.
Tomás JelínekJan KrivanVladimír PetkevicHana SkoumalováJana SindlerováPublished in: TDS (2021)
Keyphrases
- annotated corpus
- hand crafted
- manually annotated
- active learning
- manual annotation
- metadata
- automatically generated
- language independent
- semantic annotation
- automatic annotation
- semi automatically
- supervised machine learning
- named entity recognition
- automatic image annotation
- inter annotator agreement
- relation extraction
- image retrieval
- data sets
- image annotation
- sentence level
- linguistic features
- text retrieval
- semantic web
- ground truth
- annotation tool