The Szeged Corpus: A POS Tagged and Syntactically Annotated Hungarian Natural Language Corpus.
Dóra CsendesJános CsirikTibor GyimóthyPublished in: TSD (2004)
Keyphrases
- manually annotated
- natural language
- annotated corpus
- linguistic features
- training corpus
- machine learning
- reference resolution
- text corpora
- natural language text
- relation extraction
- part of speech
- test set
- neural network
- semantic representation
- natural language processing
- information extraction
- ground truth
- supervised machine learning
- artificial intelligence
- genia corpus