The DReaM Corpus: A Multilingual Annotated Corpus of Grammars for the World's Languages.
Shafqat Mumtaz VirkHarald HammarströmMarkus ForsbergSøren WichmannPublished in: LREC (2020)
Keyphrases
- semantic annotation
- annotated corpus
- automatic annotation
- language independent
- cross lingual
- context free grammars
- context free
- named entity recognition
- named entities
- relation extraction
- comparable corpora
- natural language processing
- digital libraries
- cross language
- english words
- high level
- machine translation
- cross language information retrieval
- parallel corpora
- medline abstracts
- manually annotated
- n gram
- text classification
- information extraction
- natural language