CodE Alltag: A German-Language E-Mail Corpus.
Ulrike Krieg-HolzChristian SchuschnigFranz MatthiesBenjamin RedlingUdo HahnPublished in: LREC (2016)
Keyphrases
- parallel corpus
- programming language
- spanish language
- manually annotated
- language learning
- natural language
- spam filtering
- test set
- source code
- java virtual machine
- word forms
- bilingual lexicon
- text classification
- cross lingual
- language independent
- computer programs
- text mining
- natural language processing
- comparable corpora
- linguistic patterns
- error handling