A Corpus for Automatic Readability Assessment and Text Simplification of German.
Alessia Battisti, Sarah Ebling
Published in: CoRR (2019)
Keyphrases
- supervised machine learning
- broad coverage
- open domain
- semi automatically
- text retrieval
- natural language text
- newspaper articles
- text data
- recognizing textual entailment
- text mining
- spontaneous speech
- text collections
- multiresolution
- automatic text
- information retrieval
- anaphora resolution
- named entity disambiguation
- natural language processing
- keywords
- multiword
- training corpus
- text corpus
- linguistic information
- lexical features
- word sense
- sentence level
- noun phrases
- english words
- textual data
- database
- information extraction
- linguistic patterns
- semi automatic
- free text
- textual features
- cross language
- plain text
- document level
- linguistic features
- manually annotated
- text processing