A lingnistically interpreted corpus of German newspaper text.
Wojciech SkutThorsten BrantsBrigitte KrennHans UszkoreitPublished in: LREC (1998)
Keyphrases
- supervised machine learning
- text data
- open domain
- broad coverage
- natural language text
- plain text
- english words
- text retrieval
- newspaper articles
- text collections
- text corpus
- document corpus
- noun phrases
- sentence level
- text mining
- information retrieval
- text corpora
- scientific papers
- anaphora resolution
- document level
- recognizing textual entailment
- temporal expressions
- multiword
- word sense
- free text
- information retrieval systems
- database
- keywords
- lexical features
- topic segmentation
- text processing
- manually annotated
- word pairs
- linguistic information
- natural language processing
- text categorization
- entity extraction
- information extraction systems
- linguistic patterns