A Linguistically Interpreted Corpus of German Newspaper Text
Wojciech SkutThorsten BrantsBrigitte KrennHans UszkoreitPublished in: CoRR (1998)
Keyphrases
- supervised machine learning
- broad coverage
- text data
- plain text
- open domain
- english words
- linguistic patterns
- recognizing textual entailment
- text corpus
- newspaper articles
- lexical features
- free text
- natural language text
- training corpus
- text corpora
- linguistic information
- sentence level
- information extraction systems
- text collections
- text retrieval
- multiword
- topic segmentation
- word sense
- world knowledge
- textual data
- text documents
- information extraction
- anaphora resolution
- information retrieval
- named entity disambiguation
- topic tracking
- spontaneous speech
- scientific papers
- textual features
- text processing
- noun phrases
- text mining
- linguistic knowledge
- computational linguistics
- text classification
- keywords