Words in Contexts: Digital Editions of Literary Journals in the "AAC - Austrian Academy Corpus".
Hanno BiberEvelyn BreitenederKarlheinz MörthPublished in: LREC (2008)
Keyphrases
- english words
- word frequencies
- word pairs
- multiword
- text corpora
- unknown words
- text corpus
- lexical features
- text analysis
- person names
- spontaneous speech
- linguistic information
- training corpus
- word co occurrence
- noun phrases
- digital libraries
- ambiguous words
- text documents
- related words
- pos tagging
- conversational speech
- word sense disambiguation
- world knowledge
- textual features
- word frequency
- n gram
- stop words
- word sense
- document level
- manually annotated
- sentence level
- word segmentation
- natural language text
- coreference resolution
- keywords
- semantic roles
- wordnet
- parallel texts