Tailoring Text Using Topic Words: Selection and Compression.
Timm EulerPublished in: DEXA Workshops (2002)
Keyphrases
- document content
- text corpora
- text documents
- short texts
- keywords
- short text
- word frequency
- word pairs
- document level
- text recognition
- related words
- english words
- concept space
- topic models
- compressed text
- text corpus
- chinese text
- world knowledge
- topic segmentation
- word co occurrence
- text mining
- multiword
- syntactic categories
- text compression
- automatic summarization
- news articles
- linguistic information
- text data
- compression algorithm
- wikipedia articles
- topic detection
- lexical features
- text retrieval
- semantically related
- lexical chains
- conversational speech
- textual features
- punctuation marks
- writing style
- n gram
- arabic language
- topic hierarchy
- image compression
- related documents
- news stories
- word level
- information retrieval
- natural language text
- topic modeling
- word sense disambiguation
- stop words
- document representation
- printed text
- text summarization
- multi document summarization
- sentence level
- handwritten documents
- spontaneous speech
- text representation