Better Text Compression from Fewer Lexical n-Grams.
Tony C. SmithMichelle LorenzPublished in: Data Compression Conference (2001)
Keyphrases
- n gram
- text compression
- variable length
- language model
- language independent
- text classification
- wordnet
- compression scheme
- natural language processing
- context sensitive
- language modelling
- bag of words
- character n grams
- part of speech
- word segmentation
- language modeling
- word sense disambiguation
- artificial intelligence
- inside outside algorithm
- image quality
- multiresolution
- keywords
- multiscale