Constructing Word-Based Text Compression Algorithms.
R. Nigel HorspoolGordon V. CormackPublished in: Data Compression Conference (1992)
Keyphrases
- compression algorithm
- image compression
- data compression
- compression ratio
- sentence level
- related words
- text corpus
- word pairs
- keywords
- quadtree decomposition
- bitstream
- noun phrases
- english words
- wavelet based image
- natural language text
- string matching
- linguistic information
- compression scheme
- text input
- word counts
- word level
- english text
- lexical features
- information retrieval
- multiword
- file size
- lossless data compression
- jpeg images
- co occurrence
- n gram
- text mining
- handwritten words
- word segmentation
- wavelet compression
- subband
- information extraction