Lempel-Ziv compression of highly structured documents.
Joaquín AdiegoGonzalo NavarroPablo de la FuentePublished in: J. Assoc. Inf. Sci. Technol. (2007)
Keyphrases
- highly structured
- lempel ziv
- data compression
- compression scheme
- lossless compression
- approximate string matching
- document collections
- image compression
- arithmetic coding
- compression ratio
- information retrieval
- compression algorithm
- source coding
- suffix tree
- web documents
- information retrieval systems
- document classification
- entropy coding
- n gram
- xml documents
- document retrieval
- relevant documents
- high order
- image coding
- suffix array
- patient records
- keywords