Detection of significant patterns by compression algorithms: the case of approximate tandem repeats in DNA sequences.
Eric Rivals, Olivier Delgrange, Jean-Paul Delahaye, Max Dauchet, Marie-Odile Delorme, Alain Hénaut, Emmanuelle Ollivier
Published in: Comput. Appl. Biosci. (1997)
Keyphrases
- dna sequences
- tandem repeats
- compression algorithm
- human genome
- sequence patterns
- data compression
- image compression
- compression ratio
- biological databases
- bitstream
- coding regions
- dna computing
- binding sites
- quadtree decomposition
- biological sequences
- motif discovery
- databases
- transcription factor binding sites
- lossless data compression
- pattern mining
- single nucleotide polymorphisms
- data integration
- bit rate
- preprocessing
- computational complexity