Nucleotide Archival Format (NAF) enables efficient lossless reference-free compression of DNA sequences.
Kirill KryukovMahoko Ueda TakahashiSo NakagawaTadashi ImanishiPublished in: Bioinform. (2019)
Keyphrases
- dna sequences
- image compression
- lossless compression
- lossy compression
- predictive coding
- arithmetic coding
- dna sequencing
- tandem repeats
- compression ratio
- human genome
- motif discovery
- progressive transmission
- dna computing
- compression scheme
- lempel ziv
- binding sites
- protein coding regions
- compression algorithm
- sequence patterns
- problems in computational biology
- coding regions
- gene structure prediction
- genomic sequences
- data compression
- lossless image compression
- biological sequences
- gene prediction
- lossless coding
- image quality
- data model
- metadata
- homo sapiens
- data mining