A perceptual hash function to store and retrieve large scale DNA sequences.
Jocelyn de Goër de Herve, Myoung-Ah Kang, Xavier Bailly, Engelbert Mephu Nguifo
Published in: CoRR (2014)
Keyphrases
- dna sequences
- hash functions
- hashing methods
- similarity search
- human genome
- dna sequencing
- dna computing
- tandem repeats
- motif discovery
- nearest neighbor search
- problems in computational biology
- hashing algorithm
- stream cipher
- hash tables
- gene structure prediction
- coding regions
- locality sensitive hashing
- hash table
- biological sequences
- binding sites
- data distribution
- database
- binary codes
- hamming distance
- data mining
- transcription factors
- secret key
- databases