Compressed q-Gram Indexing for Highly Repetitive Biological Sequences.
Francisco ClaudeAntonio FariñaMiguel A. Martínez-PrietoGonzalo NavarroPublished in: BIBE (2010)
Keyphrases
- biological sequences
- protein sequences
- molecular biology
- sequence data
- biological data
- computational biology
- motif finding
- dna sequences
- data structure
- information retrieval
- indexing method
- neural network
- longest common subsequence
- multimedia databases
- information retrieval systems
- data mining
- binding sites
- indexing methods
- database