Compression of Biological Sequences by Greedy Off-Line Textual Substitution.
Alberto ApostolicoStefano LonardiPublished in: Data Compression Conference (2000)
Keyphrases
- biological sequences
- protein sequences
- sequence data
- greedy algorithm
- biological data
- search algorithm
- molecular biology
- computational biology
- dna sequences
- motif finding
- feature selection
- keywords
- high throughput
- binding sites
- self organizing maps
- database
- high dimensional data
- multi dimensional
- natural language
- metadata
- databases