RSDB: representative protein sequence databases have high information content.
Jong ParkLiisa HolmAndreas HegerCyrus ChothiaPublished in: Bioinform. (2000)
Keyphrases
- information content
- sequence databases
- protein sequences
- sequence data
- sequential patterns
- mutual information
- similarity search
- normal form
- sequential pattern mining
- amino acids
- biological data
- semantic similarity
- database
- protein structure
- data analysis
- pattern recognition
- similarity measure
- database systems
- information retrieval