A software system for gene sequence database construction based on fast approximate string matching.
Zheng LiuJames BornemanTao JiangPublished in: Int. J. Bioinform. Res. Appl. (2005)
Keyphrases
- approximate string matching
- sequence databases
- edit distance
- string matching
- genomic databases
- n gram
- sequence data
- sequence alignment
- protein sequences
- suffix array
- suffix tree
- microarray
- sequential pattern mining
- similarity search
- sequential patterns
- information retrieval
- gene expression
- biological data
- personal information
- high throughput
- indexing techniques
- graph matching
- database
- text classification
- text mining
- end users
- dynamic programming
- data model
- feature space
- data analysis
- similarity measure
- databases