A novel data structure to support ultra-fast taxonomic classification of metagenomic sequences with k-mer signatures.
Xinan LiuYe YuJinpeng LiuCorrine F. ElliottChen QianJinze LiuPublished in: Bioinform. (2018)
Keyphrases
- data structure
- classification systems
- classification accuracy
- sequence analysis
- feature selection
- decision trees
- feature space
- benchmark datasets
- pattern classification
- classification method
- automatic classification
- pattern recognition
- feature vectors
- support vector machine svm
- training set
- sequence data
- support vector machine
- classification scheme
- suffix tree
- classification process
- model selection
- multidimensional data
- genomic sequences
- sequential patterns
- machine learning
- image classification
- end users
- support vector
- training data
- learning algorithm