A Fast Word Embedding Based Classifier to Profile Target Gene Databases in Metagenomic Samples.
Gustavo A. Arango-Argoty, Lenwood S. Heath, Amy Pruden, Peter J. Vikesland, Liqing Zhang
Published in: ICCABS (2020)
Keyphrases
- databases
- training samples
- training set
- genomic sequences
- small sample
- feature values
- data sets
- training data
- gene expression
- optimum path forest
- test data
- co-occurrence
- support vector
- training examples
- database
- relational databases
- representative samples
- labeled samples
- feature selection
- sequence data
- feature space
- microarray
- data sources
- data integration
- word sense disambiguation
- training dataset
- user profiles
- sample set
- learning algorithm
- feature set
- test sample
- gene expression datasets
- vector space
- decision trees
- sampling methods
- sequence analysis
- DNA microarray
- lexical features
- classification method