O-GlcNAcPRED-II: an integrated classification algorithm for identifying O-GlcNAcylation sites based on fuzzy undersampling and a K-means PCA oversampling technique.
Cangzhi JiaYun ZuoQuan ZouPublished in: Bioinform. (2018)
Keyphrases
- classification algorithm
- k means
- clustering algorithm
- class imbalance
- concept drift
- fuzzy k means
- principal component analysis
- fuzzy clustering
- document classification
- support vector machine
- training set
- fuzzy clustering algorithm
- fuzzy sets
- knn
- training phase
- classification method
- k nearest neighbor
- naive bayes
- classification rules
- variable weighting
- learning algorithm
- class distribution
- class labels
- fuzzy rules
- expectation maximization
- clustering method
- accurate classification
- spectral clustering
- fuzzy c means
- covariance matrix
- data sets
- training data
- small number
- active learning
- feature extraction
- machine learning