OpenProteinSet: Training data for structural biology at scale.
Gustaf AhdritzNazim BouattaSachin KadyanLukas JaroschDaniel BerenbergIan FiskAndrew M. WatkinsStephen RaRichard BonneauMohammed AlQuraishiPublished in: CoRR (2023)
Keyphrases
- training data
- training set
- data sets
- structural information
- decision trees
- supervised learning
- learning algorithm
- scientific fields
- training examples
- test data
- test set
- scale space
- naive bayes
- structural analysis
- molecular biology
- small scale
- prior knowledge
- case study
- training samples
- real time
- support vector machine
- classification accuracy
- domain knowledge
- computational biology
- hidden markov models
- feature selection