Exploring Machine Learning Algorithms and Protein Language Models Strategies to Develop Enzyme Classification Systems.
Diego FernándezAlvaro Olivera-NappaRoberto Uribe ParedesDavid Medina-OrtizPublished in: IWBBIO (1) (2023)
Keyphrases
- machine learning algorithms
- language model
- classification systems
- language modeling
- benchmark data sets
- learning algorithm
- machine learning
- machine learning methods
- n gram
- decision trees
- amino acids
- probabilistic model
- retrieval model
- information retrieval
- speech recognition
- language modelling
- document retrieval
- test collection
- query expansion
- statistical language models
- language models for information retrieval
- machine learning approaches
- machine learning models
- statistical machine learning
- drug discovery
- pseudo relevance feedback
- context sensitive
- domain specific
- spoken term detection
- protein sequences
- training data
- relevance model
- retrieval effectiveness
- data sets
- smoothing methods
- transfer learning
- bag of words
- active learning
- standard machine learning algorithms
- error rate
- document collections