LM-ARG: Identification & classification of antibiotic resistance genes leveraging pre-trained protein language models.
Shafayat AhmedMuhit Islam EmonNazifa Ahmed MoumiLiqing ZhangPublished in: BIBM (2022)
Keyphrases
- language model
- language modeling
- pre trained
- n gram
- information retrieval
- query expansion
- probabilistic model
- retrieval model
- speech recognition
- test collection
- document retrieval
- decision trees
- language modelling
- pattern recognition
- text classification
- machine learning
- document ranking
- image classification
- feature vectors
- smoothing methods
- feature selection
- statistical language models
- feature space
- feature extraction
- training samples
- supervised learning
- active learning
- training set
- gene expression data
- support vector
- vector space model
- neural network
- escherichia coli