Classifying alkaliphilic proteins using embeddings from protein language model.
Meredita Susanty, Muhammad K. N. Mursalim, Rukman Hertadi, Ayu Purwarianti, Tati Latifah Rajab Mengko
Published in: Comput. Biol. Medicine (2024)
Keyphrases
- language model
- amino acids
- subcellular localization
- protein function
- protein structure
- protein sequences
- amino acid sequences
- protein structure prediction
- protein interaction data
- contact map
- protein-protein interactions
- language modeling
- DNA binding
- protein folding
- n-gram
- speech recognition
- protein interaction networks
- physicochemical properties
- document retrieval
- amino acid composition
- information retrieval
- protein-protein interaction networks
- probabilistic model
- protein complexes
- retrieval model
- language modelling
- protein interaction
- mass spectrometry
- mixture model
- secondary structure
- test collection
- smoothing methods
- vector space
- query expansion
- ad hoc information retrieval
- context sensitive
- protein families
- query terms
- statistical language models
- functional modules
- statistical machine translation
- low dimensional
- dimensionality reduction
- language model for information retrieval
- high throughput
- translation model
- text classification
- distance measure
- language models for information retrieval