Protein language models meet reduced amino acid alphabets.
Ioan Ieremie, Rob M. Ewing, Mahesan Niranjan
Published in: Bioinform. (2024)
Keyphrases
- amino acids
- language model
- protein sequences
- language modeling
- protein function
- speech recognition
- n gram
- query expansion
- document retrieval
- probabilistic model
- retrieval model
- secondary structure
- amino acid sequences
- test collection
- information retrieval
- tertiary structure
- protein folding
- sequence alignment
- protein structure
- protein structure prediction
- statistical language models
- computational biology
- language modelling
- physicochemical properties
- document ranking
- relevance model
- pseudo relevance feedback
- smoothing methods
- contact map
- contact maps
- language models for information retrieval
- amino acid residues
- molecular biology
- spoken term detection