Generative design of compounds with desired potency from target protein sequences using a multimodal biochemical language model.
Hengwei ChenJürgen BajorathPublished in: J. Cheminformatics (2024)
Keyphrases
- language model
- protein sequences
- language modeling
- n gram
- computational biology
- information retrieval
- context sensitive
- retrieval model
- document retrieval
- speech recognition
- query expansion
- probabilistic model
- machine learning
- mixture model
- protein structure
- language modelling
- protein classification
- text classification
- translation model
- multiple sequence alignment
- amino acid sequences