A method for multiple-sequence-alignment-free protein structure prediction using a protein language model.
Xiaomin Fang, Fan Wang, Lihang Liu, Jingzhou He, Dayong Lin, Yingfei Xiang, Kunrui Zhu, Xiaonan Zhang, Hua Wu, Hui Li, Le Song
Published in: Nat. Mac. Intell. (2023)
Keyphrases
- language model
- protein structure prediction
- protein sequences
- pairwise
- probabilistic model
- protein fold recognition
- computational biology
- multiple sequence alignment
- machine learning methods
- information retrieval
- language modeling
- high dimensional
- machine learning
- n-gram
- unsupervised learning
- bayesian networks
- clustering algorithm
- smoothing methods