Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences.
Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, Jason Liu, Demi Guo, Myle Ott, C. Lawrence Zitnick, Jerry Ma, Rob Fergus
Published in: Proc. Natl. Acad. Sci. USA (2021)
Keyphrases
- protein sequences
- unsupervised learning
- protein structure and function
- protein structure prediction
- computational biology
- physicochemical properties
- structural motifs
- computational approaches
- amino acids
- protein structural
- secondary structure
- multiple sequence alignment
- protein-protein
- supervised learning
- biological sequences
- sequence analysis
- protein classification
- similarity search
- protein secondary structure
- biological data
- machine learning
- molecular biology
- physicochemical
- network structure