Generative power of a protein language model trained on multiple sequence alignments.
Damiano Sgarbossa, Umberto Lupo, Anne-Florence Bitbol
Published in: CoRR (2022)
Keyphrases
- language model
- multiple sequence alignments
- protein sequences
- language modeling
- pairwise
- multiple sequence alignment
- protein protein interactions
- document retrieval
- probabilistic model
- n gram
- retrieval model
- information retrieval
- protein structure
- sequence alignment
- query expansion
- protein structure prediction
- phylogenetic trees
- test collection
- generative model
- smoothing methods
- amino acids
- query terms
- biological sequences
- computational biology
- multiple alignment
- computational methods
- coarse grained
- unsupervised learning