Pairing interacting protein sequences using masked language modeling.
Umberto Lupo, Damiano Sgarbossa, Anne-Florence Bitbol
Published in: CoRR (2023)
Keyphrases
- language modeling
- protein sequences
- language model
- computational biology
- query expansion
- information retrieval
- retrieval model
- protein structure
- biological sequences
- probabilistic model
- amino acids
- n-gram
- protein classification
- protein function
- amino acid sequences
- sequence analysis
- multiple sequence alignment
- protein secondary structure
- secondary structure
- statistical language modeling
- document retrieval
- text classification
- test collection
- text categorization
- information retrieval systems
- knowledge discovery
- training set