ProtFIM: Fill-in-Middle Protein Sequence Design via Protein Language Models.
Youhan LeeHasun YuPublished in: CoRR (2023)
Keyphrases
- language model
- protein sequences
- protein structure
- language modeling
- amino acids
- n gram
- probabilistic model
- protein structure prediction
- speech recognition
- protein folding
- protein secondary structure
- computational biology
- retrieval model
- document retrieval
- structural motifs
- document ranking
- protein function
- amino acid sequences
- protein structure and function
- query expansion
- protein classification
- protein protein
- information retrieval
- language modelling
- smoothing methods
- test collection
- multiple sequence alignments
- physico chemical
- remote homology detection
- secondary structure