Diffusion on language model embeddings for protein sequence generation.
Viacheslav Meshchaninov, Pavel V. Strashnov, Andrey Shevtsov, Fedor Nikolaev, Nikita Ivanisenko, Olga L. Kardymon, Dmitry P. Vetrov
Published in: CoRR (2024)
Keyphrases
- language model
- protein sequences
- language modeling
- protein structure
- speech recognition
- n gram
- probabilistic model
- document retrieval
- retrieval model
- protein secondary structure
- query expansion
- secondary structure
- language modelling
- test collection
- amino acids
- protein folding
- information retrieval
- protein structure prediction
- mixture model
- protein secondary structure prediction
- statistical language models
- pseudo relevance feedback
- ad hoc information retrieval
- structural motifs
- smoothing methods
- context sensitive
- translation model
- query terms
- vector space
- protein structural
- language model for information retrieval
- relevance model
- dimensionality reduction
- document length
- cross lingual
- experimentally determined
- distance measure
- data analysis