Protein Generation via Genome-scale Language Models with Bio-physical Scoring.
Gautham DharumanArvind RamanathanPublished in: SC Workshops (2023)
Keyphrases
- language model
- genome scale
- sequence similarity
- protein function
- language modeling
- dna binding
- protein protein interactions
- probabilistic model
- saccharomyces cerevisiae
- retrieval model
- test collection
- information retrieval
- protein structure prediction
- query expansion
- metabolic pathways
- protein sequences
- computational methods
- systems biology
- protein structure
- smoothing methods
- amino acids
- okapi bm
- microarray data
- protein interaction networks
- protein protein interaction networks
- genomic data
- computational tools
- co occurrence
- sequence alignment
- information extraction
- vector space model