Predicting the Sequence Specificities of DNA-Binding Proteins by DNA Fine-Tuned Language Model With Decaying Learning Rates.
Ying He, Qinhu Zhang, Siguo Wang, Zhanheng Chen, Zhen Cui, Zhen-Hao Guo, De-Shuang Huang
Published in: IEEE ACM Trans. Comput. Biol. Bioinform. (2023)
Keyphrases
- dna binding
- language model
- learning rate
- fine tuned
- binding sites
- sequence similarity
- genome wide
- fine tuning
- biological sequences
- document retrieval
- probabilistic model
- information retrieval
- retrieval model
- speech recognition
- transcription factor binding sites
- query expansion
- sequence analysis
- convergence rate
- transcription factors
- learning algorithm
- sequence data
- domain specific
- dna sequences
- test collection
- protein families
- mixture model
- gene expression
- motif discovery
- high throughput
- computational methods
- biological processes
- evolutionary algorithm
- relevant documents
- information retrieval systems
- secondary structure
- search engine
- retrieval effectiveness