Multi-scale DNA language model improves 6 mA binding sites prediction.
Anlin HouHanyu LuoHuan LiuLingyun LuoPingjian DingPublished in: Comput. Biol. Chem. (2024)
Keyphrases
- binding sites
- language model
- multiscale
- dna sequences
- language modeling
- gene expression
- n gram
- sequence data
- motif discovery
- document retrieval
- retrieval model
- probabilistic model
- statistical significance
- information retrieval
- dna binding
- transcription factors
- ad hoc information retrieval
- speech recognition
- query terms
- biological sequences
- transcription factor binding sites
- query expansion
- mixture model
- test collection
- regulatory elements
- retrieval effectiveness
- smoothing methods
- coding regions