Generative Language Models on Nucleotide Sequences of Human Genes.
Musa Nuri IhtiyarArzucan OzgurPublished in: CoRR (2023)
Keyphrases
- language model
- language modeling
- nucleotide sequences
- genome scale
- probabilistic model
- n gram
- document retrieval
- information retrieval
- retrieval model
- query expansion
- sequence data
- generative model
- test collection
- gene expression
- statistical language models
- language models for information retrieval
- relevance model
- language modeling framework
- microarray data
- systems biology
- sequence similarity
- molecular biology
- protein sequences
- dna sequences
- gene regulatory networks
- gene ontology
- smoothing methods
- gene selection
- data sources