Viral genome prediction from raw human DNA sequence samples by combining natural language processing and machine learning techniques.
Mohammad H. Alshayeji, Silpa ChandraBhasi Sindhu, Sa'ed Abed
Published in: Expert Syst. Appl. (2023)
Keyphrases
- dna sequences
- gene prediction
- human genome
- natural language processing
- coding regions
- machine learning
- genomic sequences
- problems in computational biology
- single nucleotide polymorphisms
- nucleotide sequences
- tandem repeats
- evolutionary history
- dna sequencing
- information extraction
- dna computing
- regulatory elements
- sequence patterns
- natural language
- text mining
- motif discovery
- biological sequences
- sequence data
- machine learning algorithms
- transcription factors
- escherichia coli
- binding sites
- statistical methods
- wordnet
- protein coding regions
- protein interaction
- computational biology
- semantic similarity
- human immunodeficiency virus
- knowledge representation
- gene structure prediction