Suite of Tools for Statistical n-Gram Language Modeling for Pattern Mining in whole genome Sequences.
Madhavi GanapathirajuAsia D. MitchellMohamed ThahirKamiya MotwaniSeshan AnanthasubramanianPublished in: J. Bioinform. Comput. Biol. (2012)
Keyphrases
- language modeling
- pattern mining
- n gram
- language model
- statistical language modeling
- sequential patterns
- variable length
- cross lingual
- language modelling
- text classification
- language independent
- frequent patterns
- itemsets
- data mining
- retrieval model
- data mining techniques
- information retrieval
- query expansion
- word segmentation
- probabilistic model
- query terms
- multimedia
- digital libraries
- document retrieval
- hidden markov models
- relevance model
- expert finding
- naive bayes classification