An N-gram based approach to auto-extracting topics from research articles.
Linkai ZhuMaoyi HuangMaomao ChenWennan WangPublished in: CoRR (2021)
Keyphrases
- n gram
- language model
- language independent
- text classification
- bag of words
- language modeling
- language modelling
- information retrieval
- variable length
- viterbi algorithm
- part of speech
- word segmentation
- topic models
- inside outside algorithm
- probabilistic model
- neural network
- web documents
- keywords
- text documents
- character n grams
- databases