A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection.
Hanqing ZhouAmal ZouaqDiana InkpenPublished in: Inf. (2019)
Keyphrases
- n gram
- language model
- language modelling
- character n grams
- bag of words
- statistical language modeling
- language independent
- text classification
- variable length
- word segmentation
- language modeling
- probabilistic model
- viterbi algorithm
- text categorization
- web documents
- named entities
- data mining
- co occurrence
- part of speech
- information extraction
- keywords