Improving researcher's area of expertise identification using TF-IDF Characters N-grams.
Felipe Penhorate Carvalho da FonsecaLuciano Antonio DigiampietriPublished in: SBSI (2021)
Keyphrases
- n gram
- tf idf
- language modelling
- language model
- part of speech
- retrieval model
- information retrieval
- weighting scheme
- text documents
- vector space model
- text classification
- text categorization
- bag of words
- document frequency
- term frequency
- document clustering
- language modeling
- term weighting
- web documents
- ranking algorithm
- image representation