Fast logistic regression for text categorization with variable-length n-grams.
Georgiana IfrimGökhan H. BakirGerhard WeikumPublished in: KDD (2008)
Keyphrases
- variable length
- logistic regression
- text categorization
- n gram
- text classification
- naive bayes
- bag of words
- feature selection
- multi label
- language model
- information gain
- labeled data
- decision trees
- support vector
- text documents
- knn
- tf idf
- machine learning
- unlabeled data
- k nearest neighbor
- semi supervised learning
- viterbi algorithm
- text mining
- data sets
- term frequency
- nearest neighbor
- linear svm
- loss function
- topic models
- unsupervised learning
- transfer learning
- knowledge representation
- classification accuracy
- probabilistic model