UCD : Diachronic Text Classification with Character, Word, and Syntactic N-grams.
Terrence Szymanski, Gerard Lynch
Published in: SemEval@NAACL-HLT (2015)
Keyphrases
- n-gram
- text classification
- bag of words
- text categorization
- text mining
- word segmentation
- language independent
- feature selection
- variable length
- language modeling
- machine learning
- text classifiers
- part of speech
- sentiment analysis
- natural language
- text documents
- character n-grams
- viterbi algorithm
- labeled data
- language specific
- bayesian networks
- kNN
- sentiment classification
- language model
- term frequency
- word level
- natural language text
- databases
- cross lingual
- image classification
- statistical language modeling