Author Verification Using Common N-Gram Profiles of Text Documents.
Magdalena JankowskaEvangelos E. MiliosVlado KeseljPublished in: COLING (2014)
Keyphrases
- n gram
- text documents
- text classification
- bag of words
- text mining
- language model
- text categorization
- document classification
- news articles
- language modeling
- tf idf
- information extraction
- document clustering
- document representation
- text data
- keywords
- labeled data
- part of speech
- feature selection
- wordnet
- named entities
- sentiment analysis
- machine learning
- term frequency
- naive bayes
- unsupervised learning
- natural language processing
- cross lingual
- knn
- probabilistic model
- image retrieval
- text classifiers