Local n-grams for Author Identification Notebook for PAN at CLEF 2013.
Robert LaytonPaul A. WattersRichard DazeleyPublished in: CLEF (Working Notes) (2013)
Keyphrases
- n gram
- author identification
- language independent
- language model
- query expansion
- cross lingual
- cross language
- language modeling
- question answering
- character n grams
- test collection
- text classification
- information retrieval
- bag of words
- highly skewed
- variable length
- part of speech
- document retrieval
- ad hoc retrieval
- natural language processing
- machine learning
- web documents
- text retrieval
- text categorization
- information extraction
- probabilistic model
- language specific
- image retrieval
- inside outside algorithm