Using the Google N-Gram corpus to measure cultural complexity.
Patrick JuolaPublished in: Lit. Linguistic Comput. (2013)
Keyphrases
- n gram
- language model
- language independent
- text classification
- bag of words
- language modelling
- search engine
- variable length
- similarity measure
- viterbi algorithm
- language modeling
- word segmentation
- test collection
- part of speech
- web documents
- distance measure
- statistical language modeling
- document images
- question answering
- language specific