Using the Google N-Gram corpus to measure cultural complexity.

Published in: Lit. Linguistic Comput. (2013)

Keyphrases

n gram
language model
language independent
text classification
bag of words
language modelling
search engine
variable length
similarity measure
viterbi algorithm
language modeling
word segmentation
test collection
part of speech
web documents
distance measure
statistical language modeling
document images
question answering
language specific