Statistical analysis of the Indus script using $n$-grams
Nisha YadavHrishikesh JoglekarRajesh P. N. RaoMayank N. VahiaIravatham MahadevanRonojoy AdhikariPublished in: CoRR (2009)
Keyphrases
- n gram
- statistical analysis
- language model
- text classification
- bag of words
- language independent
- part of speech
- variable length
- viterbi algorithm
- language modeling
- language modelling
- data mining
- word segmentation
- document retrieval
- word level
- statistical language modeling
- knn
- statistical analyses
- real world
- inside outside algorithm