Automated words stability and languages phylogeny
Filippo PetroniMaurizio ServaPublished in: CoRR (2009)
Keyphrases
- arabic language
- n gram
- semi automated
- language independent
- language specific
- databases
- keywords
- indian languages
- fully automated
- word segmentation
- multilingual documents
- related words
- word sense disambiguation
- expressive power
- feature selection
- search engine
- stability analysis
- phylogenetic trees
- sequence data
- text documents
- english words
- character n grams
- word forms