Extraction of Authors' Characteristics from Japanese Modern Sentences via N-gram Distribution.
Tsukasa Matsuura, Yasumasa Kanada
Published in: Discovery Science (2000)
Keyphrases
- n gram
- language model
- variable length
- language independent
- language modelling
- part of speech
- bag of words
- language modeling
- text classification
- word segmentation
- probability distribution
- information extraction
- viterbi algorithm
- language specific
- character n grams
- inside outside algorithm
- question answering
- natural language