Les n-grams de caractères pour l'aide à l'extraction de connaissances dans des bases de données textuelles multilingues.
Ismaïl BiskriSylvain DelislePublished in: TALN (Articles longs) (2001)
Keyphrases
- n gram
- language model
- bag of words
- text classification
- language independent
- viterbi algorithm
- part of speech
- language modelling
- language modeling
- information extraction
- variable length
- inside outside algorithm
- character n grams
- language specific
- artificial intelligence
- text categorization
- word level
- hidden markov models
- information retrieval
- machine learning