Combining statistics on n-grams for automatic term recognition.
Almudena BallesterÁngel Martín MunicioFernando PardosJordi Porta ZamoranoRafael J. Ruiz UreñaFernando Sánchez LeónPublished in: LREC (2002)
Keyphrases
- n gram
- language model
- bag of words
- language modeling
- text classification
- language independent
- object recognition
- language modelling
- part of speech
- feature extraction
- neural network
- word segmentation
- semi automatic
- document representation
- inside outside algorithm
- query terms
- coding scheme
- cross language information retrieval
- action recognition
- web documents
- labor intensive
- document analysis
- text categorization
- viterbi algorithm
- word level
- co occurrence
- probabilistic model