ISTILAH SAINS: A Malay-English Terminology Retrieval System Experiment Using Stemming and N-grams Approach on Malay Words.
Tengku M. T. Sembok, Kulothunkan Palasundram, Nazlena Mohamad Ali, Yahya Aidanismah, Tengku Siti Meriam Tengku Wook
Published in: ICADL (2003)
Keyphrases
- n gram
- part of speech
- character n grams
- parse tree
- multiword
- language specific
- language model
- language independent
- word level
- text classification
- variable length
- noun phrases
- bag of words
- tf idf
- language modeling
- stop words
- syntactic categories
- natural language
- word segmentation
- viterbi algorithm
- information retrieval
- relevance feedback
- web documents
- word forms
- inside outside algorithm
- machine translation
- information retrieval systems
- probabilistic model
- sentence level
- vector space model