Between Terms and Words for European Language IR and Between Words and Bigrams for Chinese IR.
Jian-Yun Nie, Jean-Pierre Chevallet, Marie-France Bruandet
Published in: TREC (1997)
Keyphrases
- n-gram
- word segmentation
- information retrieval
- Chinese texts
- related words
- text retrieval
- syntactic categories
- Chinese text
- keyword extraction
- training corpus
- information retrieval systems
- Chinese word segmentation
- lexical information
- retrieval model
- linguistic knowledge
- Chinese characters
- character n-grams
- Chinese text retrieval
- keywords
- natural language
- language model
- retrieval effectiveness
- co-occurrence
- language specific
- part of speech
- unknown words
- English text
- word sense disambiguation
- language modeling
- multiword
- language independent
- term weighting
- parallel corpus
- word level
- out-of-vocabulary
- document retrieval
- query expansion
- document representation
- natural language processing
- word forms
- word meanings