Language Independent Tokenization vs. Stemming in Automated Detection of Health Websites' HONcode Conformity: An Evaluation.
Célia BoyerLjiljana DolamicGilles FalquetPublished in: CENTERIS/ProjMAN/HCist (2015)
Keyphrases
- language independent
- automated detection
- n gram
- automated analysis
- lung cancer
- text classification
- character n grams
- language model
- word forms
- machine translation
- chinese text retrieval
- word level
- cross language
- cross lingual
- word segmentation
- text retrieval
- language specific
- information filtering
- information retrieval
- information retrieval systems
- image retrieval
- artificial intelligence