Text Classification for Thai Medicinal Web Pages.
Verayuth Lertnattee, Thanaruk Theeramunkong
Published in: PAKDD (2007)
Keyphrases
- text classification
- web page classification
- web pages
- classifying web pages
- word segmentation
- text categorization
- bag of words
- search engine
- website
- text data
- feature selection
- naive bayes
- text documents
- text mining
- web search
- multi label
- machine learning
- text classifiers
- n gram
- sentiment analysis
- labeled data
- web documents
- link analysis
- web search engines
- semantic features
- dynamic content
- knn
- data extraction
- language modeling
- visual features
- data cleaning
- rough sets
- web users
- web content
- data mining
- automatic text classification