Improving Vietnamese Web Page Classification by Combining Hybrid Feature Selection and Label Propagation with Link Information.
Ngo Van Linh, Nguyen Thi Kim Anh, Cao Manh Dat
Published in: ICCASA (2012)
Keyphrases
- web page classification
- feature selection
- anchor text
- text classification
- web pages
- web mining
- link structure
- link analysis
- semi-supervised learning
- document representation
- semi-supervised
- web search
- community detection
- labeled data
- text categorization
- test collection
- unlabeled data
- search tasks
- web documents
- action recognition
- class labels
- document collections
- data mining
- language modeling
- text mining
- information extraction
- probabilistic model
- feature space
- machine learning