Statistical feature extraction for cross-language web content quality assessment.
Guanggang GengXiaodong LiLi-Ming WangWei WangShuo ShenPublished in: SIGIR (2011)
Keyphrases
- quality assessment
- web content
- cross language
- feature extraction
- document retrieval
- text retrieval
- website
- cross language information retrieval
- question answering
- document collections
- cross lingual
- text categorization
- web documents
- web pages
- image quality
- information access
- cross media
- human visual system
- data quality
- feature vectors
- visual quality
- face recognition
- feature selection
- image classification
- social media
- active learning
- feature space
- k nearest neighbor
- language model
- machine learning
- data mining