Feature Matrix Extraction and Classification of XML Pages.
Hongcan YanDianchuan JinLihong LiBaoxiang LiuYanan HaoPublished in: APWeb Workshops (2008)
Keyphrases
- text mining
- text classification
- semi structured
- machine learning
- feature vectors
- information retrieval
- feature set
- pattern recognition
- decision trees
- databases
- classification accuracy
- feature space
- relational databases
- support vector
- xml documents
- website
- image classification
- feature selection
- feature values
- training samples
- support vector machine svm
- keywords
- supervised learning
- feature weights
- feature analysis
- classification algorithm
- feature extraction and classification
- feature representation
- singular value decomposition
- database
- xml data
- web pages
- metadata
- feature extraction