A Boosting-based Algorithm for Classification of Semi-Structured Text using the Frequency of Substructures.
Tomoya Iwakura
Published in: RANLP (2013)
Keyphrases
- semi-structured
- learning algorithm
- classification algorithm
- support vector machine (SVM)
- free text
- feature selection
- classification accuracy
- multi-class classification
- structured data
- machine learning
- data model
- information retrieval
- support vector machine
- web documents
- relational databases
- feature space
- decision trees
- search engine
- automatic extraction