Learning Semi-Structured Document Categorization Using Bounded-Length Spectrum Sub-Sequence Kernels.
Olivier Y. de Vel
Published in: Data Min. Knowl. Discov. (2006)
Keyphrases
- semi-structured
- document categorization
- learning algorithm
- learning process
- learning tasks
- structured data
- web documents
- text mining
- active learning
- information extraction
- meta learning
- database
- data model
- decision trees
- machine learning
- data sets
- supervised learning
- feature space
- information retrieval systems
- artificial intelligence
- kernel methods
- data mining
- inductive learning
- databases