Semisupervised Wrapper Choice and Generation for Print-Oriented Documents.
Alberto BartoliGiorgio DavanzoEric MedvetEnrico SorioPublished in: IEEE Trans. Knowl. Data Eng. (2014)
Keyphrases
- semi supervised
- document collections
- information retrieval
- feature selection
- relevant documents
- document classification
- xml documents
- document clustering
- text documents
- information retrieval systems
- semantic information
- digital documents
- generation process
- black box
- document retrieval
- user queries
- learning algorithm
- co occurrence
- multimedia documents
- digital libraries
- ranked list
- query biased
- web documents
- textual content
- metadata
- retrieved documents
- document representation
- keywords
- web data
- relational databases
- information extraction
- text mining
- labeled data
- database