Classification of Web Documents Using a Graph Model.
Adam SchenkerMark LastHorst BunkeAbraham KandelPublished in: ICDAR (2003)
Keyphrases
- web documents
- graph model
- vector space model
- information extraction
- semi structured
- web pages
- bipartite graph
- keywords
- web search engines
- machine learning
- html documents
- data mining
- supervised learning
- information retrieval systems
- co occurrence
- feature vectors
- weighted graph
- feature space
- feature selection
- learning algorithm