What's there and what's not?: focused crawling for missing documents in digital libraries.
Ziming Zhuang, Rohit Wagle, C. Lee Giles
Published in: JCDL (2005)
Keyphrases
- focused crawling
- digital libraries
- web documents
- focused crawler
- topic specific
- text content
- metadata
- web pages
- semantic information
- document collections
- web mining
- multimedia
- web sources
- topic modeling
- information extraction
- search tools
- information sources
- vector space model
- semi structured
- xml documents
- document retrieval
- relevant documents
- keywords
- website
- information retrieval