A Distributed Network of Physics Institutions Collecting, Indexing, and Searching High Quality Documents by using Harvest.
Thomas Severiens, Michael Hohlfeld, Kerstin Zimmermann, Eberhard R. Hilf
Published in: D Lib Mag. (2000)
Keyphrases
- distributed network
- high quality
- information retrieval
- effective retrieval
- document indexing
- word spotting
- index terms
- document analysis
- document collections
- document processing
- relevant documents
- web documents
- document retrieval
- sensor placement
- database
- retrieval engine
- focused crawling
- retrieval systems
- text retrieval
- information retrieval systems
- document repositories
- network latency
- metadata
- computer science
- bandwidth usage
- controlled vocabulary
- content based retrieval
- string matching
- keywords
- document clustering
- Chinese text retrieval
- bibliographic databases
- inverted index
- free text
- web data
- text documents
- xml documents
- artificial intelligence
- semantic information
- data collection
- language model
- digital libraries