From Box to Bin - Semi-automatic Digitization of a Huge Collection of Ethnological Documents.
Alf-Christian Schering, Ilvio Bruder, Susanne Jürgensmann, Holger Meyer, Christoph Schmitt
Published in: ICADL (2011)
Keyphrases
- semi automatic
- document collections
- fully automatic
- semi automatically
- information retrieval systems
- information retrieval
- text collections
- time stamped
- automatic categorization
- distributed information retrieval
- domain ontology
- semantic annotation
- document set
- gold standard
- xml documents
- relevant documents
- document repositories
- ontology mapping
- document clustering
- labor intensive
- digital libraries
- text documents
- web documents
- keywords
- metadata
- fully automated
- wrapper generation
- text mining
- ontology construction
- design rationale