Combining Knowledge about Text Types and Document Structures for Enhanced Content Curation.
Karolina ZaczynskaFlorian KintzelJulián Moreno SchneiderGeorg RehmPublished in: Qurator (2021)
Keyphrases
- semantic information
- textual content
- text content
- web documents
- document content
- content and structure
- text documents
- document analysis
- semantic structure
- multimedia documents
- information retrieval
- scientific papers
- keywords
- document structure
- domain knowledge
- digital documents
- structured documents
- textual information
- text collections
- document processing
- pdf files
- semantic content
- key concepts
- web pages
- free text
- textual data
- knowledge base
- document collections
- text mining
- information extraction
- document representation
- printed documents
- relevant content
- metadata
- knowledge discovery
- scientific documents
- digital repositories
- textual features
- digital libraries
- related documents
- effective retrieval
- document images
- relevant documents
- text retrieval