Towards an Annotation Standard for STEM Documents - Datasets, Benchmarks, and Spotters.
Jan Frederik SchaeferMichael KohlhasePublished in: CICM (2023)
Keyphrases
- metadata
- legal documents
- information retrieval systems
- information retrieval
- document retrieval
- database
- xml documents
- semantic annotation
- image annotation
- keywords
- training data
- document collections
- relevant documents
- document clustering
- text data
- data collections
- semantic web
- benchmark datasets
- text collections
- structured documents