unarXive: a large scholarly data set with publications' full-text, annotated in-text citations, and links to metadata.
Tarek SaierMichael FärberPublished in: Scientometrics (2020)
Keyphrases
- digital libraries
- metadata
- scientific literature
- journal articles
- data sets
- digital documents
- scientific publications
- scientific papers
- medical subject headings
- database
- google scholar
- document repositories
- training data
- electronic documents
- information resources
- medical literature
- scientific articles
- information retrieval
- real world
- multimedia
- text mining
- multimedia documents
- text data
- text retrieval
- bibliographic information
- web documents
- databases
- keywords
- free text
- semantic information
- dublin core
- training set
- manually annotated
- text processing
- link analysis
- learning resources
- learning objects
- natural language processing