Wikipedia Citations: A comprehensive dataset of citations with identifiers extracted from English Wikipedia.
Harshdeep SinghRobert WestGiovanni ColavizzaPublished in: CoRR (2020)
Keyphrases
- computing semantic relatedness
- wordnet
- knowledge base
- automatically extracted
- named entities
- document collections
- wikipedia pages
- digital libraries
- citation networks
- external knowledge
- english language
- metadata
- named entity disambiguation
- citation analysis
- scientific papers
- entity ranking
- web search
- scientific literature
- synthetic datasets
- cross language
- natural language processing