Wikipedia Cultural Diversity Dataset: A Complete Cartography for 300 Language Editions.
Marc Miquel-RibéDavid LaniadoPublished in: ICWSM (2019)
Keyphrases
- programming language
- knowledge base
- natural language
- named entities
- world knowledge
- cross cultural
- document collections
- entity ranking
- object oriented programming
- language processing
- link structure
- training dataset
- semantic relations
- language learning
- machine translation
- benchmark datasets
- database
- wordnet
- information retrieval systems
- object detection
- digital libraries
- high level
- information retrieval
- machine learning
- data mining
- neural network