Wikipedia Text Reuse: Within and Without.
Milad AlshomaryMichael VölskeTristan LichtHenning WachsmuthBenno SteinMatthias HagenMartin PotthastPublished in: ECIR (1) (2019)
Keyphrases
- named entity disambiguation
- world knowledge
- natural language text
- wikipedia pages
- short texts
- semantic information
- information retrieval
- knowledge base
- semi automatically
- keywords
- text corpus
- wikipedia articles
- anchor text
- free text
- text retrieval
- wordnet
- text mining
- text documents
- text summarization
- web documents
- text corpora
- document structure
- metadata
- latent semantic analysis
- entity ranking
- plain text