Wikipedia Text Reuse: Within and Without.
Milad AlshomaryMichael VölskeTristan LichtHenning WachsmuthBenno SteinMatthias HagenMartin PotthastPublished in: CoRR (2018)
Keyphrases
- short texts
- named entity disambiguation
- world knowledge
- information retrieval
- wikipedia pages
- database
- short text
- natural language text
- text retrieval
- semantic information
- text mining
- document corpus
- wikipedia articles
- free text
- textual data
- wordnet
- learning objects
- software reuse
- text corpus
- entity extraction
- digital libraries
- anchor text
- document analysis
- semantic network
- named entities
- web documents
- document collections
- text classification