Extracting PROV provenance traces from Wikipedia history pages.
Paolo Missier, Ziyu Chen
Published in: EDBT/ICDT Workshops (2013)
Keyphrases
- link structure
- wikipedia pages
- wikipedia articles
- website
- metadata
- search engine
- web pages
- anchor text
- web documents
- keywords
- fine grained
- knowledge base
- data provenance
- semantic information
- named entities
- databases
- world knowledge
- web users
- entity ranking
- data extraction
- page content
- ranking algorithm
- scientific data
- log files
- data quality
- wordnet
- web search
- information retrieval