Surfin' Wikipedia: an analysis of the Wikipedia (non-random) surfer's behavior from aggregate access data.
Karl GyllstromMarie-Francine MoensPublished in: IIiX (2012)
Keyphrases
- data analysis
- statistical analysis
- data sets
- data collection
- data quality
- synthetic data
- document collections
- data sources
- knowledge discovery
- data processing
- raw data
- high dimensional data
- random walk
- database
- knowledge base
- data mining techniques
- data points
- probability distribution
- wordnet
- high quality
- training data
- missing data
- search engine
- databases
- semantic relations
- correlation analysis