The ContentMine Scraping Stack: Literature-scale Content Mining with Community-maintained Collections of Declarative Scrapers.
Richard Smith-Unna, Peter Murray-Rust
Published in: D Lib Mag. (2014)
Keyphrases
- metadata
- digital collections
- online communities
- multimedia
- information retrieval
- data mining
- multimedia content
- text mining
- data sets
- digital libraries
- web mining
- web personalization
- community discovery
- data repositories
- itemsets
- document collections
- association rule mining
- domain independent
- sequential patterns
- community structure
- mining algorithm
- digital objects
- social web
- association rules
- scale space
- user communities
- digital content
- social networks
- data mining techniques
- knowledge sharing