Verifying Heaps' law using Google Books Ngram data.
Vladimir V. Bochkarev, Eduard Yu. Lerner, Anna V. Shevlyakova
Published in: CoRR (2016)
Keyphrases
- data analysis
- data sets
- knowledge discovery
- databases
- raw data
- n-gram
- training data
- prior knowledge
- database
- statistical analysis
- data collection
- small number
- original data
- statistical methods
- data distribution
- missing data
- computer systems
- data mining techniques
- data mining
- data sources
- wireless sensor networks
- data structure
- decision trees
- search engine