Batch is back: CasJobs, serving multi-TB data on the Web
William O'Mullane, Nolan Li, María A. Nieto-Santisteban, Alexander S. Szalay, Ani Thakar, Jim Gray
Published in: CoRR (2005)
Keyphrases
- data sets
- raw data
- database
- web data
- data collection
- data analysis
- data processing
- web documents
- small number
- prior knowledge
- original data
- synthetic data
- website
- statistical analysis
- training data
- high quality
- web search
- end users
- constantly growing
- linked open data
- multi source
- textual data
- data mining
- data quality
- relational databases
- search engine
- data distribution
- spatial data
- data points
- image data
- computer systems
- database systems