Indexing HDFS Data in PDW: Splitting the data from the index.
Vinitha Reddy GankidiNikhil TeletiaJignesh M. PatelAlan HalversonDavid J. DeWittPublished in: Proc. VLDB Endow. (2014)
Keyphrases
- data sets
- database
- original data
- raw data
- high quality
- computer systems
- data collection
- image data
- complex data
- data distribution
- synthetic data
- statistical analysis
- information retrieval
- data sources
- data processing
- end users
- high dimensional data
- missing data
- spatial data
- data mining techniques
- data acquisition
- real world
- databases
- historical data