SIDR: structure-aware intelligent data routing in Hadoop.
Joe B. BuckNoah WatkinsGreg LevinAdam CrumeKleoni IoannidouScott A. BrandtCarlos MaltzahnNeoklis PolyzotisAaron TorresPublished in: SC (2013)
Keyphrases
- data sets
- data distribution
- data collection
- original data
- data processing
- raw data
- synthetic data
- databases
- high quality
- training data
- big data
- data quality
- small number
- neural network
- experimental data
- missing data
- complex data
- data objects
- statistical methods
- data acquisition
- real world
- data analysis
- wireless networks
- database
- open source
- data points
- prior knowledge
- xml documents
- relational databases