Tuning small analytics on Big Data: Data partitioning and secondary indexes in the Hadoop ecosystem.
Oscar RomeroVictor HerreroAlberto AbellóJaume FerraronsPublished in: Inf. Syst. (2015)
Keyphrases
- big data
- data partitioning
- query processing
- data management
- cloud computing
- big data analytics
- data intensive
- data processing
- highly scalable
- unstructured data
- query execution
- data analysis
- business intelligence
- similarity search
- knowledge discovery
- social media
- hierarchical clustering
- data warehousing
- database
- data warehouse
- parallel execution
- index structure
- decision making
- data analytics
- query optimization
- data mining