Scalable distributed data cube computation for large-scale multidimensional data analysis on a Spark cluster.
Suan Lee, Seok Kang, Jinho Kim, Eun Jung Yu
Published in: Clust. Comput. (2019)
Keyphrases
- data cube
- scalable distributed
- on line analytical processing
- data warehouse
- data analysis
- multi dimensional
- multi dimensional data
- storage space
- aggregate queries
- olap systems
- multidimensional databases
- cluster analysis
- online analytical processing
- multidimensional data
- data mining
- range sum queries
- data structure
- incremental maintenance
- business intelligence
- decision support
- olap queries
- clustering algorithm
- multidimensional data model
- data cube construction
- range queries
- data warehousing
- fact table
- file system
- real world
- data sets
- data management
- neural network
- main idea consists
- feature selection
- knowledge discovery
- data integration
- data visualization
- query execution
- data partitioning
- hierarchical clustering
- star schema
- feature space
- machine learning
- efficient computation
- parallel computation