Optimised Transformation Algorithm For Hadoop Data Loading in Web ETL Framework.
Gaurav Gupta, Neelesh Kumar, Indu Chhabra
Published in: EAI Endorsed Trans. Scalable Inf. Syst. (2020)
Keyphrases
- input data
- data sets
- database
- learning algorithm
- noisy data
- data reduction
- preprocessing
- detection algorithm
- probabilistic model
- training data
- large scale data sets
- big data
- clustering method
- semantic web
- data processing
- open source
- end users
- dynamic programming
- data analysis
- objective function
- website
- np hard
- k means
- search space
- computational complexity
- data structure
- data sources
- data mining techniques
- information sources
- segmentation algorithm
- bayesian framework
- massive scale