Parallel Collection of Live Data Using Hadoop.
Kyriacos TalattinisAikaterini SidiropoulouKonstantinos ChalkiasGeorge StephanidesPublished in: Panhellenic Conference on Informatics (2010)
Keyphrases
- data sets
- database
- raw data
- experimental data
- data processing
- data points
- synthetic data
- data collection
- data structure
- knowledge discovery
- original data
- big data
- high quality
- data analysis
- training data
- end users
- information retrieval
- attribute values
- statistical analysis
- sensor data
- missing data
- parallel implementation
- open source
- high dimensional data
- document collections
- input data
- data mining techniques
- image data
- data sources
- clustering algorithm
- data mining
- databases
- real time