A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark.
Sergio Ramírez-GallegoSalvador GarcíaJosé Manuel BenítezFrancisco HerreraPublished in: Swarm Evol. Comput. (2018)
Keyphrases
- big data
- high volume
- data processing
- data intensive
- data intensive computing
- cloud computing
- commodity hardware
- data management
- map reduce
- open source
- big data analytics
- data analysis
- unstructured data
- distributed systems
- knowledge discovery
- massive data
- real time
- business intelligence
- information processing
- distributed environment
- vast amounts of data
- distributed computing
- social media
- parallel execution
- case study
- data science
- social computing
- case based reasoning
- health informatics
- data warehousing