A survey of open source tools for machine learning with big data in the Hadoop ecosystem.
Sara Landset, Taghi M. Khoshgoftaar, Aaron N. Richter, Tawfiq Hasanin
Published in: J. Big Data (2015)
Keyphrases
- big data
- analytic tools
- open source
- machine learning
- data analytics
- data science
- data analysis
- knowledge discovery
- cloud computing
- business intelligence
- social media
- data stores
- data intensive
- data management
- data visualization
- data processing
- unstructured data
- high volume
- vast amounts of data
- commodity hardware
- massive data
- data warehousing
- big data analytics
- information extraction
- map reduce
- massive datasets
- data mining
- predictive modeling
- model selection
- decision support
- business analytics
- information retrieval
- database management systems
- object oriented
- management system
- database systems
- information systems
- artificial intelligence
- data sets
- statistical and machine learning