Scalable Manifold Learning for Big Data with Apache Spark.
Frank SchoenemanJaroslaw ZolaPublished in: CoRR (2018)
Keyphrases
- big data
- manifold learning
- data intensive computing
- cloud computing
- map reduce
- low dimensional
- nonlinear dimensionality reduction
- open source
- data analysis
- commodity hardware
- dimensionality reduction
- diffusion maps
- high dimensional data
- semi supervised
- social media
- data management
- data processing
- high dimensional
- data visualization
- data science
- dimension reduction
- big data analytics
- sparse representation
- knowledge discovery
- locally linear embedding
- manifold structure
- feature extraction
- business intelligence
- feature space
- information processing
- data warehousing
- decision support
- data analytics
- real world
- object oriented
- nearest neighbor
- database systems
- decision making
- feature selection
- information retrieval
- machine learning
- data mining