Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies.
Alex Gittens, Aditya Devarakonda, Evan Racah, Michael F. Ringenburg, Lisa Gerhardt, Jey Kottalam, Jialin Liu, Kristyn J. Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat
Published in: CoRR (2016)
Keyphrases
- matrix factorization
- data analytics
- collaborative filtering
- recommender systems
- nonnegative matrix factorization
- low rank
- factorization methods
- missing data
- big data
- parallel implementation
- data mining
- parallel algorithm
- internet search
- implicit feedback
- message passing
- databases
- test collection
- cloud computing
- open source
- information retrieval
- real world