Publication: On Scalability of Distributed Machine Learning with Big Data on Apache Spark.