Publication: A distributed evolutionary based instance selection algorithm for big data using Apache Spark.