Author:Mark Hall <mhall{[at]}pentaho.com>
Category:Distributed
Changes:Fixed a bug where MTJ-related jar files were not getting into the classpath for Spark jobs running on a cluster.
Date:2017-01-18
Depends:weka (>=3.9.1), distributedWekaBase (>=1.0.14)
Description:Provides Spark wrappers for the classes in distributedWekaBase. Includes generic Spark 1.1.1 libraries, which are sufficient for running in local mode against the local filesystem out of the box. To run against Hadoop/HDFS, delete all the libraries in ${user.home}/wekafiles/distributedWekaSpark/lib and copy in the spark-assembly-A.B.C-hadoopX.Y-Z.jar file that is bundled with the Spark distribution compiled for your version of Hadoop.
License:GPL 3
Maintainer:Mark Hall <mhall{[at]}pentaho.com>
MessageToDisplayOnInstall:Includes generic Spark 1.1.1 libraries, which are sufficient for running in local mode against the local filesystem out of the box. To run against Hadoop/HDFS, delete all the libraries in ${WEKA_HOME}/distributedWekaSpark/lib and copy in the spark-assembly-A.B.C-hadoopX.Y-Z.jar file that is bundled with the Spark distribution compiled for your version of Hadoop.
PackageURL:http://downloads.sourceforge.net/weka/weka-packages/distributedWekaSpark1.1.8.zip
URL:http://markahall.blogspot.co.nz/2015/03/weka-and-spark.html
Version:1.1.8