WEKA Packages

IMPORTANT: make sure there are no old versions of Weka (<3.7.2) in your CLASSPATH before starting Weka

Installation of Packages

A GUI package manager is available from the "Tools" menu of the GUIChooser

java -jar weka.jar

For a command line package manager type:
java weka.core.WekaPackageManager -h


Running packaged algorithms from the command line

java weka.Run [algorithm name]

Substring matching is also supported. E.g. try:

java weka.Run Bayes


Available Packages (128)

AnDEClassificationAveraged N-Dependence Estimators (includes A1DE and A2DE)
ArabicStemmers_LightStemmersPreprocessingArabic Stemmer / Light Stemmer
CHIRPClassificationCHIRP: A new classifier based on Composite Hypercubes on Iterated Random Projections
CLOPEClusteringCLOPE: a fast and effective clustering algorithm for transactional data
DMNBtextText classificationClass for building and using a Discriminative Multinomial Naive Bayes classifier
DTNBClassificationClass for building and using a decision table/naive bayes hybrid classifier.
DilcaDistanceDistance Learning distance measure for categorical data
DistributionBasedBalancePreprocessingDistribution-based balancing of datasets
EMImputationPreprocessingReplaces missing numeric values using Expectation Maximization with a multivariate normal model.
EvolutionarySearchAttribute selectionAn Evolutionary Algorithm (EA) to explore the space of attributes.
GPAttributeGenerationClassification, PreprocessingGenetic Programming Attribute Generation
J48graftClassificationClass for generating a grafted (pruned or unpruned) C4.5 decision tree
JDBCDriversDummyPackageMiscDummy package that provides a place to drop JDBC driver jar files so that they get loaded by the system.
LVQClusteringCluster data using the Learning Vector Quantization algorithm.
LibLINEARClassificationA wrapper class for the liblinear classifier
LibSVMClassificationA wrapper class for the libsvm tools
NNgeClassificationNearest-neighbor-like algorithm using non-nested generalized exemplars (which are hyperrectangles that can be viewed as if-then rules)
PCPVisualizationParallel Coordinates Plot
PSOSearchAttribute selectionAn implementation of the Particle Swarm Optimization (PSO) algorithm to explore the space of attributes.
RBFNetworkClassification/regressionClasses that implement radial basis function networks.
RPluginR integrationExecute R Scripts
RerankingSearchAttribute selectionMeta-Search algorithm which performs a Hybrid feature selection based on re-ranking
SMOTEPreprocessingResamples a dataset by applying the Synthetic Minority Oversampling TEchnique (SMOTE).
SPegasosClassificationImplements the stochastic variant of the Pegasos (Primal Estimated sub-GrAdient SOlver for SVM) method of Shalev-Shwartz et al. (2007).
SSFAttribute SelectionSimplified Silhouette Filter
SVMAttributeEvalAttribute selectionEvaluates the worth of an attribute by using an SVM classifier.
SelfOrganizingMapClusteringCluster data using the Kohonen's Self-Organizing Map algorithm.
SparseGenerativeModelText classificationSparse Generative Model
TPPVisualizationTargeted Projection Pursuit
WekaExcelConverterWEKA MS Excel loader/saver
WekaODFConverterWEKA ODF loader/saver
XMeansClusteringCluster data using the X-means algorithm.
alternatingDecisionTreesClassificationBinary-class alternating decision trees and multi-class alternating decision trees.
anonymizationPackagePreprocessingA Filter to apply k-anonymization and l-diversity
associationRulesVisualizerVisualizationA visualization component for displaying association rules that uses a modified version of the Association Rules Viewer from DESS IAGL of Lille.
attributeSelectionSearchMethodsAttribute selectionFour search methods for attribute selection: ExhaustiveSearch, GeneticSearch, RandomSearch and RankSearch.
averagedOneDependenceEstimatorsClassificationFamily of methods for learning ensembles of naive Bayes-like classifiers (with relaxed independence assumption) - includes AODE, WAODE and AODEsr
bayesianLogisticRegressionText classificationImplements Bayesian Logistic Regression for both Gaussian and Laplace Priors
bestFirstTreeClassificationClass for building a best-first decision tree classifier.
cascadeKMeansClusteringk-means clustering with automatic selection of k
cassandraConvertersConvertersLoader and saver for the cassandra NoSQL database
chiSquaredAttributeEvalAttribute selectionAttribute evaluator that evaluates the worth of an attribute by computing the value of the chi-squared statistic with respect to the class.
citationKNNMulti-instance learningModified version of the Citation kNN multi instance classifier
classAssociationRulesAssociationsClass association rules algorithms (including an implementation of the CBA algorithm).
classificationViaClusteringClassificationA simple meta-classifier that uses a clusterer for classification.
classificationViaRegressionClassificationClass for doing classification using regression methods.
classifierBasedAttributeSelectionAttribute selectionA subset evaluator and an attribute evaluator for evaluating the merit of subsets and single attributes respectively using a classifier.
classifierErrorsVisualizationA visualization component for displaying errors from numeric schemes using the JMathTools library.
complementNaiveBayesClassificationClass for building and using a Complement class Naive Bayes classifier.
conjunctiveRuleClassificationThis class implements a single conjunctive rule learner that can predict for numeric and nominal class labels.
consistencySubsetEvalAttribute selectionEvaluates the worth of a subset of attributes by the level of consistency in the class values when the training instances are projected onto the subset of attributes.
costSensitiveAttributeSelectionAttribute selectionTwo meta attribute selection evaluators (one attribute-based and the other subset-based) for performing cost-sensitive attribute selection.
daggingEnsemble learningThis meta classifier creates a number of disjoint, stratified folds out of the data and feeds each chunk of data to a copy of the supplied base classifier.
decorateEnsemble learningDECORATE is a meta-learner for building diverse ensembles of classifiers by using specially constructed artificial training examples
denormalizePreprocessingAn instance filter that collapses instances with a common grouping ID value into a single instance.
ensembleLibraryEnsemble learningManages a libary of ensemble classifiers
ensemblesOfNestedDichotomiesEnsemble learningA meta classifier for handling multi-class datasets with 2-class classifiers by building an ensemble of nested dichotomies.
extraTreesClassificationPackage for generating a single Extra-Tree
fastCorrBasedFSAttribute selectionFeature selection method based on correlation measureand relevance and redundancy analysis
filteredAttributeSelectionAttribute selectionTwo meta attribute selection evaluators (one attribute-based and the other subset-based) for filtering data before performing attribute selection.
functionalTreesClassificationClassifier for learning Functional Trees
fuzzyLaticeReasoningClassificationThe Fuzzy Lattice Reasoning Classifier uses the notion of Fuzzy Lattices for creating a Reasoning Environment
fuzzyUnorderedRuleInductionClassificationFuzzy Unordered Rule Induction Algorithm
gaussianProcessesRegressionImplements Gaussian Processes for regression without hyperparameter-tuning.
generalizedSequentialPatternsAssociationsClass implementing a GSP algorithm for discovering sequential patterns in a sequential data set
gradingEnsemble learningImplements Grading. The base classifiers are "graded".
gridSearchClassificationPerforms a grid search of parameter pairs for the a classifier.
hiddenNaiveBayesClassificationContructs Hidden Naive Bayes classification model with high classification accuracy and AUC
hiveJDBCMiscA package containing the JDBC driver and dependencies for the Apache Hive database, along with a DatabaseUtils.props file for use with Weka.
hotSpotAssociationsHotSpot learns a set of rules (displayed in a tree-like structure) that maximize/minimize a target variable/value of interest.
hyperPipesClassificationClass implementing a HyperPipe classifier.
isolationForestOutlierClass for building and using a classifier built on the Isolation Forest anomaly detection algorithm.
isotonicRegressionRegressionLearns an isotonic regression model.
jfreechartOffscreenRendererKnowledgeFlowOffscreen chart renderer plugin for the Knowledge Flow that uses JFreeChart
jsonFieldExtractorKnowledge FlowExtract fields from repeating JSON structures.
kfGroovyKnowledgeFlowA Knowledge Flow plugin that provides a Knowledge Flow step that wraps around a Groovy script.
kfKettleKnowledgeFlowA Knowledge Flow plugin that serves as a data source for data coming from the Kettle ETL tool.
kfPMMLClassifierScoringKnowledgeFlowA Knowledge Flow plugin that provides a Knowledge Flow step for scoring test sets or instance streams using a PMML classifier.
latentSemanticAnalysisPreprocessingPerforms latent semantic analysis and transformation of the data
lazyAssociativeClassifierClassificationLazy Associative Classifier
lazyBayesianRulesClassificationLazy Bayesian Rules Classifier
leastMedSquaredRegressionImplements a least median squared linear regression utilizing the existing weka LinearRegression class to form predictions.
levenshteinEditDistanceDistance measureComputes the Levenshtein edit distance between two strings
linearForwardSelectionAttribute selectionExtension of BestFirst that takes a restricted number of k attributes into account.
localOutlierFactorOutlierFilter implementing the Local Outlier Factor (LOF) outlier/anomaly detection algorithm.
massiveOnlineAnalysisData streamsMOA (Massive On-line Analysis).
metaCostClassificationThis metaclassifier makes its base classifier cost-sensitive using Pedro Domingo's method.
multiBoostABEnsemble learningClass for boosting a classifier using the MultiBoosting method.
multiInstanceFiltersPreprocessingA collection of filters for manipulating multi-instance data.
multiInstanceLearningMulti-instance learningA collection of multi-instance learning classifiers.
multiLayerPerceptronsClassification/RegressionThis package currently contains classes for training multilayer perceptrons with one hidden layer for both classification and regression problems,
multilayerPerceptronCSClassificationAn extension of the standard MultilayerPerceptron classifier in Weka that adds context-sensitive Multiple Task Learning (csMTL)
naiveBayesTreeClassificationClass for generating a decision tree with naive Bayes classifiers at the leaves.
normalizePreprocessingAn instance filter that normalize instances considering only numeric attributes and ignoring class index
oneClassClassifierClassificationPerforms one-class classification on a dataset.
optics_dbScanClusteringThe OPTICS and DBScan clustering algorithms
ordinalClassClassifierClassificationMeta classifier that allows standard classification algorithms to be applied to ordinal class problems.
ordinalLearningMethodClassificationAn implementation of the Ordinal Learning Method (OLM)
ordinalStochasticDominanceClassificationAn implementation of the Ordinal Stochastic Dominance Learner
paceRegressionRegressionClass for building pace regression linear models and using them for prediction.
partialLeastSquaresPreprocessingPartial least squares filter and classifier for performing PLS regression.
predictiveAprioriAssociationsClass implementing the predictive apriori algorithm for mining association rules.
prefuseGraphVisualizationA visualization component for displaying graphs that uses the prefuse visualization toolkit.
prefuseGraphViewerKnowledgeFlowA Knowledge Flow visualization component for displaying trees and graphs that uses the prefuse visualization toolkit.
prefuseTreeVisualizationA visualization component for displaying trees that uses the prefuse visualization toolkit.
probabilisticSignificanceAEAttribute SelectionEvaluates the worth of an attribute by computing the Probabilistic Significance as a two-way function
raceSearchAttribute SelectionRaces the cross validation error of competing attribute subsets.
racedIncrementalLogitBoostEnsemble learningClassifier for incremental learning of large datasets by way of racing logit-boosted committees.
realAdaBoostEnsemble learningClass for boosting a 2-class classifier using the Real Adaboost method.
regressionByDiscretizationRegressionA regression scheme that employs any classifier on a copy of the data that has the class attribute (equal-width) discretized.
ridorClassificationAn implementation of a RIpple-DOwn Rule learner.
rotationForestEnsemble learningEnsembles of decision trees trained on rotated subsamples of the training data.
sasLoaderConverterSAS sas7bdat file reader
scatterPlot3DVisualizationA visualization component for displaying a 3D scatter plot of the data using Java 3D.
scriptingClassifiersClassificationWrapper classifiers for Jython and Groovy scripting code.
sequentialInformationalBottleneckClustererClusteringCluster data using the sequential information bottleneck algorithm.
simpleCARTClassificationClass implementing minimal cost-complexity pruning.
simpleEducationalLearningSchemesClassificationSimple learning schemes for educational purposes (Prism, Id3, IB1 and NaiveBayesSimple).
stackingCEnsemble learningImplements StackingC (more efficient version of stacking)
tabuAndScatterSearchAttribute selectionSearch methods contributed by Adrian Pino (ScatterSearchV1, TabuSearch)
tertiusAssociationsFinds rules according to confirmation measure (Tertius-type algorithm)
thresholdSelectorClassificationA metaclassifier that selecting a mid-point threshold on the probability output by a Classifier.
timeseriesForecastingTime seriesTime series forecasting environment.
userClassifierClassification/regressionInteractively classify through visual means.
votingFeatureIntervalsClassificationClassification by voting feature intervals.
waveletPreprocessingA filter for wavelet transformation.
wekaServerServerSimple server for executing Weka tasks.
winnowClassificationImplements Winnow and Balanced Winnow algorithms by Littlestone