WEKA Packages

IMPORTANT: make sure there are no old versions of Weka (<3.7.2) in your CLASSPATH before starting Weka

Installation of Packages

A GUI package manager is available from the "Tools" menu of the GUIChooser

java -jar weka.jar

For a command line package manager type:
java weka.core.WekaPackageManager -h


Running packaged algorithms from the command line

java weka.Run [algorithm name]

Substring matching is also supported. E.g. try:

java weka.Run Bayes


Available Packages (211)

AffectiveTweetsText classificationText Filters for Analyzing Sentiment and Emotions of Tweets
AnDEClassificationAveraged N-Dependence Estimators (includes A1DE and A2DE)
AnalogicalModelingClassificationAnalogical Modeling
ArabicStemmers_LightStemmersPreprocessingArabic Stemmer / Light Stemmer
Auto-WEKAClassification, Regression, Attribute SelectionAutomatically find the best model and parameters for a dataset.
BANGFileClusteringBANG-File Clustering
CAIRADClassificationCAIRAD: A Co-appearance based Analysis for Incorrect Records and Attribute-values Detection
CFWNBClassificationContructs Correlation-based Feature Weighted Naive Bayes (CFWNB)
CHIRPClassificationCHIRP: A new classifier based on Composite Hypercubes on Iterated Random Projections
CLOPEClusteringCLOPE: a fast and effective clustering algorithm for transactional data
CSForestClassificationCSForest
CVAttributeEvalAttribute selectionAn Variation degree Algorithm to explore the space of attributes.
DMIPreprocessingDMI
DMNBtextText classificationClass for building and using a Discriminative Multinomial Naive Bayes classifier
DTNBClassificationClass for building and using a decision table/naive Bayes hybrid classifier.
DilcaDistanceDistance Learning distance measure for categorical data
DistributionBasedBalancePreprocessingDistribution-based balancing of datasets
EAR4Regression, Ensemble learningCase-Based Regression Learner
EBMCClassificationEfficient Bayesian Multivariate Classifier
EMImputationPreprocessingReplaces missing numeric values using Expectation Maximization with a multivariate normal model.
EvolutionarySearchAttribute selectionAn Evolutionary Algorithm (EA) to explore the space of attributes.
ForExPlusPlusClassificationForEx++: A New Framework for Knowledge Discovery from Decision Forests
ForestPAClassificationForestPA: Constructs a Decision Forest by Penalizing Attributes used in Previous Trees.
GPAttributeGenerationClassification, PreprocessingGenetic Programming Attribute Generation
GenClustPlusPlusClusteringGenClust++
HMMClassification, Multiinstance, SequenceHidden Markov Model
IBkLGClassificationLog and Gaussian kernel for K-NN
IPCPVisualizationInterative Parallel Coordinates Plot
IWSSAttribute selectionIncremental Wrapper Subset Selection
IWSSembeddedNBAttribute selectionIncremental Wrapper Subset Selection with embedded NB classifier
J48ConsolidatedClassificationClass for generating a pruned or unpruned C45 consolidated tree
J48PartiallyConsolidatedClassification, Ensemble learningClass for generating a Partially Consolidated Tree-Bagging (PCTBagging) multiple classifier
J48graftClassificationClass for generating a grafted (pruned or unpruned) C4.5 decision tree
JCDTClassification, RegressionJava Credal Decision Tree (JCDT)
JCHAIDStarClassificationClass for generating a decision tree based on the CHAID* algorithm
JDBCDriversDummyPackageMiscDummy package that provides a place to drop JDBC driver jar files so that they get loaded by the system.
LVQClusteringCluster data using the Learning Vector Quantization algorithm.
LibLINEARClassificationA wrapper class for the liblinear classifier
LibSVMClassification, RegressionA wrapper class for the libsvm tools
MODLEMClassification, Ensemble learningMODLEM rule algorithm
MultiObjectiveEvolutionaryFuzzyClassifierClassificationMultiObjectiveEvolutionaryFuzzyClassifier
MultiObjectiveEvolutionarySearchAttribute selectionAn Multi-objective Evolutionary Algorithm (MOEA) to explore the attribute space.
NNgeClassificationNearest-neighbor-like algorithm using non-nested generalized exemplars (which are hyperrectangles that can be viewed as if-then rules)
OpenmlWekaClassification, ExperimenterOpenml Weka
OptimizedForestClassificationOptimizedForest
PSOSearchAttribute selectionAn implementation of the Particle Swarm Optimization (PSO) algorithm to explore the space of attributes.
PairwiseConsistencyAttributeEvalAttribute selectionAttribute evaluator that evaluates the worth of an attribute i by adding the consistency rates of the attribute subsets composed of attribute i and each of the other attributes.
PairwiseCorrelationAttributeEvalAttribute selectionAttribute evaluator that evaluates the worth of an attribute i by computing the mean of the worths (using CfsSubsetEval) of the attribute subsets composed of attribute i and each of the other attributes.
RBFNetworkClassification/regressionClasses that implement radial basis function networks.
RPluginR integrationExecute R scripts and learning algorithms
RankCorrelationMetricsRank Correlation Evaluation Metrics
RankerByDTClassificationClassification, VisualizationRanker Based on Decision Tree Classification
RerankingSearchAttribute selectionMeta-Search algorithm which performs a Hybrid feature selection based on re-ranking
RseslibClassificationRough Sets, Rule Induction and Analogy-Based Reasoning
SMOTEPreprocessingResamples a dataset by applying the Synthetic Minority Oversampling TEchnique (SMOTE).
SPAARCClassificationSPAARC: Constructs a Decision Tree using Split-Point Sampling and Node Attribute Subsampling.
SPMFWrapperAssociationsSPMFWrapper
SPegasosClassificationImplements the stochastic variant of the Pegasos (Primal Estimated sub-GrAdient SOlver for SVM) method of Shalev-Shwartz et al. (2007).
SVMAttributeEvalAttribute selectionEvaluates the worth of an attribute by using an SVM classifier.
SelfOrganizingMapClusteringCluster data using the Kohonen's Self-Organizing Map algorithm.
SmoothPrivateForestClassificationSmooth Private Forest for Differential Privacy
SparseGenerativeModelText classificationSparse Generative Model
StudentFiltersPreprocessingStudent Filters
SysForClassificationSysFor: Systematically Developed Forest of Multiple Decision Trees.
TPPVisualizationTargeted Projection Pursuit
WekaExcelConverterWEKA MS Excel loader/saver
WekaODFConverterWEKA ODF loader/saver
WekaPyScriptClassificationWekaPyScript
WiSARDClassificationmulti-class classifier using the WiSARD weightless neural network model.
XMeansClusteringCluster data using the X-means algorithm.
alternatingDecisionTreesClassificationBinary-class alternating decision trees and multi-class alternating decision trees.
alternatingModelTreesRegressionAlternating Model Trees
arxAnonymizerPreprocessingARX Anonymization Filter
associationRulesVisualizerVisualizationA visualization component for displaying association rules that uses a modified version of the Association Rules Viewer from DESS IAGL of Lille.
attributeSelectionSearchMethodsAttribute selectionFour search methods for attribute selection: ExhaustiveSearch, GeneticSearch, RandomSearch and RankSearch.
bayesianLogisticRegressionText classificationImplements Bayesian Logistic Regression for both Gaussian and Laplace Priors
bestFirstTreeClassificationClass for building a best-first decision tree classifier.
calibrationCurveVisualizationVisualizePlugin for plotting class probability calibration curves
cascadeKMeansClusteringk-means clustering with automatic selection of k
cassandraConvertersConvertersLoader and saver for the cassandra NoSQL database
chiSquaredAttributeEvalAttribute selectionAttribute evaluator that evaluates the worth of an attribute by computing the value of the chi-squared statistic with respect to the class.
citationKNNMulti-instance learningModified version of the Citation kNN multi instance classifier
classAssociationRulesAssociationsClass association rules algorithms (including an implementation of the CBA algorithm).
classificationViaClusteringClassificationA simple meta-classifier that uses a clusterer for classification.
classificationViaRegressionClassificationClass for doing classification using regression methods.
classifierBasedAttributeSelectionAttribute selectionA subset evaluator and an attribute evaluator for evaluating the merit of subsets and single attributes respectively using a classifier.
classifierErrorsVisualizationA visualization component for displaying errors from numeric schemes using the JMathTools library.
clojureClassifierClassificationWrapper classifiers for classifiers implemented in the Clojure programming language
complementNaiveBayesClassificationClass for building and using a Complement class Naive Bayes classifier.
conjunctiveRuleClassificationThis class implements a single conjunctive rule learner that can predict for numeric and nominal class labels.
consistencySubsetEvalAttribute selectionEvaluates the worth of a subset of attributes by the level of consistency in the class values when the training instances are projected onto the subset of attributes.
costSensitiveAttributeSelectionAttribute selectionTwo meta attribute selection evaluators (one attribute-based and the other subset-based) for performing cost-sensitive attribute selection.
daggingEnsemble learningThis meta classifier creates a number of disjoint, stratified folds out of the data and feeds each chunk of data to a copy of the supplied base classifier.
decorateEnsemble learningDECORATE is a meta-learner for building diverse ensembles of classifiers by using specially constructed artificial training examples
denormalizePreprocessingAn instance filter that collapses instances with a common grouping ID value into a single instance.
discriminantAnalysisClassificationClasses for linear and quadratic discriminant analysis
distributedWekaBaseDistributedGeneric configuration classes and distributed map/reduce type tasks for Weka
distributedWekaHadoopDistributedHadoop wrappers for Weka
distributedWekaHadoop2DistributedHadoop 2 wrappers for Weka
distributedWekaHadoop2LibsDistributedHadoop 2.x libraries for distributedWekaHadoop
distributedWekaHadoopCoreDistributedCore Hadoop wrappers for Weka
distributedWekaHadoopLibsDistributedHadoop 1.x libraries for distributedWekaHadoop
distributedWekaSparkDistributedSpark wrappers for Weka
distributedWekaSpark2DevDistributedSpark 2.x wrappers for Weka
distributedWekaSpark3DevDistributedSpark 3.x wrappers for Weka
distributedWekaSparkDevDistributedSpark wrappers for Weka
dualPerturbAndCombineClassification and regressionClass for building and using classification and regression trees based on the closed-form dual perturb and combine algorithm.
elasticNetRegressionAn implementation of the elastic net method for linear regression
ensembleLibraryEnsemble learningManages a libary of ensemble classifiers
ensemblesOfNestedDichotomiesEnsemble learningA meta classifier for handling multi-class datasets with 2-class classifiers by building an ensemble of nested dichotomies.
extraTreesClassificationPackage for generating a single Extra-Tree
fastCorrBasedFSAttribute selectionFeature selection method based on correlation measureand relevance and redundancy analysis
filteredAttributeSelectionAttribute selectionTwo meta attribute selection evaluators (one attribute-based and the other subset-based) for filtering data before performing attribute selection.
fourierTransformPreprocessingFilters for transforming using the fast fourier transform
functionalTreesClassificationClassifier for learning Functional Trees
fuzzyLaticeReasoningClassificationThe Fuzzy Lattice Reasoning Classifier uses the notion of Fuzzy Lattices for creating a Reasoning Environment
fuzzyUnorderedRuleInductionClassificationFuzzy Unordered Rule Induction Algorithm
gaussianProcessesRegressionImplements Gaussian Processes for regression without hyperparameter-tuning.
generalizedSequentialPatternsAssociationsClass implementing a GSP algorithm for discovering sequential patterns in a sequential data set
gradingEnsemble learningImplements Grading. The base classifiers are "graded".
graphgramClustering, VisualizationGraphGram - Visualization for Clusterings
gridSearchClassificationPerforms a grid search of parameter pairs for the a classifier.
hiddenNaiveBayesClassificationContructs Hidden Naive Bayes classification model with high classification accuracy and AUC
hiveJDBCMiscA package containing the JDBC driver and dependencies for the Apache Hive database, along with a DatabaseUtils.props file for use with Weka.
hotSpotAssociationsHotSpot learns a set of rules (displayed in a tree-like structure) that maximize/minimize a target variable/value of interest.
hyperPipesClassificationClass implementing a HyperPipe classifier.
imageFiltersPreprocessingA package that contains filters to process image files.
isolationForestOutlierClass for building and using a classifier built on the Isolation Forest anomaly detection algorithm.
isotonicRegressionRegressionLearns an isotonic regression model.
iterativeAbsoluteErrorRegressionRegressionA meta learner that fits a regression model to minimize absolute error.
javaFXScatterPlot3DVisualizationA visualization component for displaying a 3D scatter plot of the data using JavaFX 3D.
jfreechartOffscreenRendererKnowledgeFlowOffscreen chart renderer plugin for the Knowledge Flow that uses JFreeChart
jsonFieldExtractorKnowledge FlowExtract fields from repeating JSON structures.
kerasZooPython integrationProvides a wrapper classifier for zoo models available in Keras
kerasZoo10MonkeysExamplePython integrationProvides a Knowledge Flow template example for training a DL model on the Kaggle 10 monkeys data
kernelLogisticRegressionClassificationA package that contains a class to train a two-class kernel logistic regression model.
kfGroovyKnowledgeFlowA Knowledge Flow plugin that provides a Knowledge Flow step that wraps around a Groovy script.
kfKettleKnowledgeFlowA Knowledge Flow plugin that serves as a data source for data coming from the Kettle ETL tool.
kfPMMLClassifierScoringKnowledgeFlowA Knowledge Flow plugin that provides a Knowledge Flow step for scoring test sets or instance streams using a PMML classifier.
largeScaleKernelLearningPreprocessingA package that contains filters for large-scale kernel-based learning
latentSemanticAnalysisPreprocessingPerforms latent semantic analysis and transformation of the data
lazyAssociativeClassifierClassificationLazy Associative Classifier
lazyBayesianRulesClassificationLazy Bayesian Rules Classifier
leastMedSquaredRegressionImplements a least median squared linear regression utilizing the existing weka LinearRegression class to form predictions.
levenshteinEditDistanceDistance measureComputes the Levenshtein edit distance between two strings
linearForwardSelectionAttribute selectionExtension of BestFirst that takes a restricted number of k attributes into account.
localOutlierFactorOutlierFilter implementing the Local Outlier Factor (LOF) outlier/anomaly detection algorithm.
logarithmicErrorMetricsMetricsRoot mean square logarithmic error and mean absolute logarithmic error
massiveOnlineAnalysisData streamsMOA (Massive On-line Analysis).
metaCostClassificationThis metaclassifier makes its base classifier cost-sensitive using Pedro Domingo's method.
metaphorSearchMethodsAttribute selectionAn implementation of metaphor search methods to explore the space of attributes.
multiBoostABEnsemble learningClass for boosting a classifier using the MultiBoosting method.
multiInstanceFiltersPreprocessingA collection of filters for manipulating multi-instance data.
multiInstanceLearningMulti-instance learningA collection of multi-instance learning classifiers.
multiLayerPerceptronsClassification/regression, PreprocessingThis package currently contains classes for training multilayer perceptrons with one hidden layer for classification and regression, and autoencoders.
multilayerPerceptronCSClassificationAn extension of the standard MultilayerPerceptron classifier in Weka that adds context-sensitive Multiple Task Learning (csMTL)
multisearchClassificationMultiSearch Parameter Optimization
naiveBayesTreeClassificationClass for generating a decision tree with naive Bayes classifiers at the leaves.
netlibNativeLinuxLinear Algebranetlib-java wrappers and native libraries for BLAS, LAPACK and ARPACK under Linux
netlibNativeOSXLinear Algebranetlib-java wrappers and native libraries for BLAS, LAPACK and ARPACK under OS X
netlibNativeOSXarmLinear Algebranetlib-java wrappers and native libraries for BLAS, LAPACK and ARPACK under OS X (macOS) for Apple Arm processors
netlibNativeWindowsLinear Algebranetlib-java wrappers and native libraries for BLAS, LAPACK and ARPACK under Windows
newKnowledgeFlowStepExamplesExamplesExample Step implementations for the new Knowledge Flow, as described in the Weka manual
niftiLoaderConverterPackage for loading a directory with MRI data in NIfTI format into WEKA
normalizePreprocessingAn instance filter that normalize instances considering only numeric attributes and ignoring class index
oneClassClassifierClassificationPerforms one-class classification on a dataset.
optics_dbScanClusteringThe OPTICS and DBSCAN clustering algorithms
ordinalClassClassifierClassificationMeta classifier that allows standard classification algorithms to be applied to ordinal class problems.
ordinalLearningMethodClassificationAn implementation of the Ordinal Learning Method (OLM)
ordinalStochasticDominanceClassificationAn implementation of the Ordinal Stochastic Dominance Learner
paceRegressionRegressionClass for building pace regression linear models and using them for prediction.
partialLeastSquaresPreprocessingPartial least squares filter and classifier for performing PLS regression.
percentageErrorMetricsMetricsRoot mean square percentage error and mean absolute percentage error
predictiveAprioriAssociationsClass implementing the predictive apriori algorithm for mining association rules.
prefuseGraphVisualizationA visualization component for displaying graphs that uses the prefuse visualization toolkit.
prefuseGraphViewerKnowledgeFlowA Knowledge Flow visualization component for displaying trees and graphs that uses the prefuse visualization toolkit.
prefuseTreeVisualizationA visualization component for displaying trees that uses the prefuse visualization toolkit.
probabilisticSignificanceAEAttribute SelectionEvaluates the worth of an attribute by computing the Probabilistic Significance as a two-way function
probabilityCalibrationTreesProbability calibration, Ensemble learningProbability calibration trees plus ensemble learning using cascade generalization
raceSearchAttribute SelectionRaces the cross validation error of competing attribute subsets.
racedIncrementalLogitBoostEnsemble learningClassifier for incremental learning of large datasets by way of racing logit-boosted committees.
realAdaBoostEnsemble learningClass for boosting a 2-class classifier using the Real Adaboost method.
regressionByDiscretizationRegressionA regression scheme that employs any classifier on a copy of the data that has the class attribute (equal-width) discretized.
ridorClassificationAn implementation of a RIpple-DOwn Rule learner.
rotationForestEnsemble learningEnsembles of decision trees trained on rotated subsamples of the training data.
sasLoaderConverterSAS sas7bdat file reader
scatterPlot3DVisualizationA visualization component for displaying a 3D scatter plot of the data using Java 3D.
scriptingClassifiersClassificationWrapper classifiers for Jython and Groovy scripting code.
sequentialInformationalBottleneckClustererClusteringCluster data using the sequential information bottleneck algorithm.
simpleCARTClassificationClass implementing minimal cost-complexity pruning.
simpleEducationalLearningSchemesClassificationSimple learning schemes for educational purposes (Prism, Id3, IB1 and NaiveBayesSimple).
snowball-stemmersPreprocessingSnowball stemmers
stackingCEnsemble learningImplements StackingC (more efficient version of stacking)
streamingUnivariateStatsKnowledgeFlowA Knowledge Flow step to compute summary statistics incrementally
supervisedAttributeScalingPreprocessingA simple filter to rescale attributes to reflect their discriminative power.
tabuAndScatterSearchAttribute selectionSearch methods contributed by Adrian Pino (ScatterSearchV1, TabuSearch)
tertiusAssociationsFinds rules according to confirmation measure (Tertius-type algorithm)
thresholdSelectorClassificationA metaclassifier that selecting a mid-point threshold on the probability output by a Classifier.
tigerjythonScriptingTigerJython
timeSeriesFiltersFilters, Time SeriesTime Series Filters
timeseriesForecastingTime seriesTime series forecasting environment.
ultraBoostClassificationClass to adaptively boost heterogeneous classifiers
userClassifierClassification/regressionInteractively classify through visual means.
vinesDensity EstimationRegular Vines
votingFeatureIntervalsClassificationClassification by voting feature intervals.
waveletPreprocessingA filter for wavelet transformation.
wekaDeeplearning4jClassification/RegressionWeka wrappers for Deeplearning4j
wekaPythonPython integrationProvides integration with CPython in Weka.
wekaRAPIDSPython integrationProvides integration with RAPIDS in Weka.
wekaServerServerSimple server for executing Weka tasks.
winnowClassificationImplements Winnow and Balanced Winnow algorithms by Littlestone