Wednesday, July 29, 2009

Popular Data Mining Software

You can sort the table below by clicking on the column names.

Software Name Details
ACTwo Software AC2 http://www.alice soft.com/products/ac2.html AC2 is a set of C/C librairies allowing developper and IT professional to embed data mining functionalities into their ...
ARtool Software ARtool http://www.cs.umb.edu/~laur/ARtool/ ARtool contains several implementations of algorithms for mining frequent itemsets and association rules. ARtool and its ...
ASOCAGHeidelberg The Knowledge Processing Company ASOC AG Heidelberg The Knowledge Processing Company http://www.asoc.de Contributed by: ASOC AG Heidelberg The Knowledge Processing Company http://www ...
Add Software To Listing Add new software package to The Data Mine: "}% Click on "Software" on top bar to come back once you've done this.'}% %IF{"context authenticated" then ' Please fill ...
Aimm Software AIMM http://www.brandmarc.nl/ Contributed by: m.derksen #64;brandmarcSPAM BLOCKER.nl Note: This info converted from the original "The Data Mine" pages and pre dates ...
Alice Software Alice http://www.alice soft.com/products Alice is a powerful and easy to use Data Mining Tool. Use decision trees to explore exploit your data. Textual reports, ...
All Data Mining Software The table below lists all data mining software whose details have been checked. You may also want to view the OldListOfDataMiningSoftware. You can sort the table below ...
Auto Class C AutoClass C http://ic www.arc.nasa.gov/ic/projects/bayes group/group/autoclass/autoclass c program.html AutoClass C is a public domain version of AutoClass III ...
Bayesian Knowledge Discoverer Bayesian Knowledge Discoverer http://kmi.open.ac.uk/projects/bkd/ Bayesian Knowledge Discoverer (BKD) is a computer program able to learn Bayesian Belief Networks ...
Business 3 D IgorMalinka 22 Feb 2006 We are looking for interested parties for commercialization
CARTFrom Salford Systems CART from Salford Systems http://www.salford systems.com Commercial Contributed by: mlipsey #64;salford systemsSPAM BLOCKER.com Note: This info converted from ...
CARTRBy Salford Systems CART(r) by Salford Systems http://www.salford systems.com Robust decision tree technology for data mining, predictive modeling, and data pre processing. Contributed ...
CViz Software CViz http://www.alphaWorks.ibm.com/formula/cviz CViz is a visualization tool designed for analyzing high dimensional data (data with many elements) in large, complex ...
Castaneda DMS Castaneda DMS http://www.girgese.com/ A data mining suite for use on personal computers. It provides association rules, FOIL algorithm, clustering and decision ...
Clementine Software N.B. see page " SPSS Clementine " to edit this information
Codework Three Way Tangram Codework 3 way TANGRAM http://www.codework it.com/tangram/ 3 way TANGRAM is a desktop OLAP for the Windows platform. Write in APL code. Contributed by: codework ...
Commercial Tools For Data Mining Commercial tools for data mining http://www.cs.bham.ac.uk/~anp/dm docs/oudshoff.tools.posting Posting by Sandra Oudshoff on comp.ai summarizing information on a ...
Cygron Data Scope Cygron DataScope http://www.cygron.com Visual data mining and decision support tool with ODBC import capability, html export, 3D interactive graphs, automatic relation ...
Cypress The Integrated Document And Knowledge Server Cypress, The Integrated Document and Knowledge Server® http://www.cypressdelivers.com Commercial Cypress® is a document and knowledge management system that captures ...
Dat Gen Dataset Generator (DatGen) http://www.datasetgenerator.com One important way to test learning from example algorithms is to evaluate their performance against well ...
Data Analysis Software For Scientific Analysis Of Experimental Data Data analysis software for scientific analysis of experimental data gopher://calypso.oit.unc.edu:70/11/../.pub/academic/data analysis Misc software (editors note ...
Data Engine DataEngine http://www.mitgmbh.de DataEngine is a software product for data analysis using fuzzy technologies, neural networks, and conventional statistics. It has ...
Data Engine 3 1 DataEngine 3.1 http://www.mitgmbh.de DataEngine is the software for intelligent data analysis and data mining. By using neural networks, fuzzy logic and statistical ...
Data Intelligence Add In AIComponents Data Mining Add In for Excel http://www.aicomponents.com This tool allows you to apply clustering, decision trees, neural networks, and association ...
Data Miner Maximzier Inc Data Miner Maximzier Inc http://www.dmmax.com DMM is a predictive modeling software which was developed in order to maximize profit in business application(e.g ...
Data Mining Software By PMSI Data mining software by PMSI http://www.altern.org/pmsi/home gb.htm Lots of tutorials and shareware in English and French. Contributed by: pmsi #64;alternSPAM BLOCKER ...
Data Mining Suite Data Mining Suite http://www.datamining.com The Data Mining Suite™ is an integrated set of products that provide a powerful, complete and comprehensive solution ...
Data Mining Tool Easy Miner Data Mining Tool Easy Miner http://www.co.umist.ac.uk/~koundour/index.html A data mining tool (Easy Miner) for the areas of : association rules, classification ...
Data Mining Tools AGILE8 INSIGHT© PROCESS ANALYSIS TOOL KIT (from website www.agile8consulting.com) Most of the value added in today's organisations is not in systems, but in the minds ...
Data Mite DataMite http://www.lpa.co.uk/dtm.html DataMite enables rules and knowledge to be discovered in ODBC compliant relational databases. DataMite requires neither programming ...
Data Sage Datasage no longer exists. They were acquired by Vignette. Thanks to GaborMelli for the information AndyPryke 28 Oct 2001
Data Scope DataScope http://www.tiszanet.hu/cygron/DATASCP.HTM The key to knowledge is to display and manage your data in the most 'understandable' form. As you may have experienced ...
Data Surveyor Data Surveyor http://www.ddi.nl Data Surveyor is a data mining tool for expert users. It consists of a suite of powerful algorithms and provides support for all ...
Data XTm DataX(tm) http://www.zaptron.com/datax Contributed by: scott Ivan, scott #64;zaptronSPAM BLOCKER.com Note: This info converted from the original "The Data Mine" ...
Dataset Generator See: DatGen AndyPryke 28 Oct 2001
Db Bridge dbBridge http://www.internetivity.com/ Contributed by: Note: This info converted from the original "The Data Mine" pages and pre dates June 2001. Please remove this ...
Db Bridge Universal Remote Data Connectivity dbBridge Universal Remote Data Connectivity http://www.dbBridge.com Dalco Technologies dbBridge is a client side driver similar to other OLEDB/ODBC driver with ...
Db Probe dbProbe http://www.internetivity.com/ Contributed by: glenn #64;nonlinearSPAM BLOCKER.ca Note: This info converted from the original "The Data Mine" pages and pre ...
Db Prophet Neural Network Data Mining Tool By Trajecta dbProphet: neural network data mining tool by Trajecta http://www.trajecta.com Utilizing sophisticated neural network technologies, Trajecta offers a broad range ...
Decision Tree Decision Tree http://www.creditscore.co.nz Builds decision trees and logarithmic scorecards on any dataset (automatically handles discrete and continuous data) ...
Decisionhouse Software Decisionhouse http://www.quadstone.co.uk Contributed by: ANP Note: This info converted from the original "The Data Mine" pages and pre dates June 2001. Please remove ...
Dimensional Insight Inc Dimensional Insight, Inc. http://www.dimins.com Dimensional Insight offers business intelligence solutions, putting you in command of your business. Companies use ...
Explora Software Explora http://orgwis.gmd.de:80/explora/ An freely available and ftpable Machintosh KDD package. Note: This info converted from the original "The Data Mine" pages ...
FTPAble Machine Learning Software FTP able machine learning software http://www.cs.bham.ac.uk/~anp/dm docs/machine learning.software From comp.ai faq/part4. List of Ftpable machine learning software ...
Fast Mind
Gain Smarts GainSmarts http://www.urbanscience.com GainSmarts is an expert system using profilling and predictive modelling algorithm. The software is platform independent ...
Gornik System Górnik System Tool for advanced Data Mining and analysis including classification, segmentation, survival methods etc. and data processing tools. Runs on Windows ...
Graf FXGraphical Data Mining Shareware Graf FX Graphical Data Mining Shareware http://www.gr fx.com/graf fx.htm Contributed by: fx #64;bigpondSPAM BLOCKER.com Note: This info converted from the original ...
Graf Fx Graf fx ... The Data Mining Tool For Microsft Access http://www.gr fx.com/graf fx.htm Commercial Data mining shareware written entirely in all current versions ...
Guiding Inductive Learning With AQualitative Model Guiding Inductive Learning with a Qualitative Model http://www.cs.utexas.edu/users/pclark/software.html This package allows a qualitative model to bias induction ...
IBMIntelligent Miner For Data IBM Intelligent Miner for Data http://www 4.ibm.com/software/data/iminer/fordata/ Use the IBM DB2 Intelligent Miner for Data to gain new business insights and to ...
IBMVisualization Data Explorer IBM Visualization Data Explorer http://www.almaden.ibm.com/dx/ IBM Visualization Data Explorer is an interactive software program that allows scientists, engineers ...
ISoft Alice ALICE d'ISoft http://www.isoft.fr/ Alice is a powerful and easy to use Data Mining Tool. Use decision trees to explore exploit your data. Textual reports, SQL ...
IXLAnd IDISSoftware IXL and IDIS software http://www.cs.bham.ac.uk/~anp/dm docs/ixl/intern1.txt IXL was one the first commercial discovery and data mining programs which was followed ...
Inlen Project INLEN http://www.mli.gmu.edu/projects/inlen.html This project is concerned with the development of a large scale multi type reasoning system, called INLEN, for extracting ...
Insightful Miner InsightfulMiner, an affordable, scalable full life cycle data mining software. More info at http://www.insightful.com. JudyM 07 Mar 2002
Iris Software IRIS http://allanon.gmd.de/and/java/iris/Iris.html IRIS is a prototype system supporting visual analysis of spatially referenced data. IRIS automatically produces ...
Java Drill Down Demo Java Drill Down Demo http://www.itivity.com You need a Java enhanced browser to see this demo. It shows a demo of data access via "drill down". Note: This info ...
KDNuggets Software List Gregory Piatetsky Shapiro's KD Nuggets Software List: http://www.kdnuggets.com/software/index.html AndyPryke 28 Oct 2001
KXEN KXEN provides next generation business analytics software to drive better corporate decisions. KXEN's unmatched speed, ease of use and scalability enable leading companies ...
Knowledge Access Suite Knowledge Access Suite http://www.datamining.com The Knowledge Access Suite™ has delivered the first and only set of products ever to provide business users with ...
Knowledge Miner KnowledgeMiner http://www.scriptsoftware.com/km/ It discovers relationships in data and forecast using the self organizing GMDH approach. Contributed by: Gregory Ivakhnenko ...
Knowledge Sync Alert Messaging By Vineyardsoft KnowledgeSync Alert Messaging by Vineyardsoft http://www.vineyardsoft.com/ KnowledgeSync 2000 identifies potential business problems (e.g., a pending order for ...
Kovach Computing Services Kovach Computing Services http://www.kovcomp.co.uk/ Contains information about their shareware statistical software as well as links to other sites with statistical ...
Kxen Components KXEN components www.kxen.com Kxen components can be described as: Vapnik based algorithm Robust models open architecture Speed of modeling Ease of use ...
Level Five Quest LEVEL5 Quest http://www.l5r.com We at Level Five Research have developed an interesting twist in data mining which fills what we perceive to be a gap between heavy ...
MLCLibrary Utilities MLC Library / Utilities http://www.sgi.com/Technology/mlc/ MLC is a machine learning library developed in C . MLC is public domain and can be used free of charge ...
Maestro Software Maestro http://www.jjt.com Maestro a metadata driven SAS based statistical analysis tool particulary suited to semiconductor and flat panel display industries. However ...
Magnum Opus Magnum Opus http://www.giwebb.com/ Established software for fast effective discovery of real associations. Designed by data miners for data miners. Incorporates ...
Managed Reporting Environment MRE Managed Reporting Environment (MRE) http://www.SolutionsIQ.com/consulting/mre.html SolutionsIQ’s Managed Reporting Environment (MRE) is a centralized reporting ...
Method And System For Electronic Exchange Of Tax Information Method and system for Electronic Exchange of Tax Information www.cpa network.com Commercial Century Process Associates Patent Pending on a Method and system for ...
Mine Set MineSet http://mineset.sgi.com MineSet 2.5 released in May 1998. It is a fully integrated, comprehensive suite of easy to use analytical and visual data mining tools ...
Mine Set SGI MineSet (SGI) http://www.sgi.com/software/mineset/ the second release of SGI's product for exploratory data analysis. Combining powerful integrated, interactive ...
Model Quest Enterprise ModelQuest Enterprise http://www.abtech.com Highly automated predictive data mining software that includes Expert Mining Strategies, new proprietary modeling techniques ...
Model Quest Enterprise Ab Tech Corporation ModelQuest Enterprise AbTech Corporation http://www.abtech.com Contributed by: updated by Christine Gresser, sales #64;abtechSPAM BLOCKER.com Note: This info ...
Model Quest Market Miner Ab Tech Corporation ModelQuest MarketMiner AbTech Corporation http://www.abtech.com Contributed by: Contributed by Christine Gresser, sales #64;abtechSPAM BLOCKER.com Note: This ...
Monarch Software You need information but how do you get at it? As a professional working in today's competitive world, you'll be very aware of the importance of concise and relevant ...
Most Popular Data Mining Software Most Popular Data Mining Software Surveys conducted by Nuggets and Analytics have asked people involved in data mining what software they use. While it's not necessarily ...
NULL All text removed
Neural Net And Genetic Based DMSoftware Neural net and genetic based DM software http://www.altern.org/pmsi/home gb.htm Lots of classification/prediction/time series tutorials working demos. Contributed ...
Nuggets TM Nuggets(TM) http://www.Data mine.com Nuggets uses proprietary search algorithms called SiftAgents(TM) to develop English "if then" rules. These algorithms use genetic ...
ODBCMINE ODBCMINE http://www.intsysr.com/odbcmine.htm ODBCMINE analyzes ODBC data sources using the C4.5 algorithm, and outputs graphical decision trees in Scalable Vector ...
Old List Of Data Mining Software For up to date listings, see AllDataMiningSoftware. The table below lists all data mining software whose details have not been checked. You can sort the table below ...
Oracle Con Text Option Technical Oracle ConText Option technical http://technet.oracle.com/doc/context1x/CO11QCK/ch1.htm Oracle ConText Option is an option to Oracle, providing powerful search ...
Oracle Context Option Oracle Context Option http://technet.oracle.com/doc/context200/CO20APP/intro.htm This chapter provides an overview of the Oracle ConText Option. Contributed by ...
Orchestrate Software Orchestrate http://www.torrent.com Torrent’s Orchestrate simplifies and accelerates the development, deployment, and management of enterprise scale ...
Partek Software Partek http://www.partek.com Software for data mining and knowledge discovery based on statistical methods, data visualization, neural networks, fuzzy logic and genetic ...
Piping Systems Fluid Flow Software Piping Systems Fluid Flow Software http://www.fluidflowinfo.com Piping Systems Fluid Flow has been developed to provide the engineer with a total working environment ...
Pmsi New URL pmsi new URL pmsi.nfrance.com Thanks for updating for your page ! Contributed by: Note: This info converted from the original "The Data Mine" pages and pre ...
Prediction Works PredictionWorks http://www.predictionworks.com/analyze/ A free on line data mining service for smaller files. The service automatically tests several algorithms including ...
Pv Wave PV WAVE http://www.vni.com PV WAVE is a Rapid Application Development Environment for the visualization and analysis of data. Note: This info converted from the ...
QTMSQuantitative Target Marketing System QTMS : Quantitative Target Marketing System http://www.multivariate.com An expert system of multivariate modeling that highlights a new technique called " All Possible ...
QYield Software Q YIELD http://www.quadrillion.com/ Software for data mining semiconductor fab production data to determine possible production problems.
Real Time Stock Market Predictions From Textual News real time stock market predictions from textual news http://www.cs.ust.hk/~beat/Predict Beat Wuthrich beat #64;csSPAM BLOCKER.ust.hk for more info see www.cs.ust ...
Recent Contributions Data Mining Software Data Mining Software The information about the packages on this page has been taken from README files, and other information provided on the web ...
Ro CRobust Bayesian Classifier RoC (Robust Bayesian Classifier) http://kmi.open.ac.uk/projects/bkd/ RoC is a Bayesian supervised classifier able to handle incomplete databases with no assumption ...
Rosetta Toolkit Rosetta A Rough Set Toolkit for Analysis of Data http://www.idi.ntnu.no/~aleks/rosetta/ Contributed by: Note: This info converted from the original "The Data ...
SASInstitute Launches Enterprise Miner Software SAS Institute Launches Enterprise Miner software http://www.sas.com/software/data mining/ Commercial The respected French Analysts, Yphise have evaluated the Enterprise ...
Sav ZServer Sav Z Server http://sites.netscape.net/savtechno/ Sav Z (Web Data) Server is a Web based object relational database server implemented in JavaTM. Server generates ...
Sector Computing SOLAPOn The Web Sector Computing's OLAP On The Web http://ourworld.compuserve.com/homepages/SMGSecor/BI/Biindex.htm Commercial Contributed by: Note: This info converted from ...
See Five Software C5.0 / See5 http://www.rulequest.com Contributed by: quinlan #64;rulequestSPAM BLOCKER.com Note: This info converted from the original "The Data Mine" pages and ...
Set Enumeration Learn SE Learn http://www.isp.pitt.edu/~rymon/SE Learn.html An SE tree based induction and classification tool. Set Enumeration (SE) trees provide the basis for an induction ...
Silicon Graphics Mine Set Data Mining Silicon Graphics MineSet Data Mining http://www.sgi.com/Products/software/MineSet/ Commercial Commercial Software for data mining. Contributed by: Note: This ...
Sipina Pro Sipina W v2.0 and Sipina Pro http://eric.univ lyon2.fr/~ricco/sipina.html SIPINA W is a software for Knowledge Discovery in Databases. This version v2.0 contains ...
Snob Software Snob http://www.cs.monash.edu.au/~dld/Snob.html Snob (Wallace and Boulton, 1968) was probably the first (Bayesian) program to do clustering (or unsupervised learning ...
Software Form Name Type Size Values Tooltip message Name text 64 The name of the program or package Brief Summary text 64 Short summary ...
Source Forge HonweiMo 02 Nov 2003
Sphinx Vision By ASOC sphinxVision by ASOC http://www.asoc.com SOM neural network Contributed by: hans peter.neeb #64;ffm2SPAM BLOCKER.siemens.de Note: This info converted from the ...
Stat Soft KyleMiller 26 Jan 2009 StatSoft, Inc. was founded in 1984 and is now one of the largest global providers of analytic software worldwide. StatSoft is also the largest ...
Stat Soft Inc StatSoft, Inc. http://www.statsoft.com StatSoft, Inc., founded in 1984, is now one of the largest developers of enterprise and single user software for data analysis ...
Stat Soft STATISTICAData Mining StatSoft STATISTICA Data Mining http://www.statsoft.com/datamining.html Contributed by StatSoft.com info@statsoft.com Contributed by: Note: This info converted ...
Super Query SuperQuery http://www.azmy.com SuperQuery: A Database analysis software that has a knowledge discovery engine. You can download Free Trial version. You can also ...
Svm Light SVMlight is an implementation of Vapnik's Support Vector Machine Vapnik, 1995 for the problem of pattern recognition, for the problem of regression, and for the ...
Synthetic Classification Data Sets Synthetic Classification Data Sets program SCDS has been renamed DatGen
TMiner Personal Edition TMiner Personal Edition http://frontdb.ugr.es Free Java Data Mining software downloadable from http://frontdb.ugr.es (Research section). TMiner collects some algorithms ...
Test Add Software To Listing Add new software package to The Data Mine: "}% Click on "Software" on top bar to come back once you've done this.'}% %IF{"context authenticated" then ' ...
Tetralogie Software Tetralogie http://atlas.irit.fr Techniques and Technologies for Information Retrieval and Resource Discovery, Contributed by: Taoufik Dkaki , Bernard Dousset, Said ...
Text Analyst TextAnalyst http://www.megaputer.com TextAnalyst performs semantic analysis of texts in an arbitrary application domain. It is based on proprietary neural net technology ...
The Data Mining Suite The Data Mining Suite http://www.datamining.com Contributed by: The Data Mining Suite http://www.datamining.com Contributed by: Contributed by: datamine ...
The Knowledge Access Suite The Knowledge Access Suite http://www.datamining.com Contributed by: Note: This info converted from the original "The Data Mine" pages and pre dates June 2001 ...
The Knowledge Access Suite And The Data Mining Suite The Knowledge Access Suite and The Data Mining Suite (Information Discovery, Inc.) http://www.datamining.com/ The Knowledge Access Suite™ has delivered the first ...
Thinkbase Data Mining Product Thinkbase's Data Mining Product http://www.ThinkBase.com/ Note: This info converted from the original "The Data Mine" pages and pre dates June 2001. Please remove ...
Thinking Machine Data Mining Product Thinking Machine's Data Mining Product http://www.think.com/html/products/products.htm It includes Neural Networks, Classification and Regression Trees (CART), ...
Ti MBLTilburg Memory Based Learner TiMBL Tilburg Memory Based Learner http://ilk.kub.nl/software.html Contributed by: Jakub Zavrel Note: This info converted from the original "The Data Mine" pages ...
Tooldiag Software Tooldiag http://documents.cfar.umd.edu/resources/source/tooldiag.html A software toolbox for the analysis of multidimensional data. C source and documentation included ...
Visua Links VisuaLinks http://www.visualanalytics.com VisuaLinks is state of the art Java technology supporting link analyses and data visualization. VisuaLinks uses an intuitive ...
Visual Text VisualText http://www.textai.com VisualText is a comprehensive GUI development environment for creating text analyzers. Resulting analyzers can run as C executables ...
Web Atom TWiki's Software web
Web Changes
Web Create New Topic
Web Index
Web Left Bar
Web Mining web mining Date: Location: Final Date For Submissions: Contributed by: Note: This info converted from the original "The Data Mine" pages and pre dates ...
Web Notify
Web Preferences Software Web Preferences The following settings are web preferences of the Software web. These preferences overwrite the site level preferences in . and ...
Web Right Bar " warn "off"}% Web List of Software Most Popular Software Add Software to List Data Mining /$name ...
Web Rss " else "TWiki's Software web"}% /Software The Documentation Web of TWiki. TWiki is an Enterprise Collaboration Platform.
Web Search
Web Search Advanced
Web Statistics Statistics for Software Web Month: Topic views: Topic saves: File uploads: Most popular topic views: Top contributors for topic save and ...
Web Topic List
Weka Software N.B. see page " Weka " to edit this information
Win Viz WinViz http://www.iti.gov.sg/iti RnD/infosheet/is/winviz.html WinViz is a Visual Data Analysis tool designed to complement spreadsheets, databases, executive information ...
Winrosa Software WINROSA http://www.mitgmbh.de WINROSA is a software tool which generates automatically Fuzzy If Then Rules from your data. The generated data set can be run by most ...
Wiz Rule For Windows WizRule for Windows http://www.wizsoft.com Discovers rules and identified exceptions to those rules. A demo version of the software is available online. Note ...
Wiz Why WizWhy http://www.wizsoft.com WizWhy reveals all if then rules (with no limit as to the number of clauses) and mathematical formula rules, and predicts the value of ...
Wonder Owl Wonder Owl Commercial Wonder Owl is the leading data mining and personalization package for managers and business people. Simple and intuitive to use, yet powerful ...
Xmdv Tool XmdvTool http://wwwcip.informatik.uni erlangen.de/user/tntimm/XmdvTool.html The XmdvTool allows users to visually explore multivariate data in a variety of methods ...
Xpert Rule XpertRule http://www.attar.com Data Mining using high performance parallel SQL technology Knowledge Induction can be achieved by a Windows PC client being able to ...
Yphise Software Evaluation Reports Yphise Software Evaluation Reports http://www.yphise.com Commercial Yphise provides software evaluation of interest to IT managers. Yphise software evaluation Report ...

Data Mining Software in Java

Weka 3: Data Mining Software in Java

Weka is a collection of machine learning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. It is also well-suited for developing new machine learning schemes.

Weka is open source software issued under the GNU General Public License.

Downloading and installing Weka

  • Snapshots

    Every night a snapshot of the Subversion repository is taken, compiled and put together in ZIP files. For those who want to have the latest bugfixes, they can download these snapshots here.

  • Stable GUI version

    Weka 3.6 is the latest stable version of Weka. There are different options for downloading and installing it on your system:

    • Windows

      Click here to download a self-extracting executable that includes Java VM 1.5
      (weka-3-6-1jre.exe; 35,086,636 bytes)

      Click here to download a self-extracting executable without the Java VM
      (weka-3-6-1.exe; 18,727,059 bytes)

      These executables will install Weka in your Program Menu. Download the second version if you already have Java 1.5 (or later) on your system.

    • Mac OS X

      Click here to download a disk image for OS X
      (weka-3-6-1.dmg; 24,743,737 bytes)

    • Other platforms (Linux, etc.)

      Click here to download a zip archive containing Weka
      (weka-3-6-1.zip; 18,926,302 bytes)

      First unzip the zip file. This will create a new directory called weka-3-6-1. To run Weka, change into that directory and type

      java -jar weka.jar

      Note that Java needs to be installed on your system for this to work. Also note, that using -jar will override your current CLASSPATH variable and only use the weka.jar.

  • Book version

    Weka 3.4 is a stable version of Weka, and the one described in the data mining book.

    • Windows

      Click here to download a self-extracting executable that includes Java VM 1.4
      (weka-3-4-15jre.exe; 24,650,153 bytes)

      Click here to download a self-extracting executable without the Java VM
      (weka-3-4-15.exe; 10,534,837 bytes)

      These executables will install Weka in your Program Menu. Download the second version if you already have Java 1.4 (or later) on your system.

    • Mac OS X

      Click here to download a disk image for OS X
      (weka-3-4-15.dmg; 13,792,534 bytes)

    • Other platforms (Linux, etc.)

      Click here to download a zip archive containing Weka
      (weka-3-4-15.zip; 10,628,039 bytes)

      First unzip the zip file. This will create a new directory called weka-3-4-15. To run Weka, change into that directory and type

      java -jar weka.jar

      Note that Java needs to be installed on your system for this to work. Also note, that using -jar will override your current CLASSPATH variable and only use the weka.jar.

  • Developer version

    • Windows

      Click here to download a self-extracting executable that includes Java VM 5.0
      (weka-3-7-0jre.exe; 35,767,243 bytes)

      Click here to download a self-extracting executable without the Java VM
      (weka-3-7-0.exe; 19,407,663 bytes)

      These executables will install Weka in your Program Menu. Download the second version if you already have Java 5.0 (or later) on your system.

    • Mac OS X

      Click here to download a disk image for OS X
      (weka-3-7-0.dmg; 25,720,977 bytes)

    • Other platforms (Linux, etc.)

      Click here to download a zip archive containing Weka
      (weka-3-7-0.zip; 19,619,175 bytes)

      First unzip the zip file. This will create a new directory called weka-3-7-0. To run Weka, change into that directory and type

      java -jar weka.jar

      Note that Java needs to be installed on your system for this to work. Also note, that using -jar will override your current CLASSPATH variable and only use the weka.jar.

  • Old versions

    All old versions of Weka are available from the Sourceforge website.

Tuesday, July 28, 2009

What is SQL Injection?

A SQL Injection vulnerability is a type of security hole that is found in a multi-tiered application; it is where an attacker can trick a database server into running an arbitrary, unauthorized, unintended SQL query by piggybacking extra SQL elements on top of an existing, predefined query that was intended to be executed by the application. The application, which is generally, but not necessarily, a web application, accepts user input and embeds this input inside an SQL query. This query is sent to the application’s database server where it is executed. By providing certain malformed input, an attacker can manipulate the SQL query in such a way that its execution will have unintended consequences.

This all sounds like a bit of a mouthful – one that can be more easily swallowed with an example.
Consider an online bookstore. This bookstore’s web server allows users to search for books by a given author. This search facility is implemented by querying a list of book titles in a backend database server limited by who (or what) the user enters as the author. This search functionality happens to be vulnerable to SQL injection. Instead of returning a list of book titles, by exploiting the vulnerability an attacker can trick the application into returning a list of all usernames, passwords, email addresses and credit card numbers of everyone that has ever used the bookstore. This scenario is not sensationalist - it's an easy attack that can be leveled against a SQL injection vulnerability.
In terms of risk, SQL injection is right up there at the top with problems like buffer overflows. I recently had an argument with a large database software vendor about this. They rated SQL injection problems as a low risk issue and were greatly more concerned with buffer overflows in their RDBMS offering. I pointed out to them that to exploit a buffer overflow in the database server one must first get past the organization’s firewall. A nice easy way of doing this is through SQL injection: the firewall must allow inbound connections to the web server from the Internet so customers can access the web application; and if the application is vulnerable to SQL injection then the attacker can get access to the database and exploit the buffer overflow vulnerability. But here’s the kicker – why would an attacker bother exploiting the buffer overflow vulnerability if they can arbitrarily gain access to the data in the database anyway? Since this discussion, I’m glad to report that the large database software vendor has since “upgraded” the status of SQL injection vulnerabilities and give them the respect they’re due. In terms of prevalence, 6 in 10 web applications that connect to a database server are vulnerable to SQL injection. This statistic is based upon the number of new applications that were found to be vulnerable when performing security assessments for clients over 2003/2004. This is shockingly high.
SQL injection is vendor agnostic: it doesn’t matter whether the application is running Oracle, SQL Server, DB2, MySQL or Informix on Active/Java Server Pages, Cold Fusion Management, PHP or Perl – it can be vulnerable to SQL injection – though, as we’ll see later, some are more at risk that others.