KDnuggets : News : 2002 : n12 : item2    (previous | next)

Features


From: Gregory Piatetsky-Shapiro

Date: 17 June, 2002

Subject: Poll results: Data mining tools you regularly use

The previous KDnuggets Poll asked: Which Data mining tools you regularly use? (multiple choices allowed). Over 500 data miners have taken part, each on average choosing about two tools. Excluding about 20% unresolved domain references, about half of the voters came from .com and .net domains. The remaining domains in order of popularity were edu, uk, nz, fr, de, au, nl, ca, be, jp, it, es, br, sg, ru, pl, dk, cz, se, is, gr, gov, ch, pt, my, in, il, at, ve, us, ua, tw, th, si, org, mx, kr, hu, eg, co, cn, arpa, ar.

This poll is unscientific, and influenced by vendors interested in the results. Also, there is no space to include all the good tools that exist out there. Finally, despite several safeguards that poll software has against multiple voting, some people nevertheless managed to vote multiple time. Those duplicate votes were removed from the final results. Despite all the caveats, I believe that the results are still interesting and approximately reflect the actual usage. The results are:

SPSS Clementine (128)  13%
Weka (101)  10%
SAS (100)  10%
CART/MARS (89)  9%
SPSS/AnswerTree (76)  8%
SAS Enterprise Miner (67)  7%
Other commercial tools (65)  7%
Other free/open-source tools (57)  6%
MATLAB (52)  5%
Microsoft SQLServer/Excel (40)  4%
Insightful Miner (36)  4%
IBM Intelligent Miner (35)  4%
KXEN (35)  4%
C4.5 / C4.8 (29)  3%
Angoss (26)  3%
Megaputer Polyanalyst (10)  1%
Neuralware (8)  1%
Oracle Suite (Darwin) (8)  1%
Quadstone (3)  0.3%
ThinkAnalytics (2)  0.2%
 
Weka, an open-source DM package developed in New Zealand, has clearly gained popularity.

See full results and comments at www.kdnuggets.com/polls/2002/data_mining_tools.htm


KDnuggets : News : 2002 : n12 : item2    (previous | next)

Copyright © 2002 KDnuggets.   Subscribe to KDnuggets News!