R, Python Duel As Top Analytics, Data Science software – KDnuggets 2016 Software Poll Results
R remains the leading tool, with 49% share, but Python grows faster and almost catches up to R. RapidMiner remains the most popular general Data Science platform. Big Data tools used by almost 40%, and Deep Learning usage doubles.
Full Results and 3-year trends
The following table shows the poll results in detail, excluding Deep Learning tools for which 3 year results are not available.% alone is the percent of tool voters used only that tool alone, shown only for tools that have 5% or such votes. For example, 11.4% of RapidMiner users have used only Rapidminer.
What Analytics, Big Data, Data mining, Data Science software you used in the past 12 months for a real project? [2895 voters] | |
Legend: red: Free/Open Source tools
green: Commercial tools Fuchsia: Hadoop/Big Data tools |
% users in 2016
% users in 2015 % users in 2014 |
R (1419) | |
Python (1325) | |
SQL (1029) | |
Excel (972) | |
RapidMiner (944), 11.7 % alone | |
Hadoop (641) | |
Spark (624) | |
Tableau (536) | |
KNIME (521) | |
scikit-learn (497) | na |
Java (487) | na |
Anaconda (462) | na
na |
Hive (359) | na |
MLlib (337) | |
Weka (315) | |
Microsoft SQL Server (314) | |
Unix shell/awk/gawk (301) | |
MATLAB (263) | |
IBM SPSS Statistics (242) | |
Dataiku (227), 18.1 % alone | na |
SAS base (225) | |
IBM SPSS Modeler (222) | |
SQL on Hadoop tools (211) | na |
C/C++ (210) | na |
Other free analytics/data mining tools (198) | |
Other programming and data languages (197) | |
H2O (193) | |
Scala (180) | na |
SAS Enterprise Miner (162) | |
Microsoft Power BI (161) | na |
HBase (158) | na |
QlikView (153) | |
Microsoft Azure Machine Learning (147) | na |
Other Hadoop/HDFS-based tools (141) | |
Apache Pig (132) | |
IBM Watson (121) | na |
Rattle (103) | |
Salford SPM/CART/Random Forests/MARS/TreeNet (100), 63.0 % alone | |
Gnu Octave (89) | |
Orange (89) | |
Alteryx (87) | |
RapidInsight/Veera (87), 51.7 % alone | |
TIBCO Spotfire (80) | |
Apache Mahout (74) | |
Other paid analytics/data mining/data science software (71) | |
Dato (69) | |
Pentaho (68) | |
Perl (67) | |
IBM Cognos (64) | |
Splunk/ Hunk (63) | |
JMP (58) | |
C4.5/C5.0/See5 (58) | |
Amazon Machine Learning (55) | na |
Mathematica (53) | |
Microsoft other ML/Data Science tools (46) | na
na |
Vowpal Wabbit (45) | na |
Microstrategy (45) | na |
SAP Analytics (42) | |
Stata (39) | |
Dell/StatSoft (36), 8.3 % alone | |
XLMiner (35) | na
na |
SAP HANA (35) | na
na |
Julia (32) | |
Oracle Adv. Analytics (31) | |
BigML (25), 16.0 % alone | |
Zementis (25) | |
BayesiaLab (18) | |
Alpine Data Labs (16), 12.5 % alone | |
DataRobot (15), 6.7 % alone | na
na |
Datameer (13), 7.7 % alone | |
Lavastorm (12) | |
F# (11) | |
Clojure (11) | |
Actian (10) | |
WordStat (10) | |
Ayasdi (9) | na |
Skytree (8) | na |
Lisp (7) | |
Ontotext GraphDB (6) | na |
SiSense (5) | |
Birst (5) | na |
FICO Model Builder (5) | |
WPS World Programming System (4) | |
Angoss (3) | |
Predixion Software (2) |
Additional tools not included but mentioned in the comments include
- XLSTAT
- BeyondCore
- Timi and Anatella
- SAS/STAT
- Domino Data Lab
- MapR
- Neural Designer
- Javascript
- R leads RapidMiner, Python catches up, Big Data tools grow, Spark ignites, 2015
- KDnuggets 15th Annual Analytics, Data Mining, Data Science Software Poll: RapidMiner Continues To Lead, 2014
- KDnuggets 2013 Software Poll: RapidMiner and R vie for first place.
- KDnuggets 2012 Poll: Analytics, Data mining, Big Data software used
- KDnuggets 2011 Poll: Data Mining/Analytic Tools Used
- KDnuggets 2010 Poll: Data Mining / Analytic Tools Used
- KDnuggets 2009 Poll: Data Mining Tools Used
- KDnuggets 2008 Poll: Data Mining Software Used
- KDnuggets 2007 Poll: Data Mining/Analytics Software Tools