R, Python Duel As Top Analytics, Data Science software – KDnuggets 2016 Software Poll Results

R remains the leading tool, with 49% share, but Python grows faster and almost catches up to R. RapidMiner remains the most popular general Data Science platform. Big Data tools used by almost 40%, and Deep Learning usage doubles.



Full Results and 3-year trends

The following table shows the poll results in detail, excluding Deep Learning tools for which 3 year results are not available.


% alone is the percent of tool voters used only that tool alone, shown only for tools that have 5% or such votes. For example, 11.4% of RapidMiner users have used only Rapidminer.

What Analytics, Big Data, Data mining, Data Science software you used in the past 12 months for a real project? [2895 voters]
Legend: red: Free/Open Source tools
green: Commercial tools
Fuchsia: Hadoop/Big Data tools
% users in 2016
% users in 2015
% users in 2014
R (1419)
Python (1325)
SQL (1029)
Excel (972)
RapidMiner (944), 11.7 % alone
Hadoop (641)
Spark (624)
Tableau (536)
KNIME (521)
scikit-learn (497) na
Java (487) na
Anaconda (462) na
na
Hive (359) na
MLlib (337)
Weka (315)
Microsoft SQL Server (314)
Unix shell/awk/gawk (301)
MATLAB (263)
IBM SPSS Statistics (242)
Dataiku (227), 18.1 % alone na
SAS base (225)
IBM SPSS Modeler (222)
SQL on Hadoop tools (211) na
C/C++ (210) na
Other free analytics/data mining tools (198)
Other programming and data languages (197)
H2O (193)
Scala (180) na
SAS Enterprise Miner (162)
Microsoft Power BI (161) na
HBase (158) na
QlikView (153)
Microsoft Azure Machine Learning (147) na
Other Hadoop/HDFS-based tools (141)
Apache Pig (132)
IBM Watson (121) na
Rattle (103)
Salford SPM/CART/Random Forests/MARS/TreeNet (100), 63.0 % alone
Gnu Octave (89)
Orange (89)
Alteryx (87)
RapidInsight/Veera (87), 51.7 % alone
TIBCO Spotfire (80)
Apache Mahout (74)
Other paid analytics/data mining/data science software (71)
Dato (69)
Pentaho (68)
Perl (67)
IBM Cognos (64)
Splunk/ Hunk (63)
JMP (58)
C4.5/C5.0/See5 (58)
Amazon Machine Learning (55) na
Mathematica (53)
Microsoft other ML/Data Science tools (46) na
na
Vowpal Wabbit (45) na
Microstrategy (45) na
SAP Analytics (42)
Stata (39)
Dell/StatSoft (36), 8.3 % alone
XLMiner (35) na
na
SAP HANA (35) na
na
Julia (32)
Oracle Adv. Analytics (31)
BigML (25), 16.0 % alone
Zementis (25)
BayesiaLab (18)
Alpine Data Labs (16), 12.5 % alone
DataRobot (15), 6.7 % alone na
na
Datameer (13), 7.7 % alone
Lavastorm (12)
F# (11)
Clojure (11)
Actian (10)
WordStat (10)
Ayasdi (9) na
Skytree (8) na
Lisp (7)
Ontotext GraphDB (6) na
SiSense (5)
Birst (5) na
FICO Model Builder (5)
WPS World Programming System (4)
Angoss (3)
Predixion Software (2)


Additional tools not included but mentioned in the comments include
  • XLSTAT  
  • BeyondCore
  • Timi and Anatella
  • SAS/STAT  
  • Domino Data Lab
  • MapR
  • Neural Designer
  • Javascript
Here are the results of past polls