The previous KDnuggets Poll asked What programming languages you used for data mining / data analysis in the past 12 months?
Here are the results, based on 570 voters:
R (257) | ![]() |
SQL (184) | ![]() |
Python (140) | ![]() |
Java (139) | ![]() |
SAS (121) | ![]() |
MATLAB (83) | ![]() |
C/C++ (73) | ![]() |
Unix shell/awk/gawk/sed (59) | ![]() |
Perl (45) | ![]() |
Hadoop/Pig/Hive (35) | ![]() |
Lisp (4) | ![]() |
Other (70) | ![]() |
None (7) | ![]() |
Notes
Among the top 5 languages, only about 15-25% were used alone.
- 43.2% of voters used 1 language
- 24.9% used 2 languages
- 17.4% used 3 languages
- 8.4% used 4 languages
- 6.1% used 5 or more languages
The breakdown by region is:
- US/Canada, 42%
- Europe, 30%
- Asia, 16%
- Latin America, 4.9%
- AU/NZ, 2.8%
- Africa/MidEast, 3.0%