|
What programming/statistics languages you used for an analytics / data mining / data science work in 2013? [713 votes total] |
|
| R (434 voters in 2013) |
|
| Python (277) |
|
| SQL (261) |
|
| SAS (148) |
|
| Java (118) |
|
| MATLAB (89) |
|
| High-level data mining suite (80) |
not asked in 2012 |
| Unix shell/awk/sed (79) |
|
| C/C++ (66) |
|
| Pig Latin/Hive/other Hadoop-based languages (57) |
|
| Other low-level language (42) |
|
| GNU Octave (40) |
|
| Perl (32) |
|
| Ruby (16) |
|
| Scala (16) |
|
| F# (12) |
not asked in 2012 |
| Lisp/Clojure (7) |
|
| Julia (5) |
|
| None (2) |
|
Comments
A number of comments, such as one below, pointed that SPSS also has its own language similar to SAS - will include it in the next poll.
Ralph Winters, SPSS Language
It seems odd to exclude SPSS based upon a definition of what is or what is not language. Especially for a language which has such legacy roots, and is backed by IBM. I could argue that both Matlab and R are both not true progamming language, and SAS, as flexible as it is, I would not consider a standarized programming language as well.
Comparing with a similar KDnuggets Poll in Aug 2012:
What programming/statistics languages you used for analytics / data mining in the past 12 months?
the language with the highest growth was Julia, which doubled in popularity (but still was used only by 0.7% in 2013).
Among more common languages, the largest relative increases in share of usage were for
- Pig Latin/Hive/other Hadoop-based languages, 19% growth, from 6.7% in 2012 to 8.0% in 2013
- R, 16% growth
- SQL, 14% growth (perhaps the result of increasing number of SQL interfaces to Hadoop and other Big Data systems?)
- Lisp/Clojure, 77% down
- Perl, 50% down
- Ruby, 41% down
- C/C++, 35% down
- Unix shell/awk/sed, 25% down
- Java, 22% down
- US/Canada, 50.8%,
- Europe: 25.7%,
- Asia: 11.8%,
- Latin America: 6.7%,
- AU/NZ: 3.2%,
- Africa/Middle East: 1.5%