| Which methods/algorithms did you use for data analysis in 2011? [311 voters] | |
| Decision Trees/Rules (186) | |
| Regression (180) | |
| Clustering (163) | |
| Statistics (descriptive) (149) | |
| Visualization (119) | |
| Time series/Sequence analysis (92) | |
| Support Vector (SVM) (89) | |
| Association rules (89) | |
| Ensemble methods (88) | |
| Text Mining (86) | |
| Neural Nets (84) | |
| Boosting (73) | |
| Bayesian (68) | |
| Bagging (63) | |
| Factor Analysis (58) | |
| Anomaly/Deviation detection (51) | |
| Social Network Analysis (44) | |
| Survival Analysis (29) | |
| Genetic algorithms (29) | |
| Uplift modeling (15) | |
| Did you use analytics in the cloud, Hadoop, EC2, etc in 2011? | |
| Yes | |
| No | |
| Employment type: | Percent all | Avg Num Algorithms |
| Industry analyst/consultant (172) | 6.3 | |
| Academic researcher (85) | 5.1 | |
| Student (37) | 4.3 | |
| Government/Other (17) | 5.0 |
Regional breakdown is
- US/Canada, 40.2%
- Europe, 37.6%
- Asia, 10.3%
- Latin America, 5.8%
- Africa/Middle East, 3.2%
- Australia/NZ 2.9%
N(Alg,Ind_Gov) / N(Alg,Aca_Stu)Thus algorithm with affinity 1.5 is used 50% more in Industry/Government than by Academic Researchers or students, and the algorithm with affinity 0.6 is used only 60% as much in Industry.
----------------------------------
N(Ind_Gov) / N(Aca_Stu)
The most "industrial" algorithms ( with the highest Industry / Gov "affinity") are:
- Uplift modeling, INF (no academic users)
- Survival Analysis, 2.47
- Regression, 2.00
The most "academic" algorithms ( with the lowest Industry / Gov "affinity") are:
- Genetic algorithms, 0.60
- Support Vector (SVM), 0.66
- Association Rules, 0.83
| Algorithm | Academic/ Student Affinity |
Industry / Gov Affinity |
|---|---|---|
| Uplift modeling | ||
| Survival Analysis | ||
| Regression | ||
| Visualization | ||
| Statistics | ||
| Boosting | ||
| Time series/Sequence analysis | ||
| Bagging | ||
| Factor Analysis | ||
| Anomaly/Deviation detection | ||
| Text Mining | ||
| Decision Trees | ||
| Neural Nets | ||
| Clustering | ||
| Ensemble methods | ||
| Social Network Analysis | 0.93 |
|
| Bayesian | 0.92 |
|
| Association rules | 0.83 |
|
| Support Vector -SVM | 0.66 |
|
| Genetic algorithms | 0.60 |