What was the largest database / dataset you analyzed? [148 votes]|
Comparing the results of 2011 poll with a similar
2010 Poll: Largest Database Data Mined / Analyzed, we see that
median in 2011 is in 10-20 GB range.
while the median dataset in 2010 was in 8-10 GB range.
We observe steady growth of analysts with experience in the upper range of datasets.
In 2011 about 35.4% reported analyzing over databases over 100 GB (vs 32.2% in 2010), and
21.4% - over 1 Terabyte (vs 18.3% in 2010).
Regional breakdown shows that US leads in percent of data miners who worked with terabyte range datasets (about 30%).
(Note: Australia/NZ region not included, since not enough responses were received).
|Region (voters)||Largest Dataset Analyzed (median)||% analyzed TB+ data|
|Latin America (15)
|Africa/Middle East (7)
Here is another breakdown of
Largest Dataset Analyzed
Hélder Quintela, Normal
As it is expected (at least for me) it is almost normal distribution. Very small databases are not much interesting for Analysis and Knowledge Discovery to improve and impact Business, and very large databases are not so much available.
Ajay Ohri, Poll on Database Size
It would be interesting to see interaction effects and co-relation between size of database used and software name.