Most Popular Slideshare Presentations on Big Data
Hadoop, the cloud, and Microsoft Azure are just a few of the many topics covered by the top Big Data SlideShare presentations retrieved from the SlideShare API.
SlideShare is a platform for uploading, annotating, sharing, and commenting on slide-based presentations. The platform has been around for some time, and has accumulated a great wealth of presentations on technical topics like Big Data.
Figure 1: SlideShare Big Data Tag CloudThis visualization of the tags associated with the tag “big data” on SlideShare shows how integral Hadoop is and the rising importance of the cloud for big data.
Today, we will look at some of these top Big Data presentations found on SlideShare. Similar to how we collected data for the previous article, these presentations were retrieved by using a Python script and the Slideshare search_slideshow API, and then hand-curated to select the best, most relevant presentations. The slideshows and their associated metrics are shown below:
Looking at the data quickly, we have averages of about 5300 views, 200 downloads, and 8 favorites. Keep in mind, this is with the presentations with downloads disabled being filtered out. This is impressive – compared to the data science figures of 20000 views and 165 downloads, there are many more downloads/view on these big data slideshows. Building a Business on Hadoop, HBase, and Open Source Distributed Computing, a very comprehensive presentation on Visible Technology’s big data stack, alone accounts for 1770 downloads.
Looking at the authors in this table, we see that there are exactly two users with more than one presentation in this list: msitpro and AsterData. The presentations by msitpro cover the big data use cases of Microsoft Azure. AsterData, on the other hand, covers topics involving the use of big data in business decisions.
Figure 2: Big Data Presentation Views/Week vs. Publication DateThis chart shows the number of views/week presentations received against their publication date. This shows that recent publications and older publications receive similar traffic over time, and over time there are particular presentations that receive exceptionally large numbers of views across the entire time range. One other interesting property of the top big data presentations is that they are no older than 2009, even though SlideShare has been around since its inception in 2006.