Most Popular Slideshare Presentations on Data Science

Top SlideShare data science presentations provide a unique view on topics like data science management, using Python and NumPy in your data science project, and leveraging data science for enterprise big data.

By Grant Marshall, Nov 2014.

Slideshare is a platform for uploading, annotating, sharing, and commenting on slide-based presentations. The platform has been around for some time, and has accumulated a great wealth of presentations on technical topics like Data Science.

SlideShare Data Science Wordle

Figure 1: Tags related to Data Science on SlideShare

This visualization of the tags associated with the tag “data science” on SlideShare illustrates the essential interactions between data science, big data, and open datasets.

Today, we will look at some of these top "Data Science" presentations found on SlideShare. Similar to how we collected data for the previous post, these presentations were retrieved by using a Python script and the Slideshare search_slideshow API, and then hand-curated to select the best, most relevant presentations. The slideshows and their associated metrics are shown below:

Title By Date Views Down
The Science of Managing Data Scientists katemats 2013-02-26 58219 189 29
Big Data [sorry] & Data Science: What Does a Data Scientist Do? datasciencelondon 2013-01-26 54778 0 158
Myths and Mathemagical Superpowers of Data Scientists davidpittman1 2012-11-14 29485 297 21
A Statistician’s View on Big Data and Data Science (Version 1) kuonen 2013-11-25 25277 0 67
25 Insights about Dunkin’ Drinkers and Starbucks Drinkers TrueLens 2013-08-05 21573 0 34
Intro to Data Science for Enterprise Big Data pacoid 2012-08-20 19807 969 70
Data Tactics Analytics Brown Bag (Aug 22, 2013) rheimann04 2013-08-22 29651 5 0
Innovation at Google: the physics of data [PARC Forum] PARCInc 2010-05-04 20550 0 34
Analytics & Big Data Idiro 2012-10-01 16057 14 2
Data By The People, For The People dtunkelang 2012-11-02 14377 101 13
User Centric Digital Identity, Talk for Computer Science and Telecommunications Board, National Academies Kaliya 2010-09-24 14841 141 5
Data Science: Not Just For Big Data RevolutionAnalytics 2013-11-15 13772 110 7
MSR 2013 Preview tom.zimmermann 2013-04-02 14390 0 1
How The Open Data Community Died - A Warning From The Future countculture 2011-10-21 13849 65 22
OSCON 2013: Using Cascalog to build an app with City of Palo Alto Open Data pacoid 2013-07-29 14911 14 3
A Cabinet Of Web2.0 Scientific Curiosities IanMulvany 2009-09-22 14380 343 22
A Simple Guide To The Analysis Of Social Science Quantitative Data paulbourne01 2010-01-10 14264 375 1
Dataiku - for Data Geek Paris@Criteo - Close the Data Circle Dataiku 2013-11-29 12601 9 0
How to Become a Data Scientist ryanorban 2014-07-02 12312 23 87
Geo Analytics Tutorial - Where 2.0 2011 pskomoroch 2011-04-19 12163 454 29
NumPy and SciPy for Data Mining and Data Analysis Including iPython, SciKits, and matplotlib bytemining 2011-09-11 12618 234 18
Open Access, Open Data. Open Research? CameronNeylon 2009-01-27 12174 291 18

Some quick stats: the average views, downloads, favorites, and comments for a slideshow in this category are approximately 20000, 165, 29, and 2 respectively. Comparing these figures to those of the data science slides, we see lower numbers across the board. This is partially because many big slideshows in with the data science tag were actually more data mining oriented, and subsequently filtered out of this list. By doing this, many of the most viewed slideshows were eliminated.

Looking at the publication dates of these presentations, it’s interesting to note that 2013 is the most commonly occurring date. Considering the fact that SlideShare has been around since its launch eight years ago, this indicates that in recent years, data science has been increasing in popularity, making new presentations more popular.

Views vs. Downloads

Figure 2: Data Science Presentation Downloads vs. Views

Note that most of the presentations with 0 downloads have downloads disabled. There is an interesting lack of correlation between the number of views on a particular presentation and the download count. It seems that there are certain types of presentations that elicit downloads that aren’t necessarily more/less popular than those that do not. It seems instructive presentations, Intro to Data Science for Enterprise Big Data for example, seem to encourage downloads, perhaps for offline studying.