GigaOm Data Challenge, sponsored by GigaOm, Kaggle and Splunk: predict which people will "like" which blog posts from across 90k active blogs on WordPress.com. The Splunk platform is available to competitors.
... Margit Zwemer (@MPZwemer), Data Scientist and Evil Genius in Residence at Kaggle, was excited to use Splunk for her data science endeavors: "I'm always excited to use something new, and with Splunk it's amazing how I can get in and start looking at the data right away. It's a huge time savings."
So we decided to partner with Kaggle to expose Splunk to data scientists. This morning at (#structureconf), a conference aimed at managing large-scale cloud infrastructures, GigaOm, Kaggle and Splunk announced the GigaOm Data Challenge, which uses data from WordPress.com.
The goal of the competition is to predict which people will "like" which blog posts from across 90k active blogs on WordPress.com. (They host 74 million WordPress sites globally, over 16% of all domains on the web.) The winning solutions may be used by WordPress.com in a recommendation engine, and they must be open-sourced, so anyone can use them to solve a similar problem with similar data in a similar domain.
For Splunk's part, we're making the Splunk platform available for the GigaOm Data Challenge. We believe that understanding the connections and trends in your data can open up all sorts of new discoveries (#datajourney), and we are excited to work with data scientists across the globe to uncover the patterns and intelligence in massive data sets across different domains. There's a prize earmarked solely for the best use of Splunk, so we hope to see many folks participating in the contest!