- Interview: Kirk Borne, Data Scientist, GMU on Big Data in Astrophysics and Correlation vs. Causality - May 30, 2014.
We discuss how to build the best data models, significance of correlation and causality in Predictive Analytics, and impact of Big Data on Astrophysics.
Correlation, Interview, Kirk D. Borne, Predictive Analytics, Recommendations
- Vowpal Wabbit: Fast Learning on Big Data - May 26, 2014.
Vowpal Wabbit is a fast out-of-core machine learning system, which can learn from huge, terascale datasets faster than any other current algorithm. We also explain the cute name.
Fast Learning, John Langford, Machine Learning, Microsoft, Vowpal Wabbit
- Where to Learn Deep Learning – Courses, Tutorials, Software - May 26, 2014.
Deep Learning is a very hot Machine Learning techniques which has been achieving remarkable results recently. We give a list of free resources for learning and using Deep Learning.
Andrew Ng, Deep Learning, Geoff Hinton, Machine Learning, Yann LeCun
- Interview: Richard Wendell, VP, Data Science, TE Connectivity on Strategy for Analytics Projects - May 23, 2014.
We discuss the last mile of the execution path of Analytics projects, five critical pillars of success and data-driven decision making through advanced analytics.
Advanced Analytics, Big Data Strategy, Project Fail, Richard Wendell, TE Connectivity
- Stacking the Deck: The Next Wave of Opportunity in Big Data - May 20, 2014.
A leading venture capitalist explains why Big Data infrastructure market is mostly mature and where lies the next big area of opportunities related to Big Data.
Chip Hazard, Full Stack Analytics, Machine Learning, Network Effects, Startups, VC
- Exclusive: Tamr at the New Frontier of Big Data Curation - May 19, 2014.
Our exclusive profile of Tamr (former Data Tamer), the latest startup from legendary Michael Stonebraker, which emerged from stealth mode to address the new field of Big Data Curation.
Andy Palmer, Data Curation, Machine Learning, Michael Brodie, Michael Stonebraker, Startups, Tamr
- Poll Results: Data Types/Sources Analyzed - May 17, 2014.
Trends in data sources for data mining include: table data dominates, followed by time series and text; audio, JSON grows in popularity, while itemsets decline; 70% access DB engines, but only 20% access NoSQL stores; Hadoop, MongoDB used more for text; Europe is lagging in NoSQL usage.
Data types, Hadoop, NoSQL, Poll, Relational Databases
- Predict Soccer World Cup 2014 Winner, Get Prizes from RapidMiner - May 16, 2014.
Use a free edition of RapidMiner to have fun and bring sports predictions to another level by making a prediction of Soccer (Futbol) World Cup 2014, which starts on June 12 in Brazil.
Boston-MA, Brazil, Competition, RapidMiner, Soccer, World Cup
- Big Data Landscape, v 3.0, analyzed - May 15, 2014.
We analyze the Big Data Landscape and identify the most popular market segments in Analytics, Infrastructure, Applications, Open Source, and Data Sources categories. It is still early - only 4.5% of companies had exits.
Big Data, Big Data Analytics, Data Platform, Infrastructure, Landscape, Open Source, Startups
- Guide to Data Science Cheat Sheets - May 12, 2014.
Selection of the most useful Data Science cheat sheets, covering SQL, Python (including NumPy, SciPy and Pandas), R (including Regression, Time Series, Data Mining), MATLAB, and more.
Cheat Sheet, Data Science, Python, R, SQL
- Cartoon: Data Visualization meets 3-D Printer - May 11, 2014.
New KDnuggets Cartoon looks at what happens when Data Visualization meets 3-D Printer.
3-D Printing, Cartoon, Data Visualization
- Did Target Really Predict a Teen’s Pregnancy? The Inside Story - May 7, 2014.
We examine the origin and the facts behind this explosive story, the importance of headlines, and how unsubstantiated assumptions gain traction and mainstream attention and help create myths around Predictive Analytics.
Book, Charles Duhigg, Eric Siegel, Predictive Analytics, Pregnancy, Target
- JMP White Paper: Advantages of Bootstrap Forest for Yield Analysis - May 7, 2014.
This white paper highlights practical examples on how to use partitioning techniques for semiconductor manufacturing data. These methods also have wider applicability.
Bootstrap Forests, JMP, White Paper
- Poincare Conjecture, Perelman way, and Topology of social networks - May 3, 2014.
We examine the connections between the $1 million proof of Poincare conjecture by a reclusive math genius and the topological behavior and information diffusion over social networks.
Mathematics, Social Networks, Topology