Interview: Kirk Borne, Data Scientist, GMU on Big Data in Astrophysics and Correlation vs. Causality
We discuss how to build the best data models, significance of correlation and causality in Predictive Analytics, and impact of Big Data on Astrophysics.
on May 30, 2014 in Correlation, Interview, Kirk D. Borne, Predictive Analytics, Recommendations
Vowpal Wabbit: Fast Learning on Big Data
Vowpal Wabbit is a fast out-of-core machine learning system, which can learn from huge, terascale datasets faster than any other current algorithm. We also explain the cute name.
on May 26, 2014 in Fast Learning, John Langford, Machine Learning, Microsoft, Vowpal Wabbit
Where to Learn Deep Learning – Courses, Tutorials, Software
Deep Learning is a very hot Machine Learning techniques which has been achieving remarkable results recently. We give a list of free resources for learning and using Deep Learning.
on May 26, 2014 in Andrew Ng, Deep Learning, Geoff Hinton, Machine Learning, Yann LeCun
Interview: Richard Wendell, VP, Data Science, TE Connectivity on Strategy for Analytics Projects
We discuss the last mile of the execution path of Analytics projects, five critical pillars of success and data-driven decision making through advanced analytics.
on May 23, 2014 in Advanced Analytics, Big Data Strategy, Project Fail, Richard Wendell, TE Connectivity
Stacking the Deck: The Next Wave of Opportunity in Big Data
A leading venture capitalist explains why Big Data infrastructure market is mostly mature and where lies the next big area of opportunities related to Big Data.
on May 20, 2014 in Chip Hazard, Full Stack Analytics, Machine Learning, Network Effects, Startups, VC
Exclusive: Tamr at the New Frontier of Big Data Curation
Our exclusive profile of Tamr (former Data Tamer), the latest startup from legendary Michael Stonebraker, which emerged from stealth mode to address the new field of Big Data Curation.
on May 19, 2014 in Andy Palmer, Data Curation, Machine Learning, Michael Brodie, Michael Stonebraker, Startups, Tamr
Poll Results: Data Types/Sources Analyzed
Trends in data sources for data mining include: table data dominates, followed by time series and text; audio, JSON grows in popularity, while itemsets decline; 70% access DB engines, but only 20% access NoSQL stores; Hadoop, MongoDB used more for text; Europe is lagging in NoSQL usage.
on May 17, 2014 in Data types, Hadoop, NoSQL, Poll, Relational Databases
Predict Soccer World Cup 2014 Winner, Get Prizes from RapidMiner
Use a free edition of RapidMiner to have fun and bring sports predictions to another level by making a prediction of Soccer (Futbol) World Cup 2014, which starts on June 12 in Brazil.
on May 16, 2014 in Boston-MA, Brazil, Competition, RapidMiner, Soccer, World Cup
Big Data Landscape, v 3.0, analyzed
We analyze the Big Data Landscape and identify the most popular market segments in Analytics, Infrastructure, Applications, Open Source, and Data Sources categories. It is still early - only 4.5% of companies had exits.
on May 15, 2014 in Big Data, Big Data Analytics, Data Platform, Infrastructure, Landscape, Open Source, Startups
Guide to Data Science Cheat Sheets
Selection of the most useful Data Science cheat sheets, covering SQL, Python (including NumPy, SciPy and Pandas), R (including Regression, Time Series, Data Mining), MATLAB, and more.
on May 12, 2014 in Cheat Sheet, Data Science, Python, R, SQL
Cartoon: Data Visualization meets 3-D Printer
New KDnuggets Cartoon looks at what happens when Data Visualization meets 3-D Printer.
on May 11, 2014 in 3-D Printing, Cartoon, Data Visualization
Did Target Really Predict a Teen’s Pregnancy? The Inside Story
We examine the origin and the facts behind this explosive story, the importance of headlines, and how unsubstantiated assumptions gain traction and mainstream attention and help create myths around Predictive Analytics.
on May 7, 2014 in Book, Charles Duhigg, Eric Siegel, Predictive Analytics, Pregnancy, Target
JMP White Paper: Advantages of Bootstrap Forest for Yield Analysis
This white paper highlights practical examples on how to use partitioning techniques for semiconductor manufacturing data. These methods also have wider applicability.
on May 7, 2014 in Bootstrap Forests, JMP, White Paper
Poincare Conjecture, Perelman way, and Topology of social networks
We examine the connections between the $1 million proof of Poincare conjecture by a reclusive math genius and the topological behavior and information diffusion over social networks.
on May 3, 2014 in Mathematics, Social Networks, Topology
|