**Making Predictive Models Robust: Holdout vs Cross-Validation** - Aug 11, 2017.

The validation step helps you find the best parameters for your predictive model and prevent overfitting. We examine pros and cons of two popular validation strategies: the hold-out strategy and k-fold.

Tags: Cross-validation, Dataiku, Overfitting

**Understanding the Bias-Variance Tradeoff: An Overview** - Aug 8, 2016.

A model's ability to minimize bias and minimize variance are often thought of as 2 opposing ends of a spectrum. Being able to understand these two types of errors are critical to diagnosing model results.

Tags: Bias, Cross-validation, Model Performance, Variance

**How to Compute the Statistical Significance of Two Classifiers Performance Difference** - Mar 30, 2016.

To determine whether a result is statistically significant, a researcher would have to calculate a p-value, which is the probability of observing an effect given that the null hypothesis is true. Here we are demonstrating how you can compute difference between two models using it.

Tags: Classifier, Cross-validation, Model Performance

**3 Things About Data Science You Won’t Find In Books** - May 11, 2015.

There are many courses on Data Science that teach the latest logistic regression or deep learning methods, but what happens in practice? Data Scientist shares his main practical insights that are not taught in universities.

**Pages:** 1 2

Tags: Cross-validation, Data Preparation, Data Science, Feature Engineering, Feature Extraction, Overfitting

**11 Clever Methods of Overfitting and how to avoid them** - Jan 2, 2015.

Overfitting is the bane of Data Science in the age of Big Data. John Langford reviews "clever" methods of overfitting, including traditional, parameter tweak, brittle measures, bad statistics, human-loop overfitting, and gives suggestions and directions for avoiding overfitting.

Tags: Cross-validation, John Langford, Overfitting

**Top KDnuggets tweets, Apr 18-20** - Apr 22, 2014.

Cross-validation pitfalls for regression/classification and how to avoid them; Data Workflows for Machine Learning ; Apache Spark, the hot new trend in Big Data ; Visual Analysis Best Practices - download a free guidebook from Tableau.

Tags: Apache Spark, Cross-validation, Machine Learning, Tableau