- Web Scraping for Data Science with Python - Dec 6, 2017.
We take a quick look at how web scraping can be useful in the context of data science projects, eg to construct a social graph based of S&P 500 companies, using Python and Gephi.
Bart Baesens, Data Science, Python, S&P 500, Web Mining, Web Scraping
- Understanding Rare Events and Anomalies: Why streaks patterns change - Jan 8, 2016.
We often look back at the past year and an overall history of rare events, and try to then extrapolate future odds of the same rare event, based on that. We illustrate here, that rare past events have no usefulness in understanding the rarity of the same events in the future!
Pages: 1 2
Anomaly Detection, Predictions, S&P 500
- The Cardinal Sin of Data Mining and Data Science: Overfitting - Jun 14, 2014.
Overfitting leads to public losing trust in research findings, many of which turn out to be false. We examine some famous examples, "the decline effect", Miss America age, and suggest approaches for avoiding overfitting.
Dean Abbott, John Ioannidis, Kirk D. Borne, Overfitting, S&P 500