Tag: Dmitry Petrov (3)
- How to Check Hypotheses with Bootstrap and Apache Spark - Jan 28, 2016.
Learn how to leverage bootstrap sampling to test hypotheses, and how to implement in Apache Spark and Scala with a complete code example.
- What No One Tells You About Real-Time Machine Learning - Nov 9, 2015.
Real-time machine learning has access to a continuous flow of transactional data, but what it really needs in order to be effective is a continuous flow of labeled transactional data, and accurate labeling introduces latency.
- Beginners Guide: Apache Spark Machine Learning with Large Data - Nov 5, 2015.
This informative tutorial walks us through using Spark's machine learning capabilities and Scala to train a logistic regression classifier on a larger-than-memory dataset.