- How to Check Hypotheses with Bootstrap and Apache Spark - Jan 28, 2016.
Learn how to use bootstrap sampling to test hypotheses, and how to implement it in Apache Spark and Scala, with a complete code example.
Apache Spark, Bootstrap sampling, Dmitry Petrov, Statistical Analysis
- What No One Tells You About Real-Time Machine Learning - Nov 9, 2015.
Real-time machine learning has access to a continuous flow of transactional data, but what it really needs in order to be effective is a continuous flow of labeled transactional data, and accurate labeling introduces latency.
Dmitry Petrov, Machine Learning, Real-time
- Beginners Guide: Apache Spark Machine Learning with Large Data - Nov 5, 2015.
This informative tutorial walks us through using Spark's machine learning capabilities and Scala to train a logistic regression classifier on a larger-than-memory dataset.
Apache Spark, Dmitry Petrov, Machine Learning