- MLlib: Apache Spark component for machine learning - Jul 24, 2014.
MLlib, the machine learning component of Apache Spark, has developed into a tool that supports many common machine learning algorithms and now comes with more mature documentation and a stable API.
Apache Spark, Daniel D. Gutierrez, Machine Learning, MLlib
- YARN is All the Rage at Hadoop Summit 2014 - Jun 12, 2014.
Apache YARN, which enables much broader types of computations than MapReduce, is quickly becoming an integral part of Hadoop projects. We review best practices considerations for a YARN cluster.
Apache, Apache Spark, Daniel D. Gutierrez, Hadoop, Summit, YARN
- Big Data Use Case: Zookeeper at Rubicon Project - May 27, 2014.
What is the big idea with ZooKeeper - a summary of an excellent Big Data use case using Apache ZooKeeper for Hadoop implementation.
Daniel D. Gutierrez, Hadoop, Jan Gelin, ZooKeeper