Tag: Daniel D. Gutierrez (3)
- MLlib: Apache Spark component for machine learning - Jul 24, 2014.
MLlib, the machine learning component of Apache Spark, has developed into a tool that supports many common machine learning algorithms and now comes with more mature documentation and a stable API.
- YARN is All the Rage at Hadoop Summit 2014 - Jun 12, 2014.
Apache YARN, which enables much broader types of computations than MapReduce, is quickly becoming an integral part of Hadoop projects. We review best practices considerations for a YARN cluster.
- Big Data Use Case: Zookeeper at Rubicon Project - May 27, 2014.
What is the big idea with ZooKeeper - a summary of an excellent Big Data use case using Apache ZooKeeper for Hadoop implementation.