Topics: AI | Data Science | Data Visualization | Deep Learning | Machine Learning | NLP | Python | R | Statistics

KDnuggets Home » News » 2013 » Jun » News Briefs » Mu Sigma launches muHPC, packaged MapReduce algorithms for Hadoop ( 13:n16 )

Mu Sigma launches muHPC, packaged MapReduce algorithms for Hadoop


muHPC is a library of popular statistical algorithms written in MapReduce, designed for enterprise-class Big Data analysis in Hadoop environments. muHPC initially includes R functions for linear modeling, exploratory data analysis, and clustering.



Chicago, IL, June 27, 2013.

Mu SigmaMu Sigma, the largest pure-play provider of decision sciences and analytics solutions for global enterprise customers, launched a new addition to its series of analytical products.

muHPC™ (for High Performance Computing) is a library of popular statistical algorithms written in MapReduce, designed for enterprise-class Big Data analysis in Hadoop environments. As with Mu Sigma's other products, muHPC was successfully and extensively used within Mu Sigma on many client engagements before the company brought it to market.

Traditionally, enterprises that wanted to leverage R and Hadoop for Big Data analysis have had to write their own algorithms, or rely on open-source options that had not been widely used or tested. Quality varied, and it was a challenge for companies to acquire talent with relevant skills and competencies in order to code their own algorithms. Mu Sigma's offering enables enterprises to accelerate their R and Hadoop initiatives, and their overall Big Data analysis programs.

In testing, muHPC packages were consistently 2-4 times faster than a leading commercial software in equivalent procedures.

muHPC consists of three packages currently:

  • muGLM: Offers easy-to-use R functions for building a wide variety of generalized linear models (OLS, Logistic, Poisson, Negative Binomial, Gamma etc.) on Big Data.
  • muEDA: Offers easy-to-use R functions for performing exploratory analysis on Big Data.
  • muKMeans: Offers easy-to-use R functions for data clustering on Big Data using the K-means algorithm.

Mu Sigma leveraged technology from Cloudera and Revolution Analytics to build muHPC. "Cloudera's Distribution including Apache Hadoop is a great platform for organizations to store and analyze data," said Tim Stevens, vice president, Business Development, Cloudera. "Mu Sigma developing and certifying their new solution on top of Cloudera ensures that their approach to solving big data challenges is both innovative and effective."

muHPC is available now, and comes with an annual subscription license on a per-cluster basis with an incremental price per-node.

To learn more, visit www.mu-sigma.com/muhpc.


Sign Up

By subscribing you accept KDnuggets Privacy Policy