CARY, NC (Mar. 06, 2012) - SAS increases access to critical data sources with Hadoop support in its updated SAS Enterprise Data Integration Server. Exploiting the popular open source data architecture helps customers using analytics from SAS, the leader in business analytics, increase the value of big data assets.
Hadoop joins more than three dozen supported data sources in SAS Enterprise Data Integration Server, including Oracle, DB2, SQL Server, Teradata (including Teradata Aster), Sybase, Netezza, EMC Greenplum and MySQL. SAS support for Hadoop access is a key requirement for many organizations that are adding Hadoop to their environments. These enterprises include Macys.com, the online operation of the iconic retailer.
"Hadoop is facilitating big data analytics at Macys.com as our data assets continue to grow exponentially. SAS Hadoop support will let us fully leverage our analytics talent, our data and our long-term investment in SAS," said Kerem Tomak, Vice President of Marketing Analytics at Macys.com. "SAS with Hadoop is critical to our big data plan."
Sponsored by the Apache Software Foundation, Hadoop is an open source Java-based framework for processing large data sets in a distributed computing environment. SAS integrates with the Apache Hadoop distribution.
SAS' deep integration with Hadoop applies the parallelism of MapReduce, the distributed computing framework commonly associated with Hadoop. SAS, Hadoop and Hive, the data warehouse infrastructure for Hadoop, complement one another in analyzing large data sets, simplifying the most common big data analytic use cases.
Numerous features within the enhanced SAS Enterprise Data Integration Server support Hadoop and big data, including:
- SAS "write once, run anywhere" extends to Hadoop deployments.
- SAS features such as the job flow builder, visual editor and syntax checker are extended to Hive, Pig, MapReduce and HDFS commands.
- SAS augments native Hadoop security with SAS data security provisions, including authorization and data lineage.
- SAS supports popular Hadoop distributions such as Cloudera, Hortonworks and EMC Greenplum.
- SAS data quality and profiling cover data moving in or out of Hadoop.
- SAS access extends SAS capabilities such as visual analytics explorer, text mining and analytics to Hadoop data.
- Hadoop data can be federated along with data from other sources, including the ability to embed the federated query in a data management job flow.