Hadoop as a Service: 18 Cloud Options

Hadoop as a service in the cloud makes big data applications and projects easier to approach and these 18 platforms each provide their own unique solutions.

Hadoop elephant in cloud Hadoop has come to be an incredibly important technology for many big data projects and applications. But without the proper time or training, it can be difficult to leverage this technology. Hadoop-as-a-service has grown to satisfy the need created by this situation, but there are many options. Below are 18 of the best options.

  1. Aleron promotes a range of big data services – including Hadoop-focused offerings.
  2. Altiscale has developed a purpose-built, petabyte-scale infrastructure that delivers Apache Hadoop as a cloud service.
  3. Amazon EMR provides a managed Hadoop framework to distribute and process vast amounts data across dynamically scalable Amazon EC2 (Elastic Compute Cloud) instances.
  4. CenturyLink, the cloud services provider, has six Hadoop blueprints.
  5. CSC, the large integrator and MSP, offers Big Data Platform as a Service (BDPaaS).
  6. Gold Coast blends application development and consulting services – helping customers to optimize Cloudera on Amazon Web Services, for instance.
  7. With the Google Cloud Storage connector for Hadoop, you can perform MapReduce jobs directly on data in Google Cloud Storage, without copying to local disk and running Hadoop Distributed File System (HDFS).
  8. HP Cloud provides an elastic cloud computing and cloud storage platform to analyze and index large data volumes in the hundreds of petabytes in size, HP asserts.
  9. IBM BigInsights on Cloud provides Hadoop-as-a-service on IBM’s SoftLayer global cloud infrastructure – a bare metal design.
  10. Microsoft’s Hadoop cloud service scales to petabytes on demand; processes unstructured and semi-structured data; deploys on Windows or Linux; integrates with on-premises Hadoop clusters (if needed); and supports multiple development languages including Java and .Net.
  11. Qubole's main focus is Hadoop as a service.
  12. Rackspace offers several options for running Apache Hadoop including deploying Hadoop on Rackspace managed dedicated servers; spinning up Hadoop on Rackspace’s public cloud via virtual servers or on dedicated bare-metal cloud servers; or configuring your own private cloud.
  13. Sahara provides push-button provisioning of mainstream Hadoop distributions and elastic data processing (EDP) capability similar to Amazon Elastic MapReduce (EMR).
  14. Skytap’s infrastructure offers virtual environments in which you can create, deploy, and remove Hadoop instances as needed.
  15. Tieto introduced a big data PaaS platform in 2012.
  16. Verizon’s Enterprise business inked a Cloudera partnership in 2013, and the IT services giant now offers Cloudera atop its cloud infrastructure.
  17. Vendors also offer integrated Hadoop solutions (including all required hardware and software) that can be deployed on-premises or in a third-party data center. Options include Avnet, Dell, Cisco, and EMC, among many others.
  18. One of the easiest ways to find and track new Hadoop cloud providers is to check in regularly with Hadoop distribution providers like Cloudera, Hortonworks and MapR.

This post is a summary of the InformationManagement slideshow