Microsoft HDInsight favicon

Microsoft HDInsight

  • A Data Lake service
  • Scale to petabytes on demand
  • Crunch all data—structured, semi-structured, unstructured
  • Develop in Java, .NET, and more
  • Skip buying and maintaining hardware
  • Spin up Apache Hadoop, Spark, and R clusters in the cloud
  • Use Excel or your favorite BI tool to visualize Hadoop data
  • Connect on-premises Hadoop clusters with the cloud

Scale elastically on demand Azure HDInsight is an Apache Hadoop distribution powered by the cloud. This means that it handles any amount of data, scaling from terabytes to petabytes on demand. Spin up any number of nodes at any time. We charge only for the compute and storage that you use.

Crunch all data—structured, semi-structured, unstructured Because it's 100 percent Apache Hadoop, HDInsight can process unstructured or semi-structured data from web clickstreams, social media, server logs, devices and sensors, and more. This lets you analyze new sets of data and uncover new business possibilities that drive your organization forward. Develop in your favorite language HDInsight has powerful programming extensions for languages including C#, Java, and .NET. Use your programming language of choice on Hadoop to create, configure, submit, and monitor Hadoop jobs.

Skip the hardware purchase and maintenance With HDInsight, deploy Hadoop in the cloud without buying new hardware or incurring other up-front costs. There’s also no time-consuming installation or set up. Azure does it for you. Launch your first cluster in minutes.

Use Excel or your favorite BI tool to visualize Hadoop data Because it's integrated with Excel, HDInsight lets you visualize and analyze your Hadoop data in compelling new ways using a tool that's familiar to your business users. From Excel, users can select HDInsight as a data source.

Cloudera CDH

Cloudera CDH

Cloudera's open-source Apache Hadoop distribution, CDH (Cloudera Distribution Including Apache Hadoop), targets enterprise-cla ...