Kickstart Hadoop In HDInsight On Windows

In this article, you will learn about Hadoop in HDInsight on Windows.

Introduction

This article will help you learn Hadoop using HDInsight on Windows. Hadoop helps you draw some meaningful information from a set of unstructured data, using Hive query.

What is Hadoop?

Hadoop is a framework that helps for distributed storage and distributed processing of very large data sets on computer clusters, built from commodity hardware.

What is HDInsight?

HDInsight provides data storage concepts like HDFS (Hadoop Distributed File System) and a simple MapReduce programming model to process and analyse the data.

Here, as a first module, we will be working with creating Hadoop Clusters:

Pre-requisites

  • An Azure Subscription.

Note: Click here to get a free trial account of Azure.

Follow the below steps, now, to create Hadoop Clusters.

Step 1 - Login to the Azure Portal, using www.portal.azure.com, and sign in with your credentials.

Login

Step 2 - Here comes the Azure Portal.

Azure Portal

Click on New - Data + Analytics.

Analytics

Step 3 - Enter the following details needed.

table

Surf the below images for reference and the below image for Cluster Type.

Cluster Type

Below image is for credentials.

credentials

For Data Source

Data Source

For Pricing

Pricing

For Resource Group

Resource Group

Click on Create button now. It will take somewhere between 15 to 20 minutes to create a Cluster.