5
Enterprise Data Preparation is a collaborative self-service data discovery and preparation solution for data analysts and data scientists. Analysts can rapidly discover data and turn raw data into insights that data scientists can use to ensure data quality, visibility, and governance. Enterprise Data Preparation helps you derive more value from your Hadoop-based data lake and make data available to all users in the organization. Use this Quick Start to deploy Enterprise Data Preparation on the Microsoft Azure Marketplace. Enterprise Data Preparation on Microsoft Azure Marketplace Quick Start © Copyright Informatica LLC 2020. Make sure that you meet the following prerequisites: Your Microsoft Azure subscription includes the owner role. The owner role has access and permissions to create the following resources on the Azure platform: Virtual network Network security group Virtual machines You have downloaded valid Enterprise Data Preparation license to your local machine or a location on your network. You have a sufficient number of CPU cores where you plan to deploy the Enterprise Data Preparation solution. For more information, see prerequisites. If you want to deploy Enterprise Data Preparation on an existing HDInsight cluster, see HDInsight Cluster prerequisites. Launch Wizard Configure Basics Configure Enterprise Data Preparation Configure Microsoft HDInsight Configure Bastion Server Configure Infrastructure Settings Review Configuration and Deploy Monitor Deployment View Output

Enterprise Data Preparation on Microsoft Azure Marketplace€¦ · • Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to

  • Upload
    others

  • View
    6

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Enterprise Data Preparation on Microsoft Azure Marketplace€¦ · • Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to

Enterprise Data Preparation is a collaborative self-service data discovery and preparation solution for data analysts and data scientists. Analysts can rapidly discover data and turn raw data into insights that data scientists can use to ensure data quality, visibility, and governance.

Enterprise Data Preparation helps you derive more value from your Hadoop-based data lake and make data available to all users in the organization.

Use this Quick Start to deploy Enterprise Data Preparation on the Microsoft Azure Marketplace.

Enterprise Data Preparation onMicrosoft Azure Marketplace

Quick Start

© Copyright Informatica LLC 2020.

Make sure that you meet the following prerequisites:

• Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to create the following resources on the Azure

platform: • Virtual network • Network security group• Virtual machines

• You have downloaded valid Enterprise Data Preparation license to your local machine or a location on your network.

• You have a sufficient number of CPU cores where you plan to deploy the Enterprise Data Preparation solution. For more information, see prerequisites.

If you want to deploy Enterprise Data Preparation on an existing HDInsight cluster, see HDInsight Cluster prerequisites.

LaunchWizard

ConfigureBasics

ConfigureEnterprise Data

Preparation

Configure Microsoft HDInsight

Configure Bastion Server

Configure Infrastructure

Settings

Review Configuration

and Deploy

MonitorDeployment

ViewOutput

Page 2: Enterprise Data Preparation on Microsoft Azure Marketplace€¦ · • Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to

1. Log in to the Azure marketplace website.

2. Search for and select Enterprise Data Preparation.

3. To launch the deployment wizard, click GET IT NOW.

4. Click Continue and configure the properties.

On the Basics panel:

1. Enter the Microsoft Azure subscription.

2. Select the resource group that contains the virtual network or create a new resource group.

3. Select the region where you want to deploy Enterprise Data Preparation.

Subscription, Resource Group, and Region

GET IT NOW

© Copyright Informatica LLC 2020.

Page 3: Enterprise Data Preparation on Microsoft Azure Marketplace€¦ · • Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to

© Copyright Informatica LLC 2020.

On the Informatica Enterprise Data Preparation panel:

1. Enter the Informatica license key.2. Specify the Informatica server

size.3. Select one of the following

deployment cluster types:• Small• Medium• Large

4. Specify the embedded Hadoop cluster virtual machine size.

5. Optionally, toggle to enable Informatica High Availability.

6. Enter and confirm the password for SSH, RDP, database, and database user.

Informatica License Key, Informatica Server, Database Server, Deployment Type, Embedded Hadoop Cluster Virtual Machine Size,Informatica High Availability, and Password

On the Microsoft HDInsight Settings panel, enter details for an existing HDInsight cluster or for a new HDInsight cluster.

For an existing Microsoft HDInsight cluster:1. Enter the HDInsight cluster URL.2. Enter the user name for the cluster

and the cluster secure shell.3. Enter the password for the user

account.4. Enter Microsoft Azure Data Lake

Storage Gen2 account name and resource group.

Note: The Microsoft HDInsight resource location and the vnet location must be in the same region.

For a new Microsoft HDInsight Cluster, specify the HDInsight virtual machine size.

HDInsight Cluster URL, Cluster login Username, Cluster Secure Shell (SSH) Username, Password, Azure Data Lake Storage Gen2 account name, and Azure Data Lake Storage Gen2 account Resource Group

Select to enter options for a new or existing HDInsight cluster

Page 4: Enterprise Data Preparation on Microsoft Azure Marketplace€¦ · • Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to

© Copyright Informatica LLC 2020.

Bastion Server Size

On the Bastion Server panel:

1. To deploy a bastion server as your primary access point, toggle Yes.

2. Specify the virtual machine size for the bastion server.

On the Infrastructure Settings panel:

1. Enter the IP address range.2. To assign a public IP address to

the network interface of the virtual machine, toggle Yes.

3. Select the virtual network.4. Select the identifier for the subnet.5. Specify the path to the MySQL

database JAR file.

CIDR IP Address Range, Assign Public IP Address, Virtual network, Subnet, upload MySQL jar file

On the Review + create panel:

1. Verify the configuration details.2. Read the terms of use.3. Click Create.4. Recycle the Informatica application

services. For more information, see Recycling Informatica Services.

Page 5: Enterprise Data Preparation on Microsoft Azure Marketplace€¦ · • Your Microsoft Azure subscription includes the owner role. • The owner role has access and permissions to

© Copyright Informatica LLC 2019.

Need more info? Check out the Enterprise Data Preparation help.

Monitor the deployment

For help with troubleshooting, see the Logs chapter in the Deploying Enterprise Data Preparation on Microsoft Azure Marketplace guide.

On the Enterprise Data Preparation dashboard, select the resource group that contains the deployment you configured for HDInsights:

1. To view the list of resources in the resource group, click Overview.

2. To view information about failed resource deployments, click Error details.

3. To sort by resource name, type, or status of the resource, click the column headings.

View the output properties of the main template

After the configuration deploys successfully, you can view the output properties of the main template:

1. On the Resource Group page, click Outputs.

2. Click the deployment named <maintemplate> > Outputs.

Sort the listView the output properties of the main template

Overview