Upload
letuyen
View
225
Download
2
Embed Size (px)
Citation preview
BigInsights 4.0 Data Scientist
Module Install IBM Cloud Infrastructure for Analytics
November 5, 2015
2
Table of Contents
Prerequisites……………………………………………………………………………………………………………………………………………3
Install R……………………………………………………………………………………………………………………………………………………3
Install BigR…………………………………………………………………………………………………………………………………………….12
Prerequisites…………………………………………………………………………………………………………………………….12
Install Text Analytics………………………………………………………………………………………………………………………………17
3
1. Prerequisites:
i. BI 4.0 Cluster should be up and running with following services:
a. IOP
b. BigInsights Home
c. BigSheets
d. Bigsql.
ii. Refer section 5.2 in Known Issues and Workaround document for accessing web UI of BigInsights
Home, BigSheets and BigSQL.
2. Install R
Actions -> Add Service
4
Note: Keep default selected parameters in Assign Masters Wizard for all the services as it selects master
hostname automatically.
Assign Slaves and Client Wizard
It should be assigned to both master and compute node of the cluster.
5
Accept by typing Y in user.accept.R.Licenses section of Advanced r.
6
You may see this config validation error. It can be ignored for the time being. This is because the YARN container has grown beyond its heap size setting.
7
The config review page should look as below.
Note: Repositories refer to locally provided repo path for installation.
This should be the final state if all the above steps were followed correctly
8
Restart the services that show the restart icon on Ambari UI.
9
Note: After restarting all the services with restart icon, it should look as below.
Quick way to restart services:
Go to ambari ui:
Hosts -> Master node -> Restart
Repeat the same for compute nodes
10
Note: After restart if there is any service that is stopped (marked as red), then start it again before proceeding further.
11
12
3. Install BigR
3.1 Prerequisites :
i. R should be installed before installing BigR.
ii. Download below mentioned tars on Master and copy it to all compute nodes where BigR needs
to be installed.
base64enc_0.1-2.tar.gz
rJava_0.9-7.tar.gz
chron_2.3-45.tar.gz
data.table_1.9.6.tar.gz
[root@vmkdev0007 ~]# mkdir BigR_Packages
[root@vmkdev0007 ~]# cd BigR_Packages/
Download all the packages inside this directory using wget [Note: Internet access is needed on
Master node].
wget https://cran.r-project.org/src/contrib/base64enc_0.1-3.tar.gz
wget https://cran.r-project.org/src/contrib/rJava_0.9-7.tar.gz
wget https://cran.r-project.org/src/contrib/chron_2.3-47.tar.gz
wget https://cran.r-project.org/src/contrib/data.table_1.9.6.tar.gz
Copy it to all the compute nodes using below command:
ssh root@hostname ‘mkdir –p /root/BigR_Packages/’ && scp * root@hostname:/root/
BigR_Packages/
Sample command:
ssh root@vmkdev0008 ‘mkdir –p /root/BigR_Packages/’ && scp * root@vmkdev0008:/root/
BigR_Packages/
Confirm all the tars are available inside the BigR_Packages directory using below command:
[root@vmkdev0007 BigR_Packages]# ls -ltr
total 4364
-rw-r--r-- 1 root root 3709070 Sep 19 16:13 data.table_1.9.6.tar.gz
-rw-r--r-- 1 root root 36242 Sep 30 02:53 chron_2.3-45.tar.gz
-rw-r--r-- 1 root root 7191 Sep 30 02:53 base64enc_0.1-2.tar.gz
-rw-r--r-- 1 root root 711181 Oct 13 02:54 rJava_0.9-7.tar.gz
13
[root@vmkdev0007 BigR_Packages]# R CMD INSTALL base64enc_0.1-2.tar.gz
* installing to library â/usr/lib64/R/libraryâ
* installing *source* package âbase64encâ ...
Similarly, install all the other tars using below commands with exact sequence:
R CMD INSTALL base64enc_0.1-2.tar.gz R CMD INSTALL rJava_0.9-7.tar.gz
R CMD INSTALL chron_2.3-45.tar.gz R CMD INSTALL data.table_1.9.6.tar.gz
After installing all the tars go to ambari UI:
Add Service -> BigR ->
Select compute node for Big R client installation.
Set bigr user password to ambari services password.
14
This should be the final state if all the above steps were followed correctly.
15
Restart the services that show the restart icon on Ambari UI.
16
17
4. Install Text Analytics:
Actions -> Add Service -> Text Analytics ->
Set database password to ambari services password. You can change the database username, but any changes to the database hostname are ignored.
18
19
This should be the final state if all the above steps were followed correctly
20
Restart the services that show the restart icon on Ambari UI. Restart BigInsights Home service again to reload all the value add-on services.
Final dashboard after installing all 3 services should look as below:
21