6
Set Up Guide of Spark on HDFS CIS612 Big Data and Parallel Database Processing Systems Downloaded and installed Spark successfully on ubuntu and create collection on Spark RDD with JSON Files move the extracted directory to /opt:

CIS 612 Lab4 2 Spark HDFS Installation 2020

  • Upload
    others

  • View
    25

  • Download
    0

Embed Size (px)

Citation preview

Page 1: CIS 612 Lab4 2 Spark HDFS Installation 2020

Set Up Guide of Spark on HDFS

CIS612 Big Data and Parallel Database Processing Systems

Downloaded and installed Spark successfully on ubuntu and create collection on Spark RDD with JSON Files

move the extracted directory to /opt:

Page 2: CIS 612 Lab4 2 Spark HDFS Installation 2020

Configure the Environment

Page 3: CIS 612 Lab4 2 Spark HDFS Installation 2020
Page 4: CIS 612 Lab4 2 Spark HDFS Installation 2020
Page 5: CIS 612 Lab4 2 Spark HDFS Installation 2020
Page 6: CIS 612 Lab4 2 Spark HDFS Installation 2020

Reference:

https://www.edureka.co/blog/apache-hive-installation-on-ubuntu/comment-page-2/#comments

https://bigdataprogrammers.com/load-csv-file-in-hive/

https://www.liquidweb.com/kb/how-to-install-apache-spark-on-ubuntu/