CIS 612 Lab4 2 Spark HDFS Installation 2020

Setup Guide for Spark on HDFS

CIS612 Big Data and Parallel Database Processing Systems

Download and install Spark on Ubuntu, then create a collection as a Spark RDD from JSON files.

Move the extracted Spark directory to /opt.
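
The move into /opt might look like the sketch below. The function name move_spark_dir, the source path in the example, and the target name /opt/spark are assumptions for illustration; the guide does not name the extracted directory or say what it is called under /opt.

```shell
# Move an extracted Spark directory to <parent>/spark (parent defaults to /opt).
# move_spark_dir and the "spark" target name are hypothetical, not from the guide.
move_spark_dir() {
    src="$1"
    dest_parent="${2:-/opt}"

    # Refuse to continue if the source is not an existing directory.
    if [ ! -d "$src" ]; then
        echo "not a directory: $src" >&2
        return 1
    fi

    # Refuse to overwrite or nest inside an existing installation.
    if [ -e "$dest_parent/spark" ]; then
        echo "already exists: $dest_parent/spark" >&2
        return 1
    fi

    mkdir -p "$dest_parent" || return 1
    mv "$src" "$dest_parent/spark"
}

# Example call (the source path is a placeholder):
#   move_spark_dir ./extracted-spark-dir /opt
# Writing to /opt usually needs root. The plain equivalent is:
#   sudo mv ./extracted-spark-dir /opt/spark
```

Renaming the directory to a version-free name such as spark is a design choice, not a requirement: it keeps later environment settings unchanged when Spark is upgraded.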

Configure the Environment
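
A minimal sketch of the environment settings, assuming Spark ended up in /opt/spark (the guide only says the directory was moved to /opt, so adjust the path to the actual directory name). These lines are typically added to ~/.bashrc so that they apply to every new shell.

```shell
# Assumed install location; change this if the directory under /opt has another name.
export SPARK_HOME=/opt/spark

# Put the Spark launchers (spark-shell, spark-submit, pyspark) and the
# cluster scripts in sbin (for example start-master.sh) on the PATH.
export PATH="$PATH:$SPARK_HOME/bin:$SPARK_HOME/sbin"
```

After adding the lines to ~/.bashrc, running `source ~/.bashrc` or opening a new terminal makes them take effect.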

References:

https://www.edureka.co/blog/apache-hive-installation-on-ubuntu/comment-page-2/#comments

https://bigdataprogrammers.com/load-csv-file-in-hive/

https://www.liquidweb.com/kb/how-to-install-apache-spark-on-ubuntu/