First of all, you must have a local network with two or more VMs connected to each other.
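As a quick connectivity check you can, for example, ping one VM from another; the address below is simply the master address used later in this guide, so adjust it to your own network:
ping 192.168.0.2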
To install Hadoop and Spark, follow the steps below:
First, install Hadoop by following the link -> Hadoop
Then, set up the YARN cluster by following the link -> Yarn
Finally, install Spark using the PDF above or by following the link -> Spark
1. Connect to the master VM:
ssh (master vm connection string)
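For example, assuming the master VM is reachable at 192.168.0.2 (the address used for the Spark master below) and has a user named ubuntu (a hypothetical username), the connection string could look like:
ssh ubuntu@192.168.0.2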
2. Start HDFS and the Spark master in the master VM:
start-dfs.sh
start-master.sh
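To check that the daemons came up, you can list the running Java processes with jps; on the master VM you should normally see at least a NameNode (HDFS) and a Master (Spark) process:
jps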
3. Upload the data to the Hadoop Distributed File System (HDFS), as in the example below:
hadoop fs -put ./yellow_tripdata_2022-01.parquet hdfs://master:9000/par/yellow_tripdata_2022-01.parquet
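You can verify the upload by listing the target directory in HDFS:
hadoop fs -ls hdfs://master:9000/par/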
4. Start a worker on the master VM:
start-worker.sh spark://192.168.0.2:7077
5. Start a worker on the slave VM by typing the following commands in the master VM:
ssh (slave vm connection string)
start-worker.sh spark://192.168.0.2:7077
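To confirm that both workers have registered with the master, open the Spark master web UI in a browser (by default it listens on port 8080 of the master VM):
http://192.168.0.2:8080
Both workers should appear in the Workers table with state ALIVE.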
6. Submit the job to the Spark cluster (from the master VM and in the directory of the file):
spark-submit (filename)
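For example, assuming the application is a Python script named query.py (a hypothetical filename) and that spark.master is not already set in spark-defaults.conf, the master URL can be passed explicitly so the job runs on the standalone cluster started above:
spark-submit --master spark://192.168.0.2:7077 query.py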
7. Results
See the results in the terminal.
Dimitris Kalathas, Dimitris Bakalis