Exploratory Data Analysis on FIFA Dataset Using Spark Scala APIs in Apache Zeppelin Notebook.
wget https://downloads.apache.org/zeppelin/zeppelin-0.10.0/zeppelin-0.10.0-bin-all.tgz
sudo tar xf zeppelin-*-bin-all.tgz -C /home/hosniadel/app/
sudo mv /home/hosniadel/app/zeppelin-*-bin-all /home/hosniadel/app/zeppelin
sudo useradd -d /home/hosniadel/app/zeppelin -s /bin/false zeppelin
sudo chown -R zeppelin:zeppelin /home/hosniadel/app/zeppelin
sudo nano /etc/systemd/system/zeppelin.service
Paste the following into the zeppelin.service file:
[Unit]
Description=Zeppelin service
After=syslog.target network.target
[Service]
Type=forking
ExecStart=/home/hosniadel/app/zeppelin/bin/zeppelin-daemon.sh start
ExecStop=/home/hosniadel/app/zeppelin/bin/zeppelin-daemon.sh stop
ExecReload=/home/hosniadel/app/zeppelin/bin/zeppelin-daemon.sh reload
User=zeppelin
Group=zeppelin
Restart=always
[Install]
WantedBy=multi-user.target
- In /home/hosniadel/app/zeppelin/conf/zeppelin-site.xml, set the server port to 9090 (the default is 8080):
<property>
<name>zeppelin.server.port</name>
<value>9090</value>
<description>Server port.</description>
</property>
- Set JAVA_HOME and SPARK_HOME in zeppelin-env.sh.
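- Start the Zeppelin service (reload systemd first so it picks up the new unit file; the status and stop commands are listed for reference):
sudo systemctl daemon-reload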
sudo systemctl start zeppelin
sudo systemctl status zeppelin
sudo systemctl stop zeppelin
- While the service is running, Zeppelin is available at http://localhost:9090 (the port set in zeppelin-site.xml; use 8080 if you kept the default).
I have Spark v3.0.0 installed, so my zeppelin-env.sh contains:
export JAVA_HOME=/usr/lib/jvm/java-1.8.0-openjdk-amd64
export SPARK_HOME=/opt/spark
- Create a new note.
- Load the FIFA dataset as a CSV file and perform your EDA.
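As a starting point for the note, here is a minimal sketch of loading the CSV and running a first pass of EDA with the Spark Scala API. It assumes the paragraph runs under Zeppelin's Spark interpreter, where `spark` is already defined. The path `/path/to/fifa.csv` is a placeholder, and the column names `Nationality` and `Overall` are assumptions that depend on which FIFA CSV you downloaded, so the code checks for them before using them.

```scala
// Run in a Zeppelin paragraph bound to the Spark interpreter.
// `spark` (a SparkSession) is provided by Zeppelin, so it is not created here.
import org.apache.spark.sql.functions.{avg, col, count, desc}

// Placeholder path (assumption): point this at your copy of the FIFA CSV.
val fifaPath = "/path/to/fifa.csv"

val df = spark.read
  .option("header", "true")      // first row holds the column names
  .option("inferSchema", "true") // let Spark infer numeric vs. string types
  .csv(fifaPath)

// Basic shape of the data: schema, size, and a few sample rows.
df.printSchema()
println(s"rows = ${df.count()}, columns = ${df.columns.length}")
df.show(5, truncate = false)

// Assumed column names; adjust them to match your file.
val assumedCols = Seq("Nationality", "Overall")

if (assumedCols.forall(c => df.columns.contains(c))) {
  // Summary statistics (count, mean, stddev, min, max) for the rating column.
  df.describe("Overall").show()

  // Player count and average rating per nationality, largest groups first.
  df.groupBy("Nationality")
    .agg(count("*").as("players"), avg(col("Overall")).as("avg_overall"))
    .orderBy(desc("players"))
    .show(10)
} else {
  println(s"Columns $assumedCols not found; adjust the names to match your file.")
}
```

From here, the same `groupBy`/`agg` pattern can be reused for other columns that appear in the output of `df.printSchema()`.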
Note: Use Ctrl+. to autocomplete in Zeppelin.
https://www.vultr.com/docs/how-to-install-apache-zeppelin-on-ubuntu-16-04/