Zeppelin Notebooks

Web-based notebooks created with Apache Zeppelin

Summary

This project contains notebooks created for:

  • Research
  • Exploratory Analysis
  • Fast Prototyping

Additional information about the notebooks available here:

  • Their main programming language is Scala / Spark (other languages, such as Angular, have also been used)
  • They can be viewed online via ZeppelinHub (see the View column below)
  • They require some libraries to be added to the corresponding interpreters (TODO: add a new column for that)

Notebooks

| Description | Components | Code | View |
| --- | --- | --- | --- |
| Account Analysis | Spark | json | view |
| Twitter Streaming | Twitter / Spark Streaming / Angular / Leaflet | json | view |
| LDAP Authentication | Python, LDAP3 | json | N/A |
| FASTA KPIs Evaluation | Spark | json | N/A |

Libraries

Extra libraries added to the Spark interpreter for the previously listed notebooks:

Getting Started with Zeppelin

Install Zeppelin (OS X)

  1. Edit /etc/hosts, adding your hostname: 127.0.0.1 localhost <hostname>
  2. Install Zeppelin: brew install apache-zeppelin
  3. Start the Zeppelin daemon: /usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh start
  4. Go to Zeppelin: open http://localhost:8080
  5. Stop the Zeppelin daemon: /usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh stop
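
Step 1 is easy to skip or repeat by accident. As a minimal sketch, the helpers below check whether a hosts file already maps a hostname to 127.0.0.1 and print the line from step 1 if it does not. The function names `hosts_entry` and `has_hosts_entry` are hypothetical and not part of Zeppelin or Homebrew.

```shell
# Print the /etc/hosts line from step 1 for a given hostname.
hosts_entry() {
  printf '127.0.0.1 localhost %s\n' "$1"
}

# Succeed if the hosts file ($1) already has a 127.0.0.1 line naming the host ($2).
has_hosts_entry() {
  awk -v h="$2" '
    $1 == "127.0.0.1" {
      for (i = 2; i <= NF; i++) {
        if ($i ~ /^#/) break      # ignore trailing comments
        if ($i == h) found = 1
      }
    }
    END { exit !found }
  ' "$1"
}
```

For example, `has_hosts_entry /etc/hosts "$(hostname)" || hosts_entry "$(hostname)"` prints the line to add (editing /etc/hosts itself still requires sudo) only when it is missing.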

For convenience, add aliases to your bash profile, for instance:

  • alias zep="/usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh start && open http://localhost:8080"
  • alias zepx="/usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh stop"
  • alias zepl="tail -f /usr/local/Cellar/apache-zeppelin/0.6.2/libexec/logs/zeppelin-interpreter-spark-username-hostname.log"

Note: replace 'username' and 'hostname' in the log file name with the values for your system.
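
To avoid hardcoding the username and hostname, the log path can be built at run time. This is a sketch only: it assumes the Homebrew Cellar layout and the log file naming shown in the aliases above. The names `ZEPPELIN_VERSION`, `ZEPPELIN_CELLAR`, `zeppelin_spark_log`, and the function form of `zepl` are hypothetical.

```shell
# Assumption: same Cellar layout as the aliases above; change the version to match your install.
ZEPPELIN_VERSION="${ZEPPELIN_VERSION:-0.6.2}"
ZEPPELIN_CELLAR="/usr/local/Cellar/apache-zeppelin/${ZEPPELIN_VERSION}"

# Print the Spark interpreter log path.
# $1 = username (default: current user), $2 = hostname (default: this machine)
zeppelin_spark_log() {
  user="${1:-$(whoami)}"
  host="${2:-$(hostname)}"
  printf '%s/libexec/logs/zeppelin-interpreter-spark-%s-%s.log\n' \
    "$ZEPPELIN_CELLAR" "$user" "$host"
}

# Function equivalent of the zepl alias, with the path filled in automatically.
zepl() {
  tail -f "$(zeppelin_spark_log)"
}
```

Calling `zeppelin_spark_log alice mybox` prints the path for a specific user and host, which is useful for checking the name against the files actually present in the logs directory.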

Configure Zeppelin

  • Enable Git storage
  • Enable user authentication (Shiro)
  • Extend interpreters with additional libraries (Spark-Streaming-Twitter, jTDS...)
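
As a rough sketch of two of these items, the lines below could go in conf/zeppelin-env.sh. The storage class is the Git notebook repository that ships with Zeppelin. The Maven coordinates and versions are illustrative assumptions, not the ones used for these notebooks, and must match your Spark and Scala versions. SPARK_SUBMIT_OPTIONS is only picked up when SPARK_HOME points to a Spark installation. Shiro authentication is configured separately in conf/shiro.ini and is not shown here.

```shell
# conf/zeppelin-env.sh (sketch; values below are assumptions to adapt)

# Version notebooks in a local Git repository.
export ZEPPELIN_NOTEBOOK_STORAGE="org.apache.zeppelin.notebook.repo.GitNotebookRepo"

# Extra libraries for the Spark interpreter, passed to spark-submit.
# Illustrative coordinates only; check the versions against your Spark/Scala build.
export SPARK_SUBMIT_OPTIONS="--packages org.apache.bahir:spark-streaming-twitter_2.11:2.0.0,net.sourceforge.jtds:jtds:1.3.1"
```

Libraries can also be added per interpreter from the Interpreter page in the Zeppelin UI, which avoids restarting with a changed environment file.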

Configure Zeppelin Hub

  • Create an account in Zeppelin Hub
  • Download the Zeppelin Hub library and include it in the Zeppelin installation
  • Add a new Zeppelin instance in Zeppelin Hub to obtain the API token and user key
  • Add the API token and user key to the Zeppelin shell script
  • Launch the Zeppelin daemon

Other