Web-based notebooks created with Apache Zeppelin
This project contains notebooks created for:
- Research
- Exploratory Analysis
- Fast Prototyping
Additional information about the notebooks here available:
- Their main programming language is Scala / Spark (however other languages as Angular have been used as well)
- They can be visualized online by means of ZeppelinHub (see column View below)
- They require some libraries to be imported in the corresponding Interpreters (TODO: add new column for that)
| Description | Components | Code | View | | ------------- | ----- | ---------- | ------------ | -------- | -------- | | Account Analysis | Spark | json | view | | Twitter Streaming | Twitter / Spark Streaming / Angular / Leaflet | json | view | | LDAP Authentication | Python, LDAP3 | json | N/A | | FASTA KPIs Evaluation | Spark | json | N/A |
Extra libraries added to the Spark interpreter for the previously listed notebooks:
- nscala-time: Scala wrapper for Joda Time
- spark-streaming-twitter: library for reading social data from Twitter using Spark Streaming
- Edit
/etc/hosts
adding your hostname:127.0.0.1 localhost <hostname>
- Install Zeppelin:
brew install apache-zeppelin
- Start Zeppelin daemon:
./usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh start
- Go to Zeppelin:
open http://localhost:8080
- Stop Zeppelin daemon:
./usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh stop
For simplicity add aliases to your bash profile, for instance the following:
alias zep="/usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh start && open http://localhost:8080"
alias zepx="/usr/local/Cellar/apache-zeppelin/0.6.2/bin/zeppelin-daemon.sh stop"
alias zepl="tail -f /usr/local/Cellar/apache-zeppelin/0.6.2/libexec/logs/zeppelin-interpreter-spark-username-hostname.log"
Note: update 'username' and 'hostname' according to your system
- Enable GIT storage
- Enable user authentication (Shiro)
- Extend interpreters with additional libraries (Spark-Streaming-Twitter, jTDS...)
- Create account in Zeppelin Hub
- Download Zeppelin Hub library and include in Zeppelin installation
- Add new Zeppelin instance in Zeppelin Hub to obtain API token and user key
- Add API token and user key in Zeppelin shell script
- Launch Zeppelin daemon
- See also HortonWorks Gallery