This guide shows you how to install Hadoop 2.7 and Spark 2.4 along with HDFS, Java 8 SE and Yarn on a cluster of 3 Ubuntu 18.04 Machines with VirtualBox.
Distributed environment on virtual machines could be useful when you do performances test and want to analyze the scalability of your program in multi-node structure. Also you can follow the similar steps when you install these tools in multiple machines.
This guide contains 3 sub-guides:
1. Virtual Machine Installation & Configuration
- Don't hesitate to
- Contribute to this guide
- Open issues
- Report bugs
- Open pull request
- Once you contribute, your name will be added to Contributors section.