Welcome to guide to databricks spark certification ! This repository will help you:
- Learn about Apache Spark framework
- Learn to use pyspark in databricks enviroment
- Learn about the topics that are required study for the clear CRT020 examination
Throughout the guide more emphesis will be given to a code first methodology with minimal theory when covering topics. Following are the pre-requisites to start using this guide:
- Basic knowledge of python
- Basic knowledge of SQL
- Databricks community edition account
Once you setup the account download the .DBC files and upload it to your databricks account as shown below:
1.Log into databricks community edition and click on import 2.Click on file and browse 3. Upload the databricks-spark-certification.dbc file
This guide is suplemented with a google sheet where you can find topic wise breakup of material provided in the guide.