#ds1004-big-data-project
To replicate our results, please begin by downloading the NYPD crime data (2006-2016) available at https://data.cityofnewyork.us/Public-Safety/NYPD-Complaint-Data-Historic/qgea-i56i. Any .py scripts should be run using PySpark. Any .ipynb notebooks should be opened in Jupyter Notebook. More detailed READMEs on how to run each script or notebook can be found in the relevant subfolders.