- Python 3
- Jupyter Notebook
- Run pip install -r requirements.txt for libraries
- Step 0: Clone this repo
- Step 1: Download datasets
- Step 2: Rename
- For Vancouver: Unzip crimedata_csv_all_years.zip, rename crimedata_csv_all_years.csv to vancouvercrime.csv
- For Denver: Rename crime.csv to denvercrime.csv
- Step 3: Place 2 CSV datasets to ./Data
- Step 4: Run Date Event Preprocess.ipynb, build date dimension and event dimension
- Step 5: Run files
- Run Denver Preprocess.ipynb for dimensions related to Denver
- Run Vancouver Preprocess.ipynb for dimensions related to Vancouver
- Step 6: Run Merge Process.ipynb to combine Vancouver data with Denver data, and generate crime dimension, location dimension and fact table
- Step 7: All resulting data in ./Dimensions, run script in ./sql if needed
- Data for population
- Data for CSI weight
- CSI Calculation fomula