This repository contains the scripts that handle the data wrangling for the Penny Harvest Organization Finder. The data is provided in Excel format and transformed into individual markdown files, ready to be used by Jekyll.
This script takes care of the first clean-up of the Excel data, its transformation to CSV and calls geocode.py
that takes care of the geocoding.
Even though the Organization Finder only requires data about the organizations, this first script processes the data on schools and grants as well.
$ bash ph-xls2csv.sh -i [file-name]
For example: bash ph-processor.sh -i Map_data.xls
- csvkit
- python
- wget
- unzip
- sed
This python script generates Penny Harvest organization profiles in Markdown format from the CSV file prepared by ph-xls2csv.sh
.
python ph-xls2csv.py [folder_out] [file_in]
For example: python ph-xls2csv.py markdown ORGS.csv
Processes the grant information per organization and stores it in separate CSV files in the folder: csv-data/orgs
python ph-grants-org-py [file_in]
For example: python ph-grants-org-py GRANTS.csv
Daniel da Silva - Github
Ricardo Mestre - Github
Olaf Veerman - Github