New York City revolutionized the way cities were policed with their COMPSTAT approach. The police department releases partial reports of the data that they collect, but this information is hidden in PDF documents that disappear after a week. This scraper downloads all of the reports on the NYPD's crime statistics page and uploads them to an S3 bucket.
This scraper currently runs every day and uploads the results here:
https://nyc-compstat-api.s3.amazonaws.com/
- Install the required gems.
$ bundle
-
Run the tests.
$ rake
-
Run the main archiver.
$ ruby application.rb