OH missing earliest years from PDF #561

stucka · 2023-09-14T15:29:30Z

The Ohio scraper has been rebuilt and most of the archives were consolidated into a single CSV for download.

However, the CSV that Big Local News had been hosting contained badly parsed data from the 2015 and 2016 PDFs, including a lot of junk characters. We could use someone to parse those two PDFs into CSV format so we can add them to our archival data.
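For anyone picking this up, here is a minimal sketch of the cleanup half of the job. It assumes the table rows have already been pulled out of the PDFs by some extraction tool (not shown, and not specified in this issue) as lists of strings. The helper names `clean_cell` and `rows_to_csv` are hypothetical and are not part of the existing scraper.

```python
import csv
import io
import unicodedata


def clean_cell(value: str) -> str:
    """Remove junk characters from one extracted cell.

    Drops the Unicode replacement character (U+FFFD) and anything in a
    Unicode "C" category (control, format, private use, unassigned),
    then collapses runs of whitespace into single spaces.
    """
    kept = []
    for ch in value:
        if ch == "\ufffd":
            continue
        if ch.isspace():
            # Normalize tabs, newlines, and form feeds to a plain space.
            kept.append(" ")
            continue
        if unicodedata.category(ch).startswith("C"):
            continue
        kept.append(ch)
    return " ".join("".join(kept).split())


def rows_to_csv(rows) -> str:
    """Clean every cell and return CSV text, skipping rows that end up empty."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for row in rows:
        cleaned = [clean_cell(cell) for cell in row]
        if any(cleaned):
            writer.writerow(cleaned)
    return buf.getvalue()
```

This only handles stray characters. Rows that were split or merged during extraction would still need to be checked by hand against the original PDFs.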

The original PDFs are included in the ZIP, as is the then-consolidated snapshot of the CSV:

https://storage.googleapis.com/bln-data-public/warn-layoffs/oh_2015-2022.zip

The current scraper is grabbing 2017-2022 from a CSV similar to the one in the ZIP file here, except that the 2015, 2016, and 2023 data have been purged from it.
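A rough sketch of that kind of purge, for illustration only. The column name passed as `date_column` is an assumption, since the CSV's actual headers are not listed in this issue, and `keep_years` is a hypothetical helper rather than the scraper's real code. It also assumes each date value contains a four-digit year.

```python
import csv
import io
import re

# Matches a standalone four-digit year such as 2017 inside a date string.
YEAR_RE = re.compile(r"(?<!\d)((?:19|20)\d{2})(?!\d)")


def keep_years(csv_text: str, date_column: str,
               first_year: int = 2017, last_year: int = 2022) -> str:
    """Return CSV text with only the rows whose date falls in the given year range."""
    reader = csv.DictReader(io.StringIO(csv_text))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        match = YEAR_RE.search(row.get(date_column) or "")
        # Rows with no recognizable year are dropped along with out-of-range years.
        if match and first_year <= int(match.group(1)) <= last_year:
            writer.writerow(row)
    return out.getvalue()
```

For example, `keep_years(text, "received_date")` would keep 2017 through 2022 if the date column were named `received_date`.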

stucka added the easy label (An easy task. These are great to start with.) Sep 14, 2023
