Develop/improve code to process B120 WSI forecast PDFs #50
Labels
code improvement
Code works, but needs modifications
enhancement
New feature or request
help wanted
Extra attention is needed
I hand-created a dataset of WSI forecasts from the Feb 1 reports, starting with the raw data scraped from the PDFs using PDFTables (a pay-per-use API that works). We need code to scrape the PDFs and munge the data into a structure suitable for merging with the other supply surrogates.
- input.pdf is the source PDF.
- input.zip contains the raw scraped csv file.
- output.zip contains the desired output csv. It should only include data from the current report month onward. For example, the output from the February report should only include data for February through September, the output from the March report should only include data for March through September, etc. (The report date is in row 2 of the csv.)
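The month-filtering rule above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual layout of the attached files: it assumes row 2 of the scraped csv contains a full English month name somewhere in its text, and that the munged records carry a `month` field holding a full month name. The function names (`report_month_from_rows`, `months_to_keep`, `filter_records`) are hypothetical.

```python
import csv
import io
import re

MONTHS = [
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
]
LAST_MONTH = 9  # forecasts run through September


def report_month_from_rows(rows):
    """Return the month number (1-12) named in row 2 of the scraped csv.

    Assumption: row 2 holds the report date with a full month name.
    """
    text = " ".join(rows[1])  # row 2 is index 1
    pattern = r"\b(" + "|".join(MONTHS) + r")\b"
    match = re.search(pattern, text, flags=re.IGNORECASE)
    if match is None:
        raise ValueError("no month name found in row 2")
    return MONTHS.index(match.group(1).capitalize()) + 1


def months_to_keep(report_month):
    """List month names from the report month through September."""
    if not 1 <= report_month <= LAST_MONTH:
        raise ValueError("report month must be January through September")
    return MONTHS[report_month - 1:LAST_MONTH]


def filter_records(records, report_month):
    """Keep only records whose 'month' is the report month or later."""
    keep = set(months_to_keep(report_month))
    return [r for r in records if r["month"] in keep]


if __name__ == "__main__":
    # Made-up two-row header, standing in for the real scraped csv.
    raw = "WSI forecast\nReport date: February 1\n"
    rows = list(csv.reader(io.StringIO(raw)))
    month = report_month_from_rows(rows)
    records = [{"month": m} for m in MONTHS]  # placeholder records
    for record in filter_records(records, month):
        print(record["month"])
```

If the real files use abbreviated month names or a numeric date in row 2, the parsing step would need to change; the slicing in `months_to_keep` should still apply.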