Develop/improve code to process B120 WSI forecast PDFs #50

Open
jyeazell opened this issue Feb 22, 2021 · 0 comments
Labels
  • code improvement: Code works, but needs modifications
  • enhancement: New feature or request
  • help wanted: Extra attention is needed

Comments

@jyeazell
Collaborator

I hand-created a dataset of WSI forecasts from the Feb 1 reports, starting with the raw data scraped from the PDFs using PDFTables (a pay-per-use API that works). We need code to scrape the PDFs and munge the data into a structure suitable for merging with the other supply surrogates.

  • input.pdf is the source PDF.
  • input.zip contains the raw scraped csv file.
  • output.zip contains the desired output csv. It should only include data from the current report month on. For example, the output from the February report should only include data for February through September, the output from the March report should only include data for March through September, etc. (The report date is in row 2 of the csv.)

input.pdf
input.zip
output.zip
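
To make the month-filtering rule above concrete, here is a minimal Python sketch. It is not the project's code and it does not reproduce the layout of the attached files: the function names, the `month` column name, and the assumption that row 2 of the raw csv contains a spelled-out or three-letter month name (for example "February 1, 2021") are all hypothetical and would need to be checked against input.zip and output.zip.

```python
import re

MONTHS = [
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
]
LAST_FORECAST_MONTH = 9  # September, per the rule described in the issue

# Map full names and three-letter abbreviations to month numbers.
_MONTH_LOOKUP = {}
for _number, _name in enumerate(MONTHS, start=1):
    _MONTH_LOOKUP[_name.lower()] = _number
    _MONTH_LOOKUP[_name[:3].lower()] = _number


def report_month_from_rows(rows):
    """Return the month number found in row 2 of the raw csv rows.

    ASSUMPTION: row 2 holds the report date with the month written as a
    word ("February" or "Feb"). Numeric dates are not handled here.
    """
    for cell in rows[1]:  # row 2 of the csv is index 1
        for word in re.findall(r"[A-Za-z]+", cell):
            number = _MONTH_LOOKUP.get(word.lower())
            if number is not None:
                return number
    raise ValueError("no month name found in row 2")


def months_to_keep(report_month):
    """Return month names from the report month through September."""
    if not 1 <= report_month <= LAST_FORECAST_MONTH:
        raise ValueError("report month must be between 1 and 9")
    return MONTHS[report_month - 1:LAST_FORECAST_MONTH]


def filter_records(records, report_month, month_key="month"):
    """Keep only records whose month is in the report's forecast window.

    ASSUMPTION: each record is a dict with a hypothetical `month` key
    holding a full month name.
    """
    keep = set(months_to_keep(report_month))
    return [record for record in records if record[month_key] in keep]
```

With rows read from the raw csv (for instance via `csv.reader`), `filter_records(records, report_month_from_rows(rows))` would drop everything before the report month and after September. Reports issued outside January through September raise an error in this sketch; how those should be handled is not stated in the issue.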

@jyeazell added the enhancement, help wanted, and code improvement labels on Feb 22, 2021