Commit

wk2
hantswilliams committed Aug 31, 2022
1 parent ad1b1ec commit 28d968c
Showing 14 changed files with 191,143 additions and 3 deletions.
2 changes: 2 additions & 0 deletions .gitignore
@@ -8,3 +8,5 @@
# ignore all .json files inside of ingestion/bigquery/
ingestion/example_files/bigquery/**/*.json
ingestion/example_files/bigquery/*.json


20 changes: 20 additions & 0 deletions ingestion/example_files/json/Encounter-example-home.json
@@ -27,6 +27,26 @@
     "reference": "Patient/example"
   },
   "participant": [
+    {
+      "period": {
+        "start": "2015-01-17T16:00:00+10:00",
+        "end": "2015-01-17T16:30:00+10:00"
+      },
+      "actor": {
+        "reference": "Practitioner/example",
+        "display": "Dr Adam Careful"
+      }
+    },
+    {
+      "period": {
+        "start": "2015-01-17T16:00:00+10:00",
+        "end": "2015-01-17T16:30:00+10:00"
+      },
+      "actor": {
+        "reference": "Practitioner/example",
+        "display": "Dr Adam Careful"
+      }
+    },
     {
       "period": {
         "start": "2015-01-17T16:00:00+10:00",
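The participant entries in this file share the same period and actor, so it may help to check what the `participant` array contains after the change. The following is a minimal sketch using only the standard library. It assumes a trimmed, hypothetical copy of the resource that reproduces only the fields visible in the diff, and `participant_rows` is an illustrative helper name, not something from the repository.

```python
import json
from datetime import datetime

# Trimmed, hypothetical stand-in for Encounter-example-home.json.
# Only the fields visible in the diff are reproduced here.
ENCOUNTER_JSON = """
{
  "participant": [
    {"period": {"start": "2015-01-17T16:00:00+10:00", "end": "2015-01-17T16:30:00+10:00"},
     "actor": {"reference": "Practitioner/example", "display": "Dr Adam Careful"}},
    {"period": {"start": "2015-01-17T16:00:00+10:00", "end": "2015-01-17T16:30:00+10:00"},
     "actor": {"reference": "Practitioner/example", "display": "Dr Adam Careful"}},
    {"period": {"start": "2015-01-17T16:00:00+10:00", "end": "2015-01-17T16:30:00+10:00"},
     "actor": {"reference": "Practitioner/example", "display": "Dr Adam Careful"}}
  ]
}
"""


def participant_rows(encounter):
    """Flatten each participant into one dict with the actor and the period length."""
    rows = []
    for entry in encounter.get("participant", []):
        period = entry["period"]
        actor = entry["actor"]
        start = datetime.fromisoformat(period["start"])
        end = datetime.fromisoformat(period["end"])
        rows.append({
            "actor": actor.get("display"),
            "reference": actor.get("reference"),
            "minutes": (end - start).total_seconds() / 60,
        })
    return rows


rows = participant_rows(json.loads(ENCOUNTER_JSON))

# Identical entries collapse to a single distinct row.
distinct = {tuple(sorted(row.items())) for row in rows}
print(len(rows), "participants,", len(distinct), "distinct")
```

Comparing the total count with the distinct count is one quick way to spot repeated entries before loading the file into a table.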
Binary file modified ingestion/example_files/xls_xlsx/AHQDataFile2016.xls
Binary file not shown.
7 changes: 4 additions & 3 deletions ingestion/ingestion.py
@@ -66,7 +66,7 @@
 # printing number of pages in pdf file
 print(pdfReader.numPages)
 # creating a page object
-pageObj = pdfReader.getPage(0)
+pageObj = pdfReader.getPage(1)
 # extracting text from page
 print(pageObj.extractText())
 
@@ -100,11 +100,11 @@
 
 ### WEB REQUESTS
 ## get request for https://jsonplaceholder.typicode.com/posts // https://jsonplaceholder.typicode.com/
-r = requests.get('https://jsonplaceholder.typicode.com/posts') ## get request
+r = requests.get('https://jsonplaceholder.typicode.com/users') ## get request
 ## load as json
 r_data = r.json() ## load request as json
 ## load into dataframe
-df = pd.read_json(r_data) ## read json file
+df = pd.read_json(r_data, orient='records') ## read json file
 
 
 ### SQLALCHEMY
@@ -158,3 +158,4 @@
 results = query_job.result() ## get results
 ## putresults into dataframe
 df = pd.DataFrame(results.to_dataframe()) ## put results into dataframe
+df.columns
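A note on the web-request hunk in `ingestion/ingestion.py`: `r.json()` already returns parsed Python objects (for this endpoint, a list of dicts), while `pd.read_json` expects a JSON string, path, or file-like object, so passing `r_data` to it is likely to raise an error regardless of `orient`. A minimal sketch of one alternative, assuming pandas is installed and using a small made-up sample in place of the live response (the field values below are hypothetical):

```python
import pandas as pd

# Hypothetical records shaped like a parsed list-of-dicts response;
# the values are invented for illustration only.
records = [
    {"id": 1, "name": "Ann Example", "address": {"city": "Springfield"}},
    {"id": 2, "name": "Bob Example", "address": {"city": "Shelbyville"}},
]

# A list of dicts maps directly onto rows; nested dicts stay as objects.
df = pd.DataFrame(records)

# json_normalize flattens nested objects into dotted column names.
flat = pd.json_normalize(records)

print(list(df.columns))
print(list(flat.columns))
```

With a real response, `records` would be the value returned by `r.json()`. `pd.json_normalize` is worth considering when the payload contains nested objects that should become separate columns.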
Binary file not shown.

Large diffs are not rendered by default.

65,438 changes: 65,438 additions & 0 deletions transformation/dataFiles/raw/131624096_mount-sinai-hospital_standardcharges.csv

Large diffs are not rendered by default.

12,041 changes: 12,041 additions & 0 deletions transformation/dataFiles/raw/ECMC_Charge_Master_2022.csv

Large diffs are not rendered by default.
