Skip to content

Commit

Permalink
Simplify instruction
Browse files Browse the repository at this point in the history
Since the ingest.sh grabs data from GCS location for course
  • Loading branch information
lakshmanok authored Dec 19, 2022
1 parent d3fd309 commit e20c55d
Showing 1 changed file with 1 addition and 5 deletions.
6 changes: 1 addition & 5 deletions 02_ingest/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,16 +4,12 @@
* Go to the Storage section of the GCP web console and create a new bucket

### Populate your bucket with the data you will need for the book
The simplest way to get the files you need is to copy it from my bucket:

* Open CloudShell and git clone this repo:
```
git clone https://github.com/GoogleCloudPlatform/data-science-on-gcp
```
* Go to the 02_ingest folder of the repo
* Run the program ./ingest_from_crsbucket.sh and specify your bucket name.

Alternately, you can ingest from the original source of the data and carry out the cleanup steps as described in the text:
* Go to the 02_ingest folder of the repo
* Edit ./ingest.sh to reflect the years you want to process (at minimum, you need 2015)
* Execute ./ingest.sh bucketname

Expand Down

0 comments on commit e20c55d

Please sign in to comment.