Skip to content

Commit

Permalink
Initialise DVC
Browse files Browse the repository at this point in the history
We now use DVC to track our /data directory. All files are versioned
and cached using DVC, with remote storage on S3.

closes #161
  • Loading branch information
doogyb committed Sep 23, 2022
1 parent f13117b commit 760f75d
Show file tree
Hide file tree
Showing 128 changed files with 17 additions and 219,927 deletions.
3 changes: 3 additions & 0 deletions .dvc/.gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
/config.local
/tmp
/cache
4 changes: 4 additions & 0 deletions .dvc/config
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
[core]
remote = remote
['remote "remote"']
url = s3://dap-dv-backends
3 changes: 3 additions & 0 deletions .dvcignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# Add patterns of files dvc should ignore, which could improve
# the performance. Learn more at
# https://dvc.org/doc/user-guide/dvcignore
3 changes: 2 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -6,4 +6,5 @@ node_modules
**/secrets/**
src/bin/spotlight/nginx.conf
src/servers/**/terraform
src/servers/**/nginx.conf
src/servers/**/nginx.conf
/data
5 changes: 5 additions & 0 deletions data.dvc
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
outs:
- md5: 2961519e5dd56934effbdacf079132b3.dir
size: 28522654
nfiles: 125
path: data
101 changes: 0 additions & 101 deletions data/ai_in_genomics/inputs/ai_genomics_cb_samples.csv

This file was deleted.

821 changes: 0 additions & 821 deletions data/ai_in_genomics/inputs/ai_genomics_gtr_samples.csv

This file was deleted.

167 changes: 0 additions & 167 deletions data/ai_in_genomics/inputs/ai_genomics_openalex_samples.csv

This file was deleted.

101 changes: 0 additions & 101 deletions data/ai_in_genomics/inputs/ai_genomics_patent_samples.csv

This file was deleted.

101 changes: 0 additions & 101 deletions data/ai_in_genomics/outputs/ai_genomics_cb_samples.csv

This file was deleted.

821 changes: 0 additions & 821 deletions data/ai_in_genomics/outputs/ai_genomics_gtr_samples.csv

This file was deleted.

167 changes: 0 additions & 167 deletions data/ai_in_genomics/outputs/ai_genomics_openalex_samples.csv

This file was deleted.

101 changes: 0 additions & 101 deletions data/ai_in_genomics/outputs/ai_genomics_patent_samples.csv

This file was deleted.

27,340 changes: 0 additions & 27,340 deletions data/ai_map/inputs/ai_map_orgs_places.json

This file was deleted.

46,728 changes: 0 additions & 46,728 deletions data/ai_map/outputs/ai_map_orgs_places_populated.json

This file was deleted.

534 changes: 0 additions & 534 deletions data/ai_map/outputs/ai_map_orgs_places_populated_subset.json

This file was deleted.

73,566 changes: 0 additions & 73,566 deletions data/ai_map/outputs/entity_details.json

This file was deleted.

Loading

0 comments on commit 760f75d

Please sign in to comment.