Tableslurp performance expectation #94

Open
pdehlke opened this issue Mar 15, 2018 · 0 comments

Comments

pdehlke commented Mar 15, 2018

I regularly tell people, after 25 years in data centers, that nobody cares about backups. The only thing people actually care about is restores.

To that end, I have been testing tablesnap with a small keyspace. The current test dataset is ~28 GB with a fairly high rate of churn; I have been running tablesnap for two weeks, with tablechop set to prune data older than 7 days. The S3 bucket currently holds about 145 GB.

I invoked tableslurp like so:

tableslurp -k <Key> -s <secret> --aws-region us-west-2 -r -n ip-10-14-193-47 my-cassandra-backups-us-west-2 /data/cassandra/data/test_keyspace ./cassandra/data/test_keyspace

After 6 hours, tableslurp had created directories for 5 of my 179 tables and had not yet downloaded any files. I killed that process, restarted it with -t 50 to get 50 threads, and went to bed.

12 hours later, tableslurp had created directories for 33 of the 179 tables and had still downloaded exactly zero files.
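
For a sense of scale, here is a rough extrapolation of those two runs. It is only a back-of-the-envelope sketch: it assumes directory creation continues at the average rate observed so far, which may not hold, and it says nothing about the time needed to actually download files. The function name is illustrative and not part of tablesnap.

```python
def hours_to_create_all_dirs(tables_done: int, tables_total: int, hours_elapsed: float) -> float:
    """Extrapolate total hours to create every table directory.

    Assumes a constant rate equal to the average observed so far
    (an assumption, not a measured property of tableslurp).
    """
    if tables_done <= 0:
        raise ValueError("need at least one completed table to extrapolate")
    hours_per_table = hours_elapsed / tables_done
    return hours_per_table * tables_total


if __name__ == "__main__":
    # Default thread count: 5 of 179 table directories after 6 hours.
    default_run = hours_to_create_all_dirs(5, 179, 6)
    # With -t 50: 33 of 179 table directories after 12 hours.
    threaded_run = hours_to_create_all_dirs(33, 179, 12)
    print(f"default threads: about {default_run:.0f} hours for directories alone")
    print(f"-t 50:           about {threaded_run:.0f} hours for directories alone")
```

At those rates, creating the directories alone would take on the order of nine days with the default settings and close to three days with 50 threads, before any data is restored.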

Is this expected behavior? If not, what have I got wrong?
