
Error on the fifth shard #3

Open
clemley opened this issue Dec 11, 2020 · 7 comments
clemley commented Dec 11, 2020

On the first request to the fifth shard there appears to be an index error: the run fails on that shard, while all the other shards complete properly. Is there a way to fix this?


huxi2 commented Jul 2, 2021

I found that the number of records in purchase2_train.npy generated by running init.sh was 249215, which differs from the count in the datasetfile.
I fixed this by modifying this line in prepare_data.py:
X_train, X_test, y_train, y_test = train_test_split(data, label, test_size=0.1)

Hope that helps
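The competing numbers in this thread can be reconciled with a quick sanity check of the split sizes (a sketch; the total record count of 311519 is inferred from the nb_train/nb_test values quoted below, and the rounding rule assumed here is sklearn's when only test_size is given: the test split is rounded up and the train split gets the remainder):

```python
import math

# Total Purchase records implied by the thread: 249215 + 62304
n_total = 249215 + 62304  # 311519

def split_sizes(n, test_size):
    # train_test_split with only test_size set rounds the test
    # split up and assigns the remaining samples to the train split
    n_test = math.ceil(test_size * n)
    return n - n_test, n_test

print(split_sizes(n_total, 0.2))  # (249215, 62304)
print(split_sizes(n_total, 0.1))  # (280367, 31152)
```

So test_size=0.2 produces the 249215-record train file mentioned above, while test_size=0.1 produces 280367 train records, which is exactly the array size appearing in the IndexError reported later in this thread.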

@swagStar123-code

Even after changing test_size from 0.2 to 0.1 as proposed, the problem above persists.


KatieHYT commented Jun 8, 2023

Same here.
Even after changing test_size from 0.2 to 0.1, I still get the index error: IndexError: index 280367 is out of bounds for axis 0 with size 280367.

Any suggestions so far?

@nimeshagrawal

Any solution found regarding this issue?

@nimeshagrawal

The problem is in datasets/purchase/datasetfile: the train and test sample sizes are hard-coded there. prepare_data.py splits with test_size = 0.2, but "datasetfile" lists sample sizes corresponding to test_size = 0.1. So change the train and test sample sizes in "datasetfile" (replace with nb_train = 249215 and nb_test = 62304).
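Assuming datasetfile is a JSON file with nb_train/nb_test fields (an assumption based on how this thread describes it; keep whatever other fields the repo's file already contains), the edit would look something like:

```json
{
  "nb_train": 249215,
  "nb_test": 62304
}
```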

@scottshufe

Thanks for your solution. It solved my problem perfectly.

> The problem is there in the datasets/purchase/datasetfile. They have hard coded train and test sample size. The prepare_data.py splits according to test_size = 0.2, but "datasetfile" has sample sizes according to test_size = 0.1. Hence, change train & test sample size in "datasetfile". (Replace with nb_train = 249215 and nb_test = 62304)

@GM-git-dotcom

> The problem is there in the datasets/purchase/datasetfile. They have hard coded train and test sample size. The prepare_data.py splits according to test_size = 0.2, but "datasetfile" has sample sizes according to test_size = 0.1. Hence, change train & test sample size in "datasetfile". (Replace with nb_train = 249215 and nb_test = 62304)

This. And remember to re-run python prepare_data.py after making this change.
