
I noticed that you didn't do data augmentation as in the original KDEEP paper. #2

Open
AberChiong opened this issue Sep 5, 2023 · 0 comments


@AberChiong

```python
pred[i] = _pred.mean().cpu().numpy()
```

Hello!

I'm refactoring your code as an important part of my master's project.

After I finished reimplementing the whole pipeline in PyTorch (with reference to your code), I ran a performance evaluation of the model. The result is:
```
{'RMSE': 1.5596018, 'Spearman Correlation': 0.6496260994698196, 'Pearson Correlation': 0.6565482435856405}
```

These numbers are far below the results reported in the original KDEEP paper, so I carefully reread the paper and your code, and I found the following issue:

Original paper: "We augment our data by rotating each subgrid 90 deg, providing 24 times our initial training size. This augmenting methodology is also used at prediction time and then averaged out, reducing variance."

Your code: I couldn't find any data augmentation anywhere, including in the validation/prediction step (though it may be that my programming skills are too weak and I simply overlooked that part of the code).

So I would like to ask: do you think data augmentation should be applied in this model? And what performance does the model achieve when trained in your environment?

Best wishes to you, I'm waiting for your reply. :)
