-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Direct Transfer results #1
Comments
@liangbh6 @Simon4Yan Which framework you use? Caffe or pytorch? |
@Simon4john pytorch. So, the reason is the differences between pytorch and caffe? If I want to reproduce your results using pytorch, do you have some suggestion, about the learning rate, data augmentation, or testing tricks like normalization? Actually I have tried to normalized the features but it helped a little. |
Thanks for your question. And we conduct experiment to see that difference between pytroch and caffe, we find the BN leads to the this performance gap. I will give the experiment details about it after I come back to school. |
With the help of Houjing Huang (his homepage is here ), I find the performance gap on pytorch and caffe is caused by BN. I give huang's experiments here: whether you set BN layer to We train models using pytorch, and the settings are the same with caffe. (1) When setting BN layer to
(2) When setting BN layer to
Therefore, we believe that BN is the key factor to the performance gap between caffe and pytorch. |
@Simon4Yan Excellent work! Thanks a lot. |
The text was updated successfully, but these errors were encountered: