
Custom Weight Initialization #24

Open
glenn-jocher opened this issue Apr 10, 2020 · 3 comments

@glenn-jocher
Contributor

I noticed you use the following code for custom weight initialization:

```python
def _initialize_weights(self):
    for m in self.modules():
        if isinstance(m, nn.Conv2d):
            # He (Kaiming) normal init scaled for ReLU, based on fan_out
            nn.init.kaiming_normal_(m.weight, mode='fan_out', nonlinearity='relu')
        elif isinstance(m, nn.BatchNorm2d):
            # Start BatchNorm as the identity affine transform: scale 1, shift 0
            m.weight.data.fill_(1)
            m.bias.data.zero_()
```

I've not seen this before. Is there a reason behind this specific strategy? Do you know what effect it has on training, and have you compared it with the PyTorch default weight initialization? Thank you!

@iamhankai
Owner

iamhankai commented Apr 11, 2020

`kaiming_normal_` is a commonly used initialization strategy.
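
For anyone curious, here is a minimal sketch (not taken from this repo) of what `kaiming_normal_` with `mode='fan_out'` and `nonlinearity='relu'` actually does to a `Conv2d` weight: it draws from a zero-mean normal with std `sqrt(2 / fan_out)`, where `fan_out = out_channels * kernel_h * kernel_w`.

```python
import math
import torch.nn as nn

# Sketch (not from this repo): kaiming_normal_ with mode='fan_out' and
# nonlinearity='relu' draws Conv2d weights from N(0, std**2) with
# std = sqrt(2 / fan_out), where fan_out = out_channels * kH * kW.
conv = nn.Conv2d(16, 32, kernel_size=3)
nn.init.kaiming_normal_(conv.weight, mode='fan_out', nonlinearity='relu')

fan_out = 32 * 3 * 3                     # out_channels * kH * kW = 288
expected_std = math.sqrt(2.0 / fan_out)  # ReLU gain sqrt(2) over sqrt(fan_out)
print(f"expected std {expected_std:.4f}, empirical std {conv.weight.std().item():.4f}")
```

Per He et al. (2015), `fan_out` mode keeps the variance of the backward-pass gradients stable through each layer, whereas `fan_in` targets the forward activations.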

@glenn-jocher
Contributor Author

@iamhankai thank you! Do you know what the default PyTorch weight-init strategy is?

I suppose using the same strategy on both makes for easier comparison with the TF version of ghostnet?
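
For reference, and hedging on the PyTorch version: around PyTorch 1.x, `_ConvNd.reset_parameters` applies `kaiming_uniform_(weight, a=math.sqrt(5))`, a fan_in-based *uniform* init, while `BatchNorm2d` already defaults to weight = 1 and bias = 0, so the custom BN init above matches the default. A small comparison sketch:

```python
import torch.nn as nn

# Sketch comparing the two schemes; the default may change between PyTorch
# versions, but around 1.x _ConvNd.reset_parameters calls
# kaiming_uniform_(weight, a=math.sqrt(5)), a fan_in-based uniform init.
default_conv = nn.Conv2d(16, 32, kernel_size=3)  # default init runs in __init__

custom_conv = nn.Conv2d(16, 32, kernel_size=3)
nn.init.kaiming_normal_(custom_conv.weight, mode='fan_out', nonlinearity='relu')

# The custom init ends up with a larger std (sqrt(2/288) ~ 0.083 here) than
# the default uniform init (~ 0.048 for fan_in = 144).
print(f"default std: {default_conv.weight.std().item():.4f}")
print(f"custom  std: {custom_conv.weight.std().item():.4f}")
```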

@iamhankai
Owner

@glenn-jocher The TF version of ghostnet also uses Kaiming normal initialization.
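
A rough TF2/Keras analogue, assuming the Keras API rather than the actual ghostnet TF code: `VarianceScaling` with `scale=2.0`, `mode='fan_out'`, and an untruncated normal distribution. Note that `tf.keras.initializers.he_normal` is close but uses fan_in and a truncated normal, so it is not an exact match.

```python
import tensorflow as tf

# Assumed TF2/Keras analogue (the actual ghostnet TF code may differ):
# VarianceScaling(scale=2.0, mode='fan_out', distribution='untruncated_normal')
# matches PyTorch's kaiming_normal_(mode='fan_out', nonlinearity='relu').
he_fan_out = tf.keras.initializers.VarianceScaling(
    scale=2.0, mode='fan_out', distribution='untruncated_normal')

conv = tf.keras.layers.Conv2D(32, kernel_size=3, kernel_initializer=he_fan_out)
```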
