Fix the multi-output, dict-input, parameter counting and calculation overflow problem. #165

Open

wants to merge 5 commits into master
Conversation


@cainmagi cainmagi commented Feb 27, 2021

Update report

  1. Fix the parameter-count calculation when a layer has more than one output variable, covering both the sequence case and the dict case (mentioned in Cannot get the summary #162).
  2. Split multiple output variables across multiple lines in the report.
  3. Remove the trailing line break from summary_string().
  4. Enable the argument device to accept both str and torch.device.
  5. Fix a bug when the model requires batch_size to be a specific number (a usage sketch for items 4 and 5 follows this list).
  6. Fix a bug caused by multiple-input cases when dtypes=None.
  7. Add automatic text wrapping when a layer name is too long.
  8. Support counting all parameters instead of only weight and bias (a different solution to Fix parameter count #142 and the package does not count "torch.nn.parameter" #148).
  9. Drop np.sum/np.prod to fix the overflow problem when calculating the total size (mentioned in RuntimeWarning: overflow encountered in long_scalars #158).
  10. Fix the bug caused by layers with dict input values (mentioned in Cannot get the summary #162).
  11. Add docstrings.
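
A minimal usage sketch for items 4 and 5 (not taken from this PR). It assumes the updated summary() keeps the upstream signature summary(model, input_size, batch_size=-1, device=...) and that a positive batch_size is also used for the dummy input:

import torch
import torch.nn as nn
from torchsummary import summary

# A small stand-in model; any CPU model works here.
model = nn.Sequential(
    nn.Conv2d(1, 8, kernel_size=3, padding=1),
    nn.ReLU(),
)

# Item 4: device may be passed either as a string or as a torch.device.
summary(model, (1, 16, 16), device="cpu")
summary(model, (1, 16, 16), device=torch.device("cpu"))

# Item 5 (assumption): a model that only accepts a fixed batch size can be
# summarized by passing that size explicitly instead of the default -1.
summary(model, (1, 16, 16), batch_size=4, device="cpu")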

Example for verifying this update

The following code is not compatible with the base repository:

import torch
import torch.nn as nn
from torchsummary import summary

class VeryLongNameSimpleMultiConv(nn.Module):
    def __init__(self):
        super(VeryLongNameSimpleMultiConv, self).__init__()
        self.features_1 = nn.Sequential(
            nn.Conv2d(1, 1, kernel_size=3, stride=1, padding=1),
            nn.ReLU(),
        )
        self.features_2 = nn.Sequential(
            nn.Conv2d(1, 2, kernel_size=3, stride=1, padding=1),
            nn.ReLU(),
        )

    def forward(self, x):
        x1 = self.features_1(x)
        x2 = self.features_2(x)
        return x1, x2
    
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = VeryLongNameSimpleMultiConv().to(device)

summary(model, (1, 16, 16))

Now the output is:

----------------------------------------------------------------
        Layer (type)               Output Shape         Param #
================================================================
            Conv2d-1            [-1, 1, 16, 16]              10
              ReLU-2            [-1, 1, 16, 16]               0
            Conv2d-3            [-1, 2, 16, 16]              20
              ReLU-4            [-1, 2, 16, 16]               0
VeryLong...ltiConv-5            [-1, 1, 16, 16]               0
                                [-1, 2, 16, 16]
================================================================
Total params: 30
Trainable params: 30
Non-trainable params: 0
----------------------------------------------------------------
Input size (MB): 0.00
Forward/backward pass size (MB): 0.02
Params size (MB): 0.00
Estimated Total Size (MB): 0.02
----------------------------------------------------------------

cainmagi and others added 2 commits February 27, 2021 01:21
1. Fix the bug of parameter number calculation when there is more than one output variable, including both the sequence case and the dict case.
2. Split multiple output variables across multiple lines.
3. Remove the last line break of summary_string().
4. Enable argument "device" to accept both str and torch.device.
5. Fix a bug when the model requires "batch_size" to be a specific number.
6. Fix a bug caused by multiple-input cases when "dtypes=None".
7. Add text auto wrap when the layer name is too long.
8. Add docstring.
Support counting all parameters instead of `weight` and `bias`.
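
A minimal sketch (not part of the commit) of the kind of module this change targets: a parameter registered under a name other than weight or bias, which the base repository's per-attribute counting misses.

import torch
import torch.nn as nn
from torchsummary import summary

class ScaleLayer(nn.Module):
    """A layer whose only parameter is registered under a custom name."""
    def __init__(self):
        super(ScaleLayer, self).__init__()
        self.scale = nn.Parameter(torch.ones(1))

    def forward(self, x):
        return x * self.scale

# The base repository only inspects module.weight and module.bias, so this
# parameter is not counted; with this change it should appear in Total params.
summary(ScaleLayer(), (1, 16, 16), device="cpu")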
@cainmagi cainmagi changed the title Fix the multi-output problem. Fix the multi-output and parameter counting problem. Feb 28, 2021
Using numpy sum/prod to calculate the total size may cause an overflow problem. This modification drops numpy and uses the Python built-in methods to calculate the size.
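
For illustration only (not code from this commit): numpy's fixed-width integer scalars wrap around on overflow, which is the RuntimeWarning reported in #158, while Python integers have arbitrary precision.

import numpy as np

a, b = np.int64(2 ** 62), np.int64(4)
print(a * b)            # wraps around; numpy warns, e.g. "overflow encountered in long_scalars"
print(int(a) * int(b))  # 18446744073709551616, exact

# A pure-Python product in the spirit of this change (hypothetical helper):
def prod(values):
    total = 1
    for v in values:
        total *= v
    return total

print(prod([4, 3, 224, 224]))  # stays exact no matter how large the result grows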
@cainmagi cainmagi changed the title Fix the multi-output and parameter counting problem. Fix the multi-output, parameter counting and calculation overflow problem. Feb 28, 2021
Fix the bug caused by layers with dict input values.
@cainmagi cainmagi changed the title Fix the multi-output, parameter counting and calculation overflow problem. Fix the multi-output, dict-input, parameter counting and calculation overflow problem. Feb 28, 2021
Fix the data type of the output params_info from torch.tensor to int.