I have a question about the training process of TS-CAM.
During training, you do not use the cls_token to conduct classification, but use the remaining patch tokens instead.
In this circumstance, can the first row of the attention weights (the attention for the cls_token) accurately reflect the relative importance of different patches for classification?
Looking forward to your reply!
Thanks!
Hi, thanks for your attention.
I think the attention for the cls_token can reflect the relative importance of different patches for classification. The reason is that the cls_token is updated in every transformer layer by a weighted sum of the token values, where the weights are the attention scores computed between the cls_token and the other patch tokens.
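To make this concrete, here is a minimal single-head self-attention sketch in numpy (not the actual TS-CAM code; all shapes and weight matrices are illustrative assumptions). It shows that row 0 of the attention matrix holds the cls_token's attention over every token, and that the cls_token's update is exactly a weighted sum of the value vectors with those weights, so they act as per-patch importance scores:

```python
import numpy as np

rng = np.random.default_rng(0)

num_patches, dim = 4, 8
# Token sequence: row 0 is the cls_token, rows 1..N are patch tokens.
tokens = rng.standard_normal((1 + num_patches, dim))

# Hypothetical projection matrices for queries, keys, and values.
w_q = rng.standard_normal((dim, dim))
w_k = rng.standard_normal((dim, dim))
w_v = rng.standard_normal((dim, dim))

q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v

# Scaled dot-product attention; each row is a softmax distribution.
scores = q @ k.T / np.sqrt(dim)                        # (5, 5)
attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
attn /= attn.sum(axis=-1, keepdims=True)

# Row 0, columns 1..N: how strongly the cls_token attends to each patch.
cls_attn_to_patches = attn[0, 1:]
print(cls_attn_to_patches)

# The updated cls_token is a weighted sum of value vectors with exactly
# these attention weights, which is why they serve as importance scores.
new_cls = attn[0] @ v
assert np.allclose(new_cls, (attn[0][:, None] * v).sum(axis=0))
```

Since each row of `attn` sums to 1, `cls_attn_to_patches` can be read directly as a (normalized, up to the self-attention term) distribution of importance over the patches.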