I am curious how the classification setting works. You mention in your paper that you use the cross-entropy loss. Do you use a softmax as the final layer? If so, how do you propagate the variance through the softmax?

Thanks for the interest, and good question! For MNIST classification we use an elementwise sigmoid followed by cross entropy rather than a softmax. The output mean of the sigmoid takes both the mean and the variance from the previous layer (the pre-activation linear layer) as input; this is how both moments can affect the final prediction. There have also been follow-ups to NPN (e.g., work from ICLR 2018, if I remember correctly) that extend it with a softmax layer.
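For concreteness, here is a minimal NumPy sketch of that idea. It uses the standard probit approximation E[sigmoid(a)] ≈ sigmoid(mu / sqrt(1 + (pi/8)·var)) for a Gaussian pre-activation a ~ N(mu, var); the exact closed-form moments in the NPN paper may differ, and the function names below are made up for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def expected_sigmoid(mu, var):
    """Approximate E[sigmoid(a)] for a ~ N(mu, var) with the
    standard probit approximation (zeta^2 = pi / 8). This may not
    be the exact formula used in the NPN paper."""
    zeta_sq = np.pi / 8.0
    return sigmoid(mu / np.sqrt(1.0 + zeta_sq * var))

def elementwise_sigmoid_cross_entropy(mu, var, y):
    """Cross entropy between one-hot labels y and the *mean* of the
    elementwise sigmoid output. The pre-activation variance enters
    the prediction through the probit correction above."""
    p = expected_sigmoid(mu, var)
    eps = 1e-12  # numerical safety for the logs
    return -np.sum(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

# Toy example: 10-way MNIST-style output with Gaussian pre-activations.
rng = np.random.default_rng(0)
mu = rng.normal(size=10)          # pre-activation means from the linear layer
var = rng.uniform(0.1, 1.0, 10)   # pre-activation variances from the linear layer
y = np.eye(10)[3]                 # one-hot label for class 3
print(elementwise_sigmoid_cross_entropy(mu, var, y))
```

Note how a larger pre-activation variance pulls the predicted probability toward 0.5, so more uncertain units make less confident contributions to the cross entropy.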