Why does the cost function not hit zero for the adder when every pair is correct? #8

hippietrail · 2023-05-24T09:22:55Z

I'm trying to understand how the cost function works. I noticed in the adder, where all the numbers are discrete and have exact solutions, that the cost function is still wiggling and nonzero even when every pair of numbers adds to the exact correct solution.

If I understand correctly the loss is the mean of the squares of the differences between each actual and expected result. So I would expect that to hit zero. Any ideas what I'm missing?

rexim · 2023-05-24T14:43:12Z

Because the output bits do not have to be perfectly 0 or perfectly 1. We consider signal <=0.5 a zero and >0.5 a one. (Completely arbitrary choice).

hippietrail · 2023-05-25T04:59:46Z

Are we rounding them in the display?
I thought maybe we're taking a float and then doing int maths on it then I noticed that z is both a size_t and a float (-:

            size_t z = 0.0f;
            for (size_t i = 0; i < BITS; ++i) {
                size_t bit = MAT_AT(NN_OUTPUT(nn), 0, i) > 0.5;
                z = z|(bit<<i);
            }
            bool overflow = MAT_AT(NN_OUTPUT(nn), 0, BITS) > 0.5;

I wonder if it's ever possible for the cost to be lower for a wrong answer than a right one? Say if many are extremely close to 0.0 but at least one is > 0.5?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why does the cost function not hit zero for the adder when every pair is correct? #8

Why does the cost function not hit zero for the adder when every pair is correct? #8

hippietrail commented May 24, 2023

rexim commented May 24, 2023

hippietrail commented May 25, 2023

Why does the cost function not hit zero for the adder when every pair is correct? #8

Why does the cost function not hit zero for the adder when every pair is correct? #8

Comments

hippietrail commented May 24, 2023

rexim commented May 24, 2023

hippietrail commented May 25, 2023