I’m losing 5 points because one of the test cases fails with an error in the 17th decimal place. I tried changing the sigmoid function definition, and the matrix multiplication in the forward propagation, but nothing seems to correct this one accuracy mistake.

I have received the same error; the last decimal is rounded differently.

We are doing floating point here, so there can be several ways to express a given series of operations that are mathematically equivalent but have different rounding behavior. Here’s one contrived example:

```
import numpy as np

np.random.seed(42)
m = 10000
a = np.random.randn(1, m)
z1 = np.sum(np.exp(a) / m)   # divide each element by m, then sum
z2 = 1 / m * np.sum(np.exp(a))   # sum first, then divide once
z3 = np.mean(np.exp(a))
print("z1 = {:0.17f}\nz2 = {:0.17f}\nz3 = {:0.17f}".format(z1, z2, z3))
assert(z1 == z2)
```

Running that produces:

```
z1 = 1.65292081707434368
z2 = 1.65292081707434346
z3 = 1.65292081707434346
---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
<ipython-input-11-3a94d76d7d93> in <module>
      6 z3 = np.mean(np.exp(a))
      7 print("z1 = {:0.17f}\nz2 = {:0.17f}\nz3 = {:0.17f}".format(z1, z2, z3))
----> 8 assert(z1 == z2)

AssertionError:
```

So the answers differ in the 16th decimal place, but notice that I needed to make m pretty large in order to see that effect. In 64-bit floating point the resolution is approximately 10^{-16} or 10^{-17}, so rounding errors start out at that scale, but they can accumulate if you are doing a serial computation that is not “stable”. A “stable” computation is one in which the rounding errors tend to cancel each other out rather than accumulate, and not all serial computations have that property. There is a field of mathematics called Numerical Analysis that studies this type of phenomenon, among others, and there are precise ways to analyze and characterize this kind of behavior.
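Here’s a sketch of that “stable vs. unstable” distinction, assuming NumPy is available. The `kahan_sum` function below is just an illustrative helper I wrote for this post (compensated/Kahan summation), not part of any library:

```
import math
import numpy as np

def kahan_sum(values):
    # Compensated (Kahan) summation: carries a running correction term
    # so rounding errors largely cancel instead of accumulating.
    total = 0.0
    c = 0.0  # compensation for lost low-order bits
    for v in values:
        y = v - c
        t = total + y
        c = (t - total) - y
        total = t
    return total

rng = np.random.default_rng(0)
x = rng.standard_normal(1_000_000)

# Plain left-to-right accumulation: every addition rounds,
# and those per-step errors can pile up over a long series.
naive = 0.0
for v in x:
    naive += v

print("machine epsilon:", np.finfo(np.float64).eps)  # about 2.22e-16
print("naive:", format(naive, ".17f"))
print("kahan:", format(kahan_sum(x), ".17f"))
print("fsum :", format(math.fsum(x), ".17f"))  # effectively exact reference
```

`math.fsum` tracks every intermediate rounding error and is effectively exact for double-precision inputs, so it makes a good reference: the compensated sum typically agrees with it to the last bit or two, while the naive loop drifts further away as the series gets longer.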

I am not a student or mentor for M4ML, so I’m not familiar with this assignment. None of the M4ML mentors have responded yet, but all I can suggest is to examine the algorithm and consider whether you can think of different equivalent ways to perform the required operations.

Actually, here’s an older thread with perhaps a simpler example of rounding differences and a bit more explanation.