Formulas in "Quantizing Weights & Activations for Inference" do not match (video vs notebook)

finnfalter · September 9, 2024, 8:02pm

I do not understand the following formula given in Jupyter:

dequantized_weight = q_w.to(torch.float32) * s_w + z_w

What worries me even more is that the summary given in the video (at 1:03) seems to show a different formula:

Given r = s * (q - z) from the first chapter, I would rather stick with the video. In other words: doesn’t the code in the notebook show the wrong sign for z_w, and aren’t the parentheses missing in the notebook?
Is there an error in the Jupyter notebook?

jagdish007 · March 5, 2025, 4:46am

Good catch. I think the formula in the video is correct, as it matches both the first chapter and the code in the “Quantization and Dequantization of Tensor” section.
I believe that in the code section you are referring to, it doesn’t matter whether z_w is added or subtracted, because it is zero due to the nature of symmetric quantization. With z_w = 0, both s_w * (q_w - z_w) and q_w * s_w + z_w reduce to q_w * s_w.
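
To make this concrete, here is a minimal sketch in plain Python (not the notebook's actual code) comparing the two forms. The function names `dequantize_video` and `dequantize_notebook`, the sample values in `q`, the scale `s`, and the zero point `z = 10` are all made up for illustration. It suggests that the two expressions agree when the zero point is 0 and disagree otherwise, so the notebook line would presumably only be safe in the symmetric case.

```python
def dequantize_video(q, s, z):
    # Form from the video / first chapter: r = s * (q - z)
    return [s * (qi - z) for qi in q]


def dequantize_notebook(q, s, z):
    # Form as written in the notebook line: q * s + z
    return [qi * s + z for qi in q]


# Hypothetical int8-style quantized values and scale (illustrative only).
q = [-127, -5, 0, 64, 127]
s = 0.02

# Symmetric quantization: zero point is 0, so both forms reduce to q * s.
sym_video = dequantize_video(q, s, 0)
sym_notebook = dequantize_notebook(q, s, 0)
forms_agree_symmetric = sym_video == sym_notebook

# Asymmetric quantization: a non-zero zero point (assumed value) makes
# the two forms produce different results.
z = 10
asym_video = dequantize_video(q, s, z)
asym_notebook = dequantize_notebook(q, s, z)
forms_differ_asymmetric = asym_video != asym_notebook

print("symmetric, forms agree:", forms_agree_symmetric)
print("asymmetric, forms differ:", forms_differ_asymmetric)
```

If the notebook were ever reused with a non-zero zero point, the `s * (q - z)` form would be the one consistent with the first chapter.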
