Confusion regarding PSNR and differences in inference performance


So I am a little confused about the difference in inference results between the cloud and on device, as represented by PSNR.

I mean, presuming you are using the same fundamental types/bits in both cases (float, double, etc.), of course I might expect on-device inference to take longer-- but why would the results be different?

I’m not sure I’m getting that…

Hi @Nevermnd ,

The fundamental types may be the same, but several factors can lead to different performance and PSNR between cloud and on-device inference: quantization (lower precision that can reduce accuracy and PSNR), hardware differences (limited computational power and memory), memory constraints (which require smaller models or more aggressive optimizations, impacting accuracy), and software libraries (lighter inference engines).
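To make the quantization point concrete, here is a rough NumPy sketch (made-up data, not any particular toolchain) showing how simulating 8-bit quantization of a model output lowers its PSNR against the float32 reference:

```python
import numpy as np

def psnr(reference, test, max_val=1.0):
    """Peak signal-to-noise ratio in dB between two arrays."""
    mse = np.mean((reference - test) ** 2)
    return float("inf") if mse == 0 else 10 * np.log10(max_val ** 2 / mse)

rng = np.random.default_rng(0)
x = rng.random((64, 64)).astype(np.float32)  # stand-in for a model output in [0, 1]

# Simulate uniform 8-bit quantization (256 levels), similar in spirit to the
# post-training quantization an on-device runtime might apply.
scale = 1.0 / 255.0
x_q = (np.round(x / scale) * scale).astype(np.float32)  # quantize, then dequantize

print(psnr(x, x))    # identical outputs -> infinite PSNR
print(psnr(x, x_q))  # quantized output -> finite PSNR, lower than the reference
```

The model did not change architecturally, yet its output is no longer bit-identical, and PSNR quantifies exactly how far it drifted.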


@Alireza_Saei Again, my question was 'apples-to-apples'-- quantization is a totally different question, since that changes the underlying processing of the model-- and hardware and memory constraints can all be handled (e.g., with paging) via clever programming tricks.

As mentioned, I have no doubt in my mind that on a cell phone, compared to a desktop with an RTX 4090, you are going to take a huge performance hit-- meaning it would run really, really slowly. But technically the results shouldn't be different.

Further, this use of PSNR as a metric has me suspecting it is a 'cover' for the fact that if it is low, well, your 'accuracy' isn't so great.

But no one trying to sell/pitch hardware wants to say 'well, on device your accuracy went down', because anyone who knows English knows that word.

So I am left wondering: is there a real, legitimate technical reason we are now using this term as a measurement?

Hey there @Nevermnd ,

I think you are trying to say that both do the same thing, one just slowly and the other quickly, so the results must be the same! However, in practice, as I said, some factors can impact PSNR and accuracy because the two inference pipelines are not truly equal:

  1. On-device models often require optimizations to fit within limited resources, which can impact accuracy.
  2. Variations in hardware capabilities can lead to different numerical precision and performance, affecting results.
  3. Different inference engines and libraries (e.g., TFLite) use distinct algorithms and optimizations, leading to slight differences in output.

And PSNR helps us quantify these subtle differences in output quality!
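As a hypothetical illustration of point 2 (a NumPy sketch, not tied to any specific runtime or accelerator), you can compare a float32 "cloud" output against the same values round-tripped through float16, mimicking device hardware that stores activations in half precision:

```python
import numpy as np

def psnr(reference, test, max_val=1.0):
    """Peak signal-to-noise ratio in dB between two arrays."""
    mse = np.mean((reference - test) ** 2)
    return float("inf") if mse == 0 else 10 * np.log10(max_val ** 2 / mse)

rng = np.random.default_rng(0)
cloud_out = rng.random((32, 32)).astype(np.float32)  # hypothetical float32 result

# Round-tripping through float16 mimics an accelerator that keeps
# activations in half precision: the numbers change slightly.
device_out = cloud_out.astype(np.float16).astype(np.float32)

diff = np.max(np.abs(cloud_out - device_out))
print(diff)                         # small but nonzero
print(psnr(cloud_out, device_out))  # high but finite: close, not identical
```

So even with no quantization scheme applied at all, merely running the math at a different numerical precision produces outputs that are close but not bit-identical, which is exactly the kind of subtle gap PSNR is designed to measure.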