Why training is to predict noise compared to clean image directly, but generating is step by step?

zxcheng95 · December 28, 2024, 7:18am

Training here is to predict the noise added in a certain step-t.

So for generation, to get the image (step-0), you need to revert from the random Gaussian distribution back, you need to incrementally call the network (predict the noise at certain step-t) and remove this predicted noise. And here you need to be cautious to not just minus noise but strictly follow the mathematic equation to mimic the reverse process of adding the noise.

Topic		Replies	Views
Question on noise prediction How Diffusion Models Work	5	353	June 9, 2023
Input "x" to Net during training : is it the original image(0) or noised image(t)? How Diffusion Models Work	0	91	February 22, 2024
A basic question How Diffusion Models Work	3	191	September 28, 2023
Question_Regarding_Training_process How Diffusion Models Work	0	14	December 18, 2024
Reasons for adding extra noise to the training data How Diffusion Models Work	3	398	June 9, 2023

Why training is to predict noise compared to clean image directly, but generating is step by step?

Related topics