Elucidating the Exposure Bias in Diffusion Models

Publication date

2024-06

Authors

Ning, MangORCID 0000-0001-6037-1661ISNI 0000000524306124
Li, Mingxiao
Su, Jianlin
Salah, Albert AliORCID 0000-0001-6342-428XISNI 0000000091147032
Önal Ertuğrul, ItirISNI 0000000512566076

Editors

Advisors

Supervisors

Document Type

Contribution to conference

License

Abstract

Diffusion models have demonstrated impressive generative capabilities, but their exposure bias problem, described as the input mismatch between training and sampling, lacks in-depth exploration. In this paper, we investigate the exposure bias problem in diffusion models by first analytically modelling the sampling distribution, based on which we then attribute the prediction error at each sampling step as the root cause of the exposure bias issue. Furthermore, we discuss potential solutions to this issue and propose an intuitive metric for it. Along with the elucidation of exposure bias, we propose a simple, yet effective, training-free method called Epsilon Scaling to alleviate the exposure bias. We show that Epsilon Scaling explicitly moves the sampling trajectory closer to the vector field learned in the training phase by scaling down the network output, mitigating the input mismatch between training and sampling. Experiments on various diffusion frameworks (ADM, DDIM, EDM, LDM, DiT, PFGM++) verify the effectiveness of our method. Remarkably, our ADM-ES, as a state-of-the-art stochastic sampler, obtains 2.17 FID on CIFAR-10 under 100-step unconditional generation. The code is at https://github.com/forever208/ADM-ES.

Keywords

Language and Linguistics, Computer Science Applications, Education, Linguistics and Language

Citation

Ning, M, Li, M, Su, J, Salah, A A & Ertugrul, I O 2024, 'Elucidating the Exposure Bias in Diffusion Models', Paper presented at 12th International Conference on Learning Representations, ICLR 2024, Hybrid, Vienna, Austria, 7/05/24 - 11/05/24. https://doi.org/10.48550/arXiv.2308.15321, conference