Compensation Sampling for Improved Convergence in Diffusion Models

Publication date

2025

Authors

Lu, HuiISNI 0000000524098097
Salah, Albert AliORCID 0000-0001-6342-428XISNI 0000000091147032
Poppe, R.W.ISNI 0000000389426288

Editors

Leonardis, Aleš
Ricci, Elisa
Roth, Stefan
Russakovsky, Olga
Sattler, Torsten
Varol, Gül

Advisors

Supervisors

Document Type

Part of book
Open Access logo

License

taverne

Abstract

Diffusion models achieve remarkable quality in image generation, but at a cost. Iterative denoising requires many time steps to produce high fidelity images. We argue that the denoising process is crucially limited by an accumulation of the reconstruction error due to an initial inaccurate reconstruction of the target data. This leads to lower quality outputs, and slower convergence. To address these issues, we propose compensation sampling to guide the generation towards the target domain. We introduce a compensation term, implemented as a U-Net, which adds negligible computation overhead during training. Our approach is flexible and we demonstrate its application in unconditional generation, face inpainting, and face de-occlusion on benchmark datasets CIFAR-10, CelebA, CelebA-HQ, FFHQ-256, and FSG. Our approach consistently yields state-of-the-art results in terms of image quality, while accelerating the denoising process to converge during training by up to an order of magnitude (Our code and models will be made publicly available upon acceptance of the paper.).

Keywords

Diffusion models, Image generation, Iterative denoising, Taverne, Theoretical Computer Science, General Computer Science

Citation

Lu, H, Salah, A A & Poppe, R 2025, Compensation Sampling for Improved Convergence in Diffusion Models. in A Leonardis, E Ricci, S Roth, O Russakovsky, T Sattler & G Varol (eds), Computer Vision – ECCV 2024 - 18th European Conference, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 15119 LNCS, Springer, pp. 183-201, 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29/09/24. https://doi.org/10.1007/978-3-031-73030-6_11, conference