Compensation Sampling for Improved Convergence in Diffusion Models

Lu, Hui; Salah, Albert Ali; Poppe, Ronald

doi:https://doi.org/10.1007/978-3-031-73030-6_11

Compensation Sampling for Improved Convergence in Diffusion Models

Files

978-3-031-73030-6_11.pdf (1.78 MB)

Publication date

2025

Authors

Lu, Hui

Salah, Albert Ali

Poppe, R.W.

Editors

Leonardis, Aleš

Ricci, Elisa

Roth, Stefan

Russakovsky, Olga

Sattler, Torsten

Varol, Gül

DOI

https://doi.org/10.1007/978-3-031-73030-6_11

Document Type

Part of book

Metadata

Show full item record

Collections

Utrecht University Repository

License

taverne

Abstract

Diffusion models achieve remarkable quality in image generation, but at a cost. Iterative denoising requires many time steps to produce high fidelity images. We argue that the denoising process is crucially limited by an accumulation of the reconstruction error due to an initial inaccurate reconstruction of the target data. This leads to lower quality outputs, and slower convergence. To address these issues, we propose compensation sampling to guide the generation towards the target domain. We introduce a compensation term, implemented as a U-Net, which adds negligible computation overhead during training. Our approach is flexible and we demonstrate its application in unconditional generation, face inpainting, and face de-occlusion on benchmark datasets CIFAR-10, CelebA, CelebA-HQ, FFHQ-256, and FSG. Our approach consistently yields state-of-the-art results in terms of image quality, while accelerating the denoising process to converge during training by up to an order of magnitude (Our code and models will be made publicly available upon acceptance of the paper.).

Keywords

Diffusion models, Image generation, Iterative denoising, Taverne, Theoretical Computer Science, General Computer Science

Citation

Lu, H, Salah, A A & Poppe, R 2025, Compensation Sampling for Improved Convergence in Diffusion Models. in A Leonardis, E Ricci, S Roth, O Russakovsky, T Sattler & G Varol (eds), Computer Vision – ECCV 2024 - 18th European Conference, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 15119 LNCS, Springer, pp. 183-201, 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29/09/24. https://doi.org/10.1007/978-3-031-73030-6_11, conference

URI

https://dspace.library.uu.nl/handle/1874/482557