ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing

Diffusion models have made remarkable progress in image generation, leading to advancements in the field of image editing. However, balancing editability with faithfulness remains a significant challenge. Motivated by the fact that more novel content will be generated when larger variance noise is a...

Full description

Saved in:

Bibliographic Details
Main Authors:	Xi Yu, Xiang Gu, Xin Hu, Jian Sun
Format:	Article
Language:	English
Published:	MDPI AG 2025-05-01
Series:	Sensors
Subjects:	diffusion models progressive image editing enhanced non-isotropic Gaussian diffusion model
Online Access:	https://www.mdpi.com/1424-8220/25/10/2970
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849326963956645888
author	Xi Yu Xiang Gu Xin Hu Jian Sun
author_facet	Xi Yu Xiang Gu Xin Hu Jian Sun
author_sort	Xi Yu
collection	DOAJ
description	Diffusion models have made remarkable progress in image generation, leading to advancements in the field of image editing. However, balancing editability with faithfulness remains a significant challenge. Motivated by the fact that more novel content will be generated when larger variance noise is applied to the image, in this paper, we propose an Enhanced Non-isotropic Gaussian Diffusion Model (ENGDM) for progressive image editing, which introduces independent Gaussian noise with varying variances to each pixel based on its editing needs. To enable efficient inference without retraining, ENGDM is rectified into an isotropic Gaussian diffusion model (IGDM) by assigning different total diffusion times to different pixels. Furthermore, we introduce reinforced text embeddings, using a novel editing reinforcement loss in the latent space to optimize text embeddings for enhanced editability. And we introduce optimized noise variances by employing a structural consistency loss to dynamically adjust the denoising time steps for each pixel for better faithfulness. Experimental results on multiple datasets demonstrate that ENGDM achieves state-of-the-art performance in image-editing tasks, effectively balancing faithfulness to the source image and alignment with the desired editing target.
format	Article
id	doaj-art-c6f06f8996f94edea9f613a00e7d39d5
institution	Kabale University
issn	1424-8220
language	English
publishDate	2025-05-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj-art-c6f06f8996f94edea9f613a00e7d39d52025-08-20T03:48:01ZengMDPI AGSensors1424-82202025-05-012510297010.3390/s25102970ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image EditingXi Yu0Xiang Gu1Xin Hu2Jian Sun3School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaSchool of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaSchool of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaSchool of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaDiffusion models have made remarkable progress in image generation, leading to advancements in the field of image editing. However, balancing editability with faithfulness remains a significant challenge. Motivated by the fact that more novel content will be generated when larger variance noise is applied to the image, in this paper, we propose an Enhanced Non-isotropic Gaussian Diffusion Model (ENGDM) for progressive image editing, which introduces independent Gaussian noise with varying variances to each pixel based on its editing needs. To enable efficient inference without retraining, ENGDM is rectified into an isotropic Gaussian diffusion model (IGDM) by assigning different total diffusion times to different pixels. Furthermore, we introduce reinforced text embeddings, using a novel editing reinforcement loss in the latent space to optimize text embeddings for enhanced editability. And we introduce optimized noise variances by employing a structural consistency loss to dynamically adjust the denoising time steps for each pixel for better faithfulness. Experimental results on multiple datasets demonstrate that ENGDM achieves state-of-the-art performance in image-editing tasks, effectively balancing faithfulness to the source image and alignment with the desired editing target.https://www.mdpi.com/1424-8220/25/10/2970diffusion modelsprogressive image editingenhanced non-isotropic Gaussian diffusion model
spellingShingle	Xi Yu Xiang Gu Xin Hu Jian Sun ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing Sensors diffusion models progressive image editing enhanced non-isotropic Gaussian diffusion model
title	ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_full	ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_fullStr	ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_full_unstemmed	ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_short	ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_sort	engdm enhanced non isotropic gaussian diffusion model for progressive image editing
topic	diffusion models progressive image editing enhanced non-isotropic Gaussian diffusion model
url	https://www.mdpi.com/1424-8220/25/10/2970
work_keys_str_mv	AT xiyu engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting AT xianggu engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting AT xinhu engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting AT jiansun engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting

ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing

Similar Items