ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing

Diffusion models have made remarkable progress in image generation, leading to advancements in the field of image editing. However, balancing editability with faithfulness remains a significant challenge. Motivated by the fact that more novel content will be generated when larger variance noise is a...

Full description

Saved in:
Bibliographic Details
Main Authors: Xi Yu, Xiang Gu, Xin Hu, Jian Sun
Format: Article
Language:English
Published: MDPI AG 2025-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/25/10/2970
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849326963956645888
author Xi Yu
Xiang Gu
Xin Hu
Jian Sun
author_facet Xi Yu
Xiang Gu
Xin Hu
Jian Sun
author_sort Xi Yu
collection DOAJ
description Diffusion models have made remarkable progress in image generation, leading to advancements in the field of image editing. However, balancing editability with faithfulness remains a significant challenge. Motivated by the fact that more novel content will be generated when larger variance noise is applied to the image, in this paper, we propose an Enhanced Non-isotropic Gaussian Diffusion Model (ENGDM) for progressive image editing, which introduces independent Gaussian noise with varying variances to each pixel based on its editing needs. To enable efficient inference without retraining, ENGDM is rectified into an isotropic Gaussian diffusion model (IGDM) by assigning different total diffusion times to different pixels. Furthermore, we introduce reinforced text embeddings, using a novel editing reinforcement loss in the latent space to optimize text embeddings for enhanced editability. And we introduce optimized noise variances by employing a structural consistency loss to dynamically adjust the denoising time steps for each pixel for better faithfulness. Experimental results on multiple datasets demonstrate that ENGDM achieves state-of-the-art performance in image-editing tasks, effectively balancing faithfulness to the source image and alignment with the desired editing target.
format Article
id doaj-art-c6f06f8996f94edea9f613a00e7d39d5
institution Kabale University
issn 1424-8220
language English
publishDate 2025-05-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-c6f06f8996f94edea9f613a00e7d39d52025-08-20T03:48:01ZengMDPI AGSensors1424-82202025-05-012510297010.3390/s25102970ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image EditingXi Yu0Xiang Gu1Xin Hu2Jian Sun3School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaSchool of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaSchool of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaSchool of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, ChinaDiffusion models have made remarkable progress in image generation, leading to advancements in the field of image editing. However, balancing editability with faithfulness remains a significant challenge. Motivated by the fact that more novel content will be generated when larger variance noise is applied to the image, in this paper, we propose an Enhanced Non-isotropic Gaussian Diffusion Model (ENGDM) for progressive image editing, which introduces independent Gaussian noise with varying variances to each pixel based on its editing needs. To enable efficient inference without retraining, ENGDM is rectified into an isotropic Gaussian diffusion model (IGDM) by assigning different total diffusion times to different pixels. Furthermore, we introduce reinforced text embeddings, using a novel editing reinforcement loss in the latent space to optimize text embeddings for enhanced editability. And we introduce optimized noise variances by employing a structural consistency loss to dynamically adjust the denoising time steps for each pixel for better faithfulness. Experimental results on multiple datasets demonstrate that ENGDM achieves state-of-the-art performance in image-editing tasks, effectively balancing faithfulness to the source image and alignment with the desired editing target.https://www.mdpi.com/1424-8220/25/10/2970diffusion modelsprogressive image editingenhanced non-isotropic Gaussian diffusion model
spellingShingle Xi Yu
Xiang Gu
Xin Hu
Jian Sun
ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
Sensors
diffusion models
progressive image editing
enhanced non-isotropic Gaussian diffusion model
title ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_full ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_fullStr ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_full_unstemmed ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_short ENGDM: Enhanced Non-Isotropic Gaussian Diffusion Model for Progressive Image Editing
title_sort engdm enhanced non isotropic gaussian diffusion model for progressive image editing
topic diffusion models
progressive image editing
enhanced non-isotropic Gaussian diffusion model
url https://www.mdpi.com/1424-8220/25/10/2970
work_keys_str_mv AT xiyu engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting
AT xianggu engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting
AT xinhu engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting
AT jiansun engdmenhancednonisotropicgaussiandiffusionmodelforprogressiveimageediting