SF-SAM-Adapter: SAM-based segmentation model integrates prior knowledge for gaze image reflection noise removal

Gaze tracking technology in HMDs (Head-Mounted Displays) suffers from decreased accuracy due to highlight reflection noise from users' glasses. To address this, we present a denoising method which directly pinpoints the noisy regions through advanced segmentation models and then fills the flawe...

Bibliographic Details
Main Authors:	Ting Lei, Jing Chen, Jixiang Chen
Format:	Article
Language:	English
Published:	Elsevier 2025-01-01
Series:	Alexandria Engineering Journal
Subjects:	Eye tracking; Image segmentation; Image denoising; Large model
Online Access:	http://www.sciencedirect.com/science/article/pii/S1110016824012572

author	Ting Lei Jing Chen Jixiang Chen
collection	DOAJ
description	Gaze tracking in HMDs (Head-Mounted Displays) loses accuracy because of highlight reflection noise from users' glasses. To address this, we present a denoising method that first pinpoints the noisy regions with a segmentation model and then fills the flawed regions with an image inpainting algorithm. In the segmentation stage, we introduce a novel model based on the recently proposed large segmentation model SAM (Segment Anything Model), called SF-SAM-Adapter (Spatial and Frequency aware SAM Adapter). It injects prior knowledge about reflection noise, namely its strip-like shape in the spatial domain and its high-frequency content in the frequency domain, into SAM by integrating specially designed trainable adapter modules into the original structure, retaining the expressive power of the large model while adapting better to the downstream task. We achieved segmentation metrics of IoU (Intersection over Union) = 0.749 and Dice = 0.853 at a memory size of 13.9 MB, outperforming recent techniques including UNet, UNet++, BATFormer, FANet, MSA, and SAM2-Adapter. In the inpainting stage, we employ the inpainting algorithm LAMA (Large Mask inpainting), resulting in significant improvements in gaze tracking accuracy of 0.502°, 0.182°, and 0.319° across three algorithms. The code and datasets used in the current study are available in the repository: https://github.com/leiting5297/SF-SAM-Adapter.git.
format	Article
id	doaj-art-f57a436d5d754df5915cc8dd5b0f608b
institution	Kabale University
issn	1110-0168
language	English
publishDate	2025-01-01
publisher	Elsevier
record_format	Article
series	Alexandria Engineering Journal
spelling	Alexandria Engineering Journal, ISSN 1110-0168, vol. 111, pp. 521-529 (2025-01-01). Authors: Ting Lei; Jing Chen (corresponding author); Jixiang Chen. Affiliation of all authors: School of Optics and Photonics, Beijing Institute of Technology, Beijing 100089, China.
title	SF-SAM-Adapter: SAM-based segmentation model integrates prior knowledge for gaze image reflection noise removal
topic	Eye tracking; Image segmentation; Image denoising; Large model
url	http://www.sciencedirect.com/science/article/pii/S1110016824012572


Similar Items