Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method

This paper tackles the challenge of accurately segmenting images of Ming-style furniture, an important aspect of China’s cultural heritage, to aid in its preservation and analysis. Existing vision foundation models, like the segment anything model (SAM), struggle with the complex structures of Ming...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yingtong Wan, Wanru Wang, Meng Zhang, Wei Peng, He Tang
Format:	Article
Language:	English
Published:	MDPI AG 2024-12-01
Series:	Sensors
Subjects:	cultural heritage preservation image segmentation vision foundation model prompt learning
Online Access:	https://www.mdpi.com/1424-8220/25/1/96
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841548900437065728
author	Yingtong Wan Wanru Wang Meng Zhang Wei Peng He Tang
author_facet	Yingtong Wan Wanru Wang Meng Zhang Wei Peng He Tang
author_sort	Yingtong Wan
collection	DOAJ
description	This paper tackles the challenge of accurately segmenting images of Ming-style furniture, an important aspect of China’s cultural heritage, to aid in its preservation and analysis. Existing vision foundation models, like the segment anything model (SAM), struggle with the complex structures of Ming furniture due to the need for manual prompts and imprecise segmentation outputs. To address these limitations, we introduce two key innovations: the material attribute prompter (MAP), which automatically generates prompts based on the furniture’s material properties, and the structure refinement module (SRM), which enhances segmentation by combining high- and low-level features. Additionally, we present the MF2K dataset, which includes 2073 images annotated with pixel-level masks across eight materials and environments. Our experiments demonstrate that the proposed method significantly improves the segmentation accuracy, outperforming state-of-the-art models in terms of the mean intersection over union (mIoU). Ablation studies highlight the contributions of the MAP and SRM to both the performance and computational efficiency. This work offers a powerful automated solution for segmenting intricate furniture structures, facilitating digital preservation and in-depth analysis of Ming-style furniture.
format	Article
id	doaj-art-1bfe9df8d2d94a1db9bb4abe0e8d3014
institution	Kabale University
issn	1424-8220
language	English
publishDate	2024-12-01
publisher	MDPI AG
record_format	Article
series	Sensors
spelling	doaj-art-1bfe9df8d2d94a1db9bb4abe0e8d30142025-01-10T13:20:52ZengMDPI AGSensors1424-82202024-12-012519610.3390/s25010096Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and MethodYingtong Wan0Wanru Wang1Meng Zhang2Wei Peng3He Tang4School of Industrial Design, Hubei University of Technology, Wuhan 430068, ChinaSchool of Industrial Design, Hubei University of Technology, Wuhan 430068, ChinaSchool of Industrial Design, Hubei University of Technology, Wuhan 430068, ChinaSchool of Industrial Design, Hubei University of Technology, Wuhan 430068, ChinaSchool of Software Engineering, Huazhong University of Science and Technology, Wuhan 430074, ChinaThis paper tackles the challenge of accurately segmenting images of Ming-style furniture, an important aspect of China’s cultural heritage, to aid in its preservation and analysis. Existing vision foundation models, like the segment anything model (SAM), struggle with the complex structures of Ming furniture due to the need for manual prompts and imprecise segmentation outputs. To address these limitations, we introduce two key innovations: the material attribute prompter (MAP), which automatically generates prompts based on the furniture’s material properties, and the structure refinement module (SRM), which enhances segmentation by combining high- and low-level features. Additionally, we present the MF2K dataset, which includes 2073 images annotated with pixel-level masks across eight materials and environments. Our experiments demonstrate that the proposed method significantly improves the segmentation accuracy, outperforming state-of-the-art models in terms of the mean intersection over union (mIoU). Ablation studies highlight the contributions of the MAP and SRM to both the performance and computational efficiency. This work offers a powerful automated solution for segmenting intricate furniture structures, facilitating digital preservation and in-depth analysis of Ming-style furniture.https://www.mdpi.com/1424-8220/25/1/96cultural heritage preservationimage segmentationvision foundation modelprompt learning
spellingShingle	Yingtong Wan Wanru Wang Meng Zhang Wei Peng He Tang Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method Sensors cultural heritage preservation image segmentation vision foundation model prompt learning
title	Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method
title_full	Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method
title_fullStr	Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method
title_full_unstemmed	Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method
title_short	Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method
title_sort	advancing a vision foundation model for ming style furniture image segmentation a new dataset and method
topic	cultural heritage preservation image segmentation vision foundation model prompt learning
url	https://www.mdpi.com/1424-8220/25/1/96
work_keys_str_mv	AT yingtongwan advancingavisionfoundationmodelformingstylefurnitureimagesegmentationanewdatasetandmethod AT wanruwang advancingavisionfoundationmodelformingstylefurnitureimagesegmentationanewdatasetandmethod AT mengzhang advancingavisionfoundationmodelformingstylefurnitureimagesegmentationanewdatasetandmethod AT weipeng advancingavisionfoundationmodelformingstylefurnitureimagesegmentationanewdatasetandmethod AT hetang advancingavisionfoundationmodelformingstylefurnitureimagesegmentationanewdatasetandmethod

Advancing a Vision Foundation Model for Ming-Style Furniture Image Segmentation: A New Dataset and Method

Similar Items