Enhancing Road Scene Segmentation With an Optimized DeepLabV3+

Semantic segmentation, as a dense predictive task, is inevitably affected by various external factor, making common road image semantic segmentation models unable to meet dual demands of high accuracy and real-time performance in unstructured road scenarios. To address these issues, this paper propo...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhe Ren, Libao Wang, Tianming Song, Yihang Li, Jian Zhang, Fengfeng Zhao
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Autonomous driving convolutional neural networks deep learning road scene segmentation semantic segmentation
Online Access:	https://ieeexplore.ieee.org/document/10812701/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841533385918382080
author	Zhe Ren Libao Wang Tianming Song Yihang Li Jian Zhang Fengfeng Zhao
author_facet	Zhe Ren Libao Wang Tianming Song Yihang Li Jian Zhang Fengfeng Zhao
author_sort	Zhe Ren
collection	DOAJ
description	Semantic segmentation, as a dense predictive task, is inevitably affected by various external factor, making common road image semantic segmentation models unable to meet dual demands of high accuracy and real-time performance in unstructured road scenarios. To address these issues, this paper proposes an enhanced road scene segmentation method based on DeepLabV3+ that addresses the common trade-offs between accuracy and real-time performance in existing approaches. First, the heavy Xception backbone is replaced with the lightweight MobileNetV2, significantly boosting real-time efficiency while maintaining competitive segmentation accuracy. Second, the Atrous Spatial Pyramid Pooling (ASPP) module is optimized by introducing depthwise separable convolutions and a hierarchical feature fusion strategy, reducing computational complexity and mitigating the grid effect, a limitation in many current models. Finally, a Shuffle Attention mechanism is incorporated to improve the handling of small objects and fine details, such as distant pedestrians or items held by them, enhancing segmentation precision without excessive computational overhead. The method was trained and evaluated on the Cityscapes and CamVid datasets, achieving 84.3% mPA and 41.8 FPS on Cityscapes, and 78.1% mPA and 30.2 FPS on CamVid. These experimental results demonstrate a significant improvement in balancing detection capabilities with real-time performance.
format	Article
id	doaj-art-0a55601030534f2299f2049d1995c878
institution	Kabale University
issn	2169-3536
language	English
publishDate	2024-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-0a55601030534f2299f2049d1995c8782025-01-16T00:02:00ZengIEEEIEEE Access2169-35362024-01-011219774819776510.1109/ACCESS.2024.352159710812701Enhancing Road Scene Segmentation With an Optimized DeepLabV3+Zhe Ren0Libao Wang1Tianming Song2Yihang Li3Jian Zhang4Fengfeng Zhao5https://orcid.org/0009-0008-3543-0584School of Intelligent Manufacturing, Wuxi Vocational College of Science and Technology, Wuxi, ChinaShandong Hopetry Information Technology Company, Linyi, ChinaSchool of Intelligent Manufacturing, Wuxi Vocational College of Science and Technology, Wuxi, ChinaHangzhou Wenyi Street Primary School, Hangzhou, ChinaSchool of Intelligent Manufacturing, Wuxi Vocational College of Science and Technology, Wuxi, ChinaCollege of Computer and Control Engineering, Northeast Forestry University, Harbin, ChinaSemantic segmentation, as a dense predictive task, is inevitably affected by various external factor, making common road image semantic segmentation models unable to meet dual demands of high accuracy and real-time performance in unstructured road scenarios. To address these issues, this paper proposes an enhanced road scene segmentation method based on DeepLabV3+ that addresses the common trade-offs between accuracy and real-time performance in existing approaches. First, the heavy Xception backbone is replaced with the lightweight MobileNetV2, significantly boosting real-time efficiency while maintaining competitive segmentation accuracy. Second, the Atrous Spatial Pyramid Pooling (ASPP) module is optimized by introducing depthwise separable convolutions and a hierarchical feature fusion strategy, reducing computational complexity and mitigating the grid effect, a limitation in many current models. Finally, a Shuffle Attention mechanism is incorporated to improve the handling of small objects and fine details, such as distant pedestrians or items held by them, enhancing segmentation precision without excessive computational overhead. The method was trained and evaluated on the Cityscapes and CamVid datasets, achieving 84.3% mPA and 41.8 FPS on Cityscapes, and 78.1% mPA and 30.2 FPS on CamVid. These experimental results demonstrate a significant improvement in balancing detection capabilities with real-time performance.https://ieeexplore.ieee.org/document/10812701/Autonomous drivingconvolutional neural networksdeep learningroad scene segmentationsemantic segmentation
spellingShingle	Zhe Ren Libao Wang Tianming Song Yihang Li Jian Zhang Fengfeng Zhao Enhancing Road Scene Segmentation With an Optimized DeepLabV3+ IEEE Access Autonomous driving convolutional neural networks deep learning road scene segmentation semantic segmentation
title	Enhancing Road Scene Segmentation With an Optimized DeepLabV3+
title_full	Enhancing Road Scene Segmentation With an Optimized DeepLabV3+
title_fullStr	Enhancing Road Scene Segmentation With an Optimized DeepLabV3+
title_full_unstemmed	Enhancing Road Scene Segmentation With an Optimized DeepLabV3+
title_short	Enhancing Road Scene Segmentation With an Optimized DeepLabV3+
title_sort	enhancing road scene segmentation with an optimized deeplabv3
topic	Autonomous driving convolutional neural networks deep learning road scene segmentation semantic segmentation
url	https://ieeexplore.ieee.org/document/10812701/
work_keys_str_mv	AT zheren enhancingroadscenesegmentationwithanoptimizeddeeplabv3 AT libaowang enhancingroadscenesegmentationwithanoptimizeddeeplabv3 AT tianmingsong enhancingroadscenesegmentationwithanoptimizeddeeplabv3 AT yihangli enhancingroadscenesegmentationwithanoptimizeddeeplabv3 AT jianzhang enhancingroadscenesegmentationwithanoptimizeddeeplabv3 AT fengfengzhao enhancingroadscenesegmentationwithanoptimizeddeeplabv3

Enhancing Road Scene Segmentation With an Optimized DeepLabV3+

Similar Items