Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes

Semantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However...

Full description

Saved in:

Bibliographic Details
Main Authors:	Baofeng Ye, Renzheng Xue
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Semantic segmentation attention real-time deep learning
Online Access:	https://ieeexplore.ieee.org/document/10813360/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841563324491235328
author	Baofeng Ye Renzheng Xue
author_facet	Baofeng Ye Renzheng Xue
author_sort	Baofeng Ye
collection	DOAJ
description	Semantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However, existing models either improve accuracy at the cost of high computational expense and long inference times or enhance inference speed by sacrificing resolution and multi-level detailed information, resulting in a significant drop in accuracy. In this paper, we propose a new architecture based on a bilateral segmentation network, called DADNet. We have designed a new attention mechanism to optimize feature maps and a feature fusion module with an attention mechanism to efficiently merge different features, effectively expanding the receptive field. Our method demonstrates an excellent balance between segmentation accuracy and speed on the Cityscapes and CamVid datasets. Specifically, DADNet achieves a mIoU of 78.2% at 90.5 FPS on the Cityscapes validation set using a single 2080Ti GPU. On the CamVid test set, it achieves a mIoU of 75.8% at 136.7 FPS. Our approach outperforms most state-of-the-art models while requiring less computational power.
format	Article
id	doaj-art-d2fe1bb611054a2a8b90a42ab9a076a8
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-d2fe1bb611054a2a8b90a42ab9a076a82025-01-03T00:01:57ZengIEEEIEEE Access2169-35362025-01-011358859510.1109/ACCESS.2024.352195810813360Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street ScenesBaofeng Ye0https://orcid.org/0009-0000-5651-4906Renzheng Xue1https://orcid.org/0009-0000-7183-4502School of Computer and Control Engineering, Qiqihar University, Qiqihar, ChinaKey Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, Heilongjiang, ChinaSemantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However, existing models either improve accuracy at the cost of high computational expense and long inference times or enhance inference speed by sacrificing resolution and multi-level detailed information, resulting in a significant drop in accuracy. In this paper, we propose a new architecture based on a bilateral segmentation network, called DADNet. We have designed a new attention mechanism to optimize feature maps and a feature fusion module with an attention mechanism to efficiently merge different features, effectively expanding the receptive field. Our method demonstrates an excellent balance between segmentation accuracy and speed on the Cityscapes and CamVid datasets. Specifically, DADNet achieves a mIoU of 78.2% at 90.5 FPS on the Cityscapes validation set using a single 2080Ti GPU. On the CamVid test set, it achieves a mIoU of 75.8% at 136.7 FPS. Our approach outperforms most state-of-the-art models while requiring less computational power.https://ieeexplore.ieee.org/document/10813360/Semantic segmentationattentionreal-timedeep learning
spellingShingle	Baofeng Ye Renzheng Xue Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes IEEE Access Semantic segmentation attention real-time deep learning
title	Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_full	Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_fullStr	Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_full_unstemmed	Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_short	Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_sort	dual attention dual resolution networks for real time semantic segmentation of street scenes
topic	Semantic segmentation attention real-time deep learning
url	https://ieeexplore.ieee.org/document/10813360/
work_keys_str_mv	AT baofengye dualattentiondualresolutionnetworksforrealtimesemanticsegmentationofstreetscenes AT renzhengxue dualattentiondualresolutionnetworksforrealtimesemanticsegmentationofstreetscenes

Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes

Similar Items