Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes

Semantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However...

Full description

Saved in:
Bibliographic Details
Main Authors: Baofeng Ye, Renzheng Xue
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10813360/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841563324491235328
author Baofeng Ye
Renzheng Xue
author_facet Baofeng Ye
Renzheng Xue
author_sort Baofeng Ye
collection DOAJ
description Semantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However, existing models either improve accuracy at the cost of high computational expense and long inference times or enhance inference speed by sacrificing resolution and multi-level detailed information, resulting in a significant drop in accuracy. In this paper, we propose a new architecture based on a bilateral segmentation network, called DADNet. We have designed a new attention mechanism to optimize feature maps and a feature fusion module with an attention mechanism to efficiently merge different features, effectively expanding the receptive field. Our method demonstrates an excellent balance between segmentation accuracy and speed on the Cityscapes and CamVid datasets. Specifically, DADNet achieves a mIoU of 78.2% at 90.5 FPS on the Cityscapes validation set using a single 2080Ti GPU. On the CamVid test set, it achieves a mIoU of 75.8% at 136.7 FPS. Our approach outperforms most state-of-the-art models while requiring less computational power.
format Article
id doaj-art-d2fe1bb611054a2a8b90a42ab9a076a8
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-d2fe1bb611054a2a8b90a42ab9a076a82025-01-03T00:01:57ZengIEEEIEEE Access2169-35362025-01-011358859510.1109/ACCESS.2024.352195810813360Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street ScenesBaofeng Ye0https://orcid.org/0009-0000-5651-4906Renzheng Xue1https://orcid.org/0009-0000-7183-4502School of Computer and Control Engineering, Qiqihar University, Qiqihar, ChinaKey Laboratory of Big Data Network Security Detection and Analysis, Qiqihar University, Qiqihar, Heilongjiang, ChinaSemantic segmentation is a crucial technology for autonomous vehicles to acquire information about their surrounding environment. To ensure that semantic segmentation has practical application value in autonomous driving and robotics, it must achieve corresponding real-time inference speeds. However, existing models either improve accuracy at the cost of high computational expense and long inference times or enhance inference speed by sacrificing resolution and multi-level detailed information, resulting in a significant drop in accuracy. In this paper, we propose a new architecture based on a bilateral segmentation network, called DADNet. We have designed a new attention mechanism to optimize feature maps and a feature fusion module with an attention mechanism to efficiently merge different features, effectively expanding the receptive field. Our method demonstrates an excellent balance between segmentation accuracy and speed on the Cityscapes and CamVid datasets. Specifically, DADNet achieves a mIoU of 78.2% at 90.5 FPS on the Cityscapes validation set using a single 2080Ti GPU. On the CamVid test set, it achieves a mIoU of 75.8% at 136.7 FPS. Our approach outperforms most state-of-the-art models while requiring less computational power.https://ieeexplore.ieee.org/document/10813360/Semantic segmentationattentionreal-timedeep learning
spellingShingle Baofeng Ye
Renzheng Xue
Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
IEEE Access
Semantic segmentation
attention
real-time
deep learning
title Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_full Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_fullStr Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_full_unstemmed Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_short Dual Attention Dual-Resolution Networks for Real-Time Semantic Segmentation of Street Scenes
title_sort dual attention dual resolution networks for real time semantic segmentation of street scenes
topic Semantic segmentation
attention
real-time
deep learning
url https://ieeexplore.ieee.org/document/10813360/
work_keys_str_mv AT baofengye dualattentiondualresolutionnetworksforrealtimesemanticsegmentationofstreetscenes
AT renzhengxue dualattentiondualresolutionnetworksforrealtimesemanticsegmentationofstreetscenes