Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing

As the performance and accuracy of machine learning and AI algorithms improve, the demand for adopting computer vision techniques to solve various problems, such as autonomous driving and AI robots, increases. To meet such demand, IoT and edge devices, which are small enough to be adopted in various...

Full description

Saved in:

Bibliographic Details
Main Authors:	Seunghwan Kim, Changjong Kim, Sunggon Kim
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Edge devices machine learning object detection performance optimization
Online Access:	https://ieeexplore.ieee.org/document/10807180/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841556997151916032
author	Seunghwan Kim Changjong Kim Sunggon Kim
author_facet	Seunghwan Kim Changjong Kim Sunggon Kim
author_sort	Seunghwan Kim
collection	DOAJ
description	As the performance and accuracy of machine learning and AI algorithms improve, the demand for adopting computer vision techniques to solve various problems, such as autonomous driving and AI robots, increases. To meet such demand, IoT and edge devices, which are small enough to be adopted in various environments while having sufficient computing capabilities, are being widely adopted. However, as devices are utilized in IoT and edge environments, which have harsh restrictions compared to traditional server environments, they are often limited by low computational and memory resources, in addition to the limited electrical power supply. This necessitates a unique approach for small IoT devices that are required to run complex tasks. In this paper, we propose a concurrent multi-frame processing scheme for real-time object detection algorithms. To do this, we first divide the video into individual frames and group the frames according to the number of cores in the device. Then, we allocate a group of frames per core to perform the object detection, resulting in parallel detection of multiple frames. We implement our scheme in YOLO (You Only Look Once), one of the most popular real-time object detection algorithms, on a state-of-the-art, resource-constrained IoT edge device, Nvidia Jetson Orin Nano, using real-world video and image datasets, including MS-COCO, ImageNet, PascalVOC, DOTA, animal videos, and car-traffic videos. Our evaluation results show that our proposed scheme can improve the diverse aspect of edge performance and improve the runtime, memory consumption, and power usage by up to 445%, 69%, and 73%, respectively. Additionally, it demonstrates improvements of <inline-formula> <tex-math notation="LaTeX">$2.10\times $ </tex-math></inline-formula> over state-of-the-art model optimization.
format	Article
id	doaj-art-2d4427bfc0f74dae851e3c9e2a5035d6
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-2d4427bfc0f74dae851e3c9e2a5035d62025-01-07T00:01:33ZengIEEEIEEE Access2169-35362025-01-01131522153310.1109/ACCESS.2024.352024010807180Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame ProcessingSeunghwan Kim0https://orcid.org/0009-0006-0374-8542Changjong Kim1https://orcid.org/0000-0002-0842-1593Sunggon Kim2https://orcid.org/0000-0002-2295-3385Department of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaAs the performance and accuracy of machine learning and AI algorithms improve, the demand for adopting computer vision techniques to solve various problems, such as autonomous driving and AI robots, increases. To meet such demand, IoT and edge devices, which are small enough to be adopted in various environments while having sufficient computing capabilities, are being widely adopted. However, as devices are utilized in IoT and edge environments, which have harsh restrictions compared to traditional server environments, they are often limited by low computational and memory resources, in addition to the limited electrical power supply. This necessitates a unique approach for small IoT devices that are required to run complex tasks. In this paper, we propose a concurrent multi-frame processing scheme for real-time object detection algorithms. To do this, we first divide the video into individual frames and group the frames according to the number of cores in the device. Then, we allocate a group of frames per core to perform the object detection, resulting in parallel detection of multiple frames. We implement our scheme in YOLO (You Only Look Once), one of the most popular real-time object detection algorithms, on a state-of-the-art, resource-constrained IoT edge device, Nvidia Jetson Orin Nano, using real-world video and image datasets, including MS-COCO, ImageNet, PascalVOC, DOTA, animal videos, and car-traffic videos. Our evaluation results show that our proposed scheme can improve the diverse aspect of edge performance and improve the runtime, memory consumption, and power usage by up to 445%, 69%, and 73%, respectively. Additionally, it demonstrates improvements of <inline-formula> <tex-math notation="LaTeX">$2.10\times $ </tex-math></inline-formula> over state-of-the-art model optimization.https://ieeexplore.ieee.org/document/10807180/Edge devicesmachine learningobject detectionperformance optimization
spellingShingle	Seunghwan Kim Changjong Kim Sunggon Kim Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing IEEE Access Edge devices machine learning object detection performance optimization
title	Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
title_full	Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
title_fullStr	Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
title_full_unstemmed	Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
title_short	Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
title_sort	improving performance of real time object detection in edge device through concurrent multi frame processing
topic	Edge devices machine learning object detection performance optimization
url	https://ieeexplore.ieee.org/document/10807180/
work_keys_str_mv	AT seunghwankim improvingperformanceofrealtimeobjectdetectioninedgedevicethroughconcurrentmultiframeprocessing AT changjongkim improvingperformanceofrealtimeobjectdetectioninedgedevicethroughconcurrentmultiframeprocessing AT sunggonkim improvingperformanceofrealtimeobjectdetectioninedgedevicethroughconcurrentmultiframeprocessing

Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing

Similar Items