Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing

Bibliographic Details
Main Authors: Seunghwan Kim, Changjong Kim, Sunggon Kim
Format: Article
Language: English
Published: IEEE 2025-01-01
Series: IEEE Access
Subjects: Edge devices; machine learning; object detection; performance optimization
Online Access: https://ieeexplore.ieee.org/document/10807180/
author Seunghwan Kim
Changjong Kim
Sunggon Kim
collection DOAJ
description As the performance and accuracy of machine learning and AI algorithms improve, the demand for computer vision techniques to solve problems such as autonomous driving and AI robots increases. To meet this demand, IoT and edge devices, which are small enough to be deployed in various environments while offering sufficient computing capabilities, are being widely adopted. However, IoT and edge environments impose harsh restrictions compared to traditional server environments: devices are often limited by low computational and memory resources, in addition to a limited electrical power supply. This necessitates a dedicated approach for small IoT devices that are required to run complex tasks. In this paper, we propose a concurrent multi-frame processing scheme for real-time object detection algorithms. We first divide the video into individual frames and group the frames according to the number of cores in the device. Then, we allocate a group of frames per core to perform the object detection, resulting in parallel detection of multiple frames. We implement our scheme in YOLO (You Only Look Once), one of the most popular real-time object detection algorithms, on a state-of-the-art, resource-constrained IoT edge device, Nvidia Jetson Orin Nano, using real-world video and image datasets, including MS-COCO, ImageNet, PascalVOC, DOTA, animal videos, and car-traffic videos. Our evaluation results show that the proposed scheme improves diverse aspects of edge performance, improving runtime, memory consumption, and power usage by up to 445%, 69%, and 73%, respectively. Additionally, it demonstrates improvements of $2.10\times$ over state-of-the-art model optimization.
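The grouping-and-dispatch idea in the description can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `group_frames`, `detect_group`, `detect_concurrently`, and `fake_detect` are hypothetical, frames are assumed to be split into contiguous near-equal groups (the record does not say how groups are formed), and a stand-in function replaces the YOLO model. A thread pool is used by default so the sketch runs anywhere; passing `concurrent.futures.ProcessPoolExecutor` as `executor_cls` would be closer to one worker per core, provided `detect` is picklable.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def group_frames(frames, num_groups):
    """Split frames into at most num_groups contiguous groups of near-equal size."""
    if num_groups < 1:
        raise ValueError("num_groups must be at least 1")
    base, extra = divmod(len(frames), num_groups)
    groups, start = [], 0
    for i in range(num_groups):
        # The first `extra` groups take one additional frame.
        size = base + (1 if i < extra else 0)
        if size:
            groups.append(frames[start:start + size])
        start += size
    return groups


def detect_group(detect, group):
    """Run the detector on every frame of one group, in order."""
    return [detect(frame) for frame in group]


def detect_concurrently(frames, detect, num_workers=None,
                        executor_cls=ThreadPoolExecutor):
    """Process one group of frames per worker and return results in frame order."""
    # Assumption: default to one worker per CPU core reported by the OS.
    num_workers = num_workers or os.cpu_count() or 1
    groups = group_frames(frames, num_workers)
    if not groups:
        return []
    with executor_cls(max_workers=len(groups)) as pool:
        # map() preserves group order, so flattening restores frame order.
        per_group = pool.map(detect_group, [detect] * len(groups), groups)
        return [result for group in per_group for result in group]


def fake_detect(frame):
    """Hypothetical stand-in for a real detector such as a YOLO model."""
    return f"frame-{frame}: no objects"
```

For example, `detect_concurrently(list(range(8)), fake_detect, num_workers=4)` forms four groups of two frames each and returns the eight results in the original frame order.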
id doaj-art-2d4427bfc0f74dae851e3c9e2a5035d6
institution Kabale University
issn 2169-3536
spelling doaj-art-2d4427bfc0f74dae851e3c9e2a5035d6
2025-01-07T00:01:33Z
Volume 13, pages 1522-1533
DOI 10.1109/ACCESS.2024.3520240
Seunghwan Kim https://orcid.org/0009-0006-0374-8542
Changjong Kim https://orcid.org/0000-0002-0842-1593
Sunggon Kim https://orcid.org/0000-0002-2295-3385
Department of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of Korea (all three authors)
title Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
topic Edge devices
machine learning
object detection
performance optimization