Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing
As the performance and accuracy of machine learning and AI algorithms improve, the demand for adopting computer vision techniques to solve various problems, such as autonomous driving and AI robots, increases. To meet such demand, IoT and edge devices, which are small enough to be adopted in various...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2025-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10807180/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841556997151916032 |
---|---|
author | Seunghwan Kim Changjong Kim Sunggon Kim |
author_facet | Seunghwan Kim Changjong Kim Sunggon Kim |
author_sort | Seunghwan Kim |
collection | DOAJ |
description | As the performance and accuracy of machine learning and AI algorithms improve, the demand for adopting computer vision techniques to solve various problems, such as autonomous driving and AI robots, increases. To meet such demand, IoT and edge devices, which are small enough to be adopted in various environments while having sufficient computing capabilities, are being widely adopted. However, as devices are utilized in IoT and edge environments, which have harsh restrictions compared to traditional server environments, they are often limited by low computational and memory resources, in addition to the limited electrical power supply. This necessitates a unique approach for small IoT devices that are required to run complex tasks. In this paper, we propose a concurrent multi-frame processing scheme for real-time object detection algorithms. To do this, we first divide the video into individual frames and group the frames according to the number of cores in the device. Then, we allocate a group of frames per core to perform the object detection, resulting in parallel detection of multiple frames. We implement our scheme in YOLO (You Only Look Once), one of the most popular real-time object detection algorithms, on a state-of-the-art, resource-constrained IoT edge device, Nvidia Jetson Orin Nano, using real-world video and image datasets, including MS-COCO, ImageNet, PascalVOC, DOTA, animal videos, and car-traffic videos. Our evaluation results show that our proposed scheme can improve the diverse aspect of edge performance and improve the runtime, memory consumption, and power usage by up to 445%, 69%, and 73%, respectively. Additionally, it demonstrates improvements of <inline-formula> <tex-math notation="LaTeX">$2.10\times $ </tex-math></inline-formula> over state-of-the-art model optimization. |
format | Article |
id | doaj-art-2d4427bfc0f74dae851e3c9e2a5035d6 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-2d4427bfc0f74dae851e3c9e2a5035d62025-01-07T00:01:33ZengIEEEIEEE Access2169-35362025-01-01131522153310.1109/ACCESS.2024.352024010807180Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame ProcessingSeunghwan Kim0https://orcid.org/0009-0006-0374-8542Changjong Kim1https://orcid.org/0000-0002-0842-1593Sunggon Kim2https://orcid.org/0000-0002-2295-3385Department of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaDepartment of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of KoreaAs the performance and accuracy of machine learning and AI algorithms improve, the demand for adopting computer vision techniques to solve various problems, such as autonomous driving and AI robots, increases. To meet such demand, IoT and edge devices, which are small enough to be adopted in various environments while having sufficient computing capabilities, are being widely adopted. However, as devices are utilized in IoT and edge environments, which have harsh restrictions compared to traditional server environments, they are often limited by low computational and memory resources, in addition to the limited electrical power supply. This necessitates a unique approach for small IoT devices that are required to run complex tasks. In this paper, we propose a concurrent multi-frame processing scheme for real-time object detection algorithms. To do this, we first divide the video into individual frames and group the frames according to the number of cores in the device. Then, we allocate a group of frames per core to perform the object detection, resulting in parallel detection of multiple frames. We implement our scheme in YOLO (You Only Look Once), one of the most popular real-time object detection algorithms, on a state-of-the-art, resource-constrained IoT edge device, Nvidia Jetson Orin Nano, using real-world video and image datasets, including MS-COCO, ImageNet, PascalVOC, DOTA, animal videos, and car-traffic videos. Our evaluation results show that our proposed scheme can improve the diverse aspect of edge performance and improve the runtime, memory consumption, and power usage by up to 445%, 69%, and 73%, respectively. Additionally, it demonstrates improvements of <inline-formula> <tex-math notation="LaTeX">$2.10\times $ </tex-math></inline-formula> over state-of-the-art model optimization.https://ieeexplore.ieee.org/document/10807180/Edge devicesmachine learningobject detectionperformance optimization |
spellingShingle | Seunghwan Kim Changjong Kim Sunggon Kim Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing IEEE Access Edge devices machine learning object detection performance optimization |
title | Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing |
title_full | Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing |
title_fullStr | Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing |
title_full_unstemmed | Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing |
title_short | Improving Performance of Real-Time Object Detection in Edge Device Through Concurrent Multi-Frame Processing |
title_sort | improving performance of real time object detection in edge device through concurrent multi frame processing |
topic | Edge devices machine learning object detection performance optimization |
url | https://ieeexplore.ieee.org/document/10807180/ |
work_keys_str_mv | AT seunghwankim improvingperformanceofrealtimeobjectdetectioninedgedevicethroughconcurrentmultiframeprocessing AT changjongkim improvingperformanceofrealtimeobjectdetectioninedgedevicethroughconcurrentmultiframeprocessing AT sunggonkim improvingperformanceofrealtimeobjectdetectioninedgedevicethroughconcurrentmultiframeprocessing |