Similarity based task timely termination method for image based intelligent agents

Abstract Due to the hallucination of the underlying large language model(LLMs) or the unclear description of the task’s ultimate goal, the agents have become somewhat confused. Despite having completed tasks, they have not ceased working, leading to a waste of resource. We propose similarity-based t...

Full description

Saved in:
Bibliographic Details
Main Authors: Sheng Jie, Xing Huang, Chengxi Jing, Xian Jiang, Ligang Dong
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-83463-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841559493570199552
author Sheng Jie
Xing Huang
Chengxi Jing
Xian Jiang
Ligang Dong
author_facet Sheng Jie
Xing Huang
Chengxi Jing
Xian Jiang
Ligang Dong
author_sort Sheng Jie
collection DOAJ
description Abstract Due to the hallucination of the underlying large language model(LLMs) or the unclear description of the task’s ultimate goal, the agents have become somewhat confused. Despite having completed tasks, they have not ceased working, leading to a waste of resource. We propose similarity-based task timely termination method for image-based intelligent agents, This method involves recording the scenario state after the completion of each sub-task and comparing it with the fully completed task scenario state using a structural similarity method. The result is quantified and standardized into a structural similarity index, which is used to judge whether the task has been completed. Moreover, we categorize the types of agents based on model and created an image-based agent task dataset. In experimental results, the image-based agents using this method showed an average reduction of 1.94 steps in the number of steps to complete 20 task tests, a $$44.1\%$$ reduction in time costs, and a $$47.3\%$$ reduction in token costs. This method can effectively reduce the negative actions of image-based agents when they experience hallucinations, ensuring their tasks are completed excellently, and it can effectively reduce the waste of resources such as time, tokens, and hardware. Our project can be found at GitHub .
format Article
id doaj-art-44829fbfbee94bf388b1d927a0969144
institution Kabale University
issn 2045-2322
language English
publishDate 2024-12-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-44829fbfbee94bf388b1d927a09691442025-01-05T12:25:48ZengNature PortfolioScientific Reports2045-23222024-12-0114111410.1038/s41598-024-83463-8Similarity based task timely termination method for image based intelligent agentsSheng Jie0Xing Huang1Chengxi Jing2Xian Jiang3Ligang Dong4Zhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEAbstract Due to the hallucination of the underlying large language model(LLMs) or the unclear description of the task’s ultimate goal, the agents have become somewhat confused. Despite having completed tasks, they have not ceased working, leading to a waste of resource. We propose similarity-based task timely termination method for image-based intelligent agents, This method involves recording the scenario state after the completion of each sub-task and comparing it with the fully completed task scenario state using a structural similarity method. The result is quantified and standardized into a structural similarity index, which is used to judge whether the task has been completed. Moreover, we categorize the types of agents based on model and created an image-based agent task dataset. In experimental results, the image-based agents using this method showed an average reduction of 1.94 steps in the number of steps to complete 20 task tests, a $$44.1\%$$ reduction in time costs, and a $$47.3\%$$ reduction in token costs. This method can effectively reduce the negative actions of image-based agents when they experience hallucinations, ensuring their tasks are completed excellently, and it can effectively reduce the waste of resources such as time, tokens, and hardware. Our project can be found at GitHub .https://doi.org/10.1038/s41598-024-83463-8Intelligent agentLarge language modelApplicationTask terminationTimely
spellingShingle Sheng Jie
Xing Huang
Chengxi Jing
Xian Jiang
Ligang Dong
Similarity based task timely termination method for image based intelligent agents
Scientific Reports
Intelligent agent
Large language model
Application
Task termination
Timely
title Similarity based task timely termination method for image based intelligent agents
title_full Similarity based task timely termination method for image based intelligent agents
title_fullStr Similarity based task timely termination method for image based intelligent agents
title_full_unstemmed Similarity based task timely termination method for image based intelligent agents
title_short Similarity based task timely termination method for image based intelligent agents
title_sort similarity based task timely termination method for image based intelligent agents
topic Intelligent agent
Large language model
Application
Task termination
Timely
url https://doi.org/10.1038/s41598-024-83463-8
work_keys_str_mv AT shengjie similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents
AT xinghuang similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents
AT chengxijing similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents
AT xianjiang similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents
AT ligangdong similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents