Similarity based task timely termination method for image based intelligent agents
Abstract Due to the hallucination of the underlying large language model(LLMs) or the unclear description of the task’s ultimate goal, the agents have become somewhat confused. Despite having completed tasks, they have not ceased working, leading to a waste of resource. We propose similarity-based t...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Portfolio
2024-12-01
|
Series: | Scientific Reports |
Subjects: | |
Online Access: | https://doi.org/10.1038/s41598-024-83463-8 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841559493570199552 |
---|---|
author | Sheng Jie Xing Huang Chengxi Jing Xian Jiang Ligang Dong |
author_facet | Sheng Jie Xing Huang Chengxi Jing Xian Jiang Ligang Dong |
author_sort | Sheng Jie |
collection | DOAJ |
description | Abstract Due to the hallucination of the underlying large language model(LLMs) or the unclear description of the task’s ultimate goal, the agents have become somewhat confused. Despite having completed tasks, they have not ceased working, leading to a waste of resource. We propose similarity-based task timely termination method for image-based intelligent agents, This method involves recording the scenario state after the completion of each sub-task and comparing it with the fully completed task scenario state using a structural similarity method. The result is quantified and standardized into a structural similarity index, which is used to judge whether the task has been completed. Moreover, we categorize the types of agents based on model and created an image-based agent task dataset. In experimental results, the image-based agents using this method showed an average reduction of 1.94 steps in the number of steps to complete 20 task tests, a $$44.1\%$$ reduction in time costs, and a $$47.3\%$$ reduction in token costs. This method can effectively reduce the negative actions of image-based agents when they experience hallucinations, ensuring their tasks are completed excellently, and it can effectively reduce the waste of resources such as time, tokens, and hardware. Our project can be found at GitHub . |
format | Article |
id | doaj-art-44829fbfbee94bf388b1d927a0969144 |
institution | Kabale University |
issn | 2045-2322 |
language | English |
publishDate | 2024-12-01 |
publisher | Nature Portfolio |
record_format | Article |
series | Scientific Reports |
spelling | doaj-art-44829fbfbee94bf388b1d927a09691442025-01-05T12:25:48ZengNature PortfolioScientific Reports2045-23222024-12-0114111410.1038/s41598-024-83463-8Similarity based task timely termination method for image based intelligent agentsSheng Jie0Xing Huang1Chengxi Jing2Xian Jiang3Ligang Dong4Zhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEZhejiang Gongshang University, SIEEAbstract Due to the hallucination of the underlying large language model(LLMs) or the unclear description of the task’s ultimate goal, the agents have become somewhat confused. Despite having completed tasks, they have not ceased working, leading to a waste of resource. We propose similarity-based task timely termination method for image-based intelligent agents, This method involves recording the scenario state after the completion of each sub-task and comparing it with the fully completed task scenario state using a structural similarity method. The result is quantified and standardized into a structural similarity index, which is used to judge whether the task has been completed. Moreover, we categorize the types of agents based on model and created an image-based agent task dataset. In experimental results, the image-based agents using this method showed an average reduction of 1.94 steps in the number of steps to complete 20 task tests, a $$44.1\%$$ reduction in time costs, and a $$47.3\%$$ reduction in token costs. This method can effectively reduce the negative actions of image-based agents when they experience hallucinations, ensuring their tasks are completed excellently, and it can effectively reduce the waste of resources such as time, tokens, and hardware. Our project can be found at GitHub .https://doi.org/10.1038/s41598-024-83463-8Intelligent agentLarge language modelApplicationTask terminationTimely |
spellingShingle | Sheng Jie Xing Huang Chengxi Jing Xian Jiang Ligang Dong Similarity based task timely termination method for image based intelligent agents Scientific Reports Intelligent agent Large language model Application Task termination Timely |
title | Similarity based task timely termination method for image based intelligent agents |
title_full | Similarity based task timely termination method for image based intelligent agents |
title_fullStr | Similarity based task timely termination method for image based intelligent agents |
title_full_unstemmed | Similarity based task timely termination method for image based intelligent agents |
title_short | Similarity based task timely termination method for image based intelligent agents |
title_sort | similarity based task timely termination method for image based intelligent agents |
topic | Intelligent agent Large language model Application Task termination Timely |
url | https://doi.org/10.1038/s41598-024-83463-8 |
work_keys_str_mv | AT shengjie similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents AT xinghuang similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents AT chengxijing similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents AT xianjiang similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents AT ligangdong similaritybasedtasktimelyterminationmethodforimagebasedintelligentagents |