Research and application of defense mechanism for prompt injection attack of large language model in financial industry

The large language models had a broad application prospect in the financial sector, and they were expected to play an important role in both asset management and wealth management. With the rapid development and wide application of large language models such as ChatGPT and GPT-4, attention to the se...

Full description

Saved in:

Bibliographic Details
Main Authors:	MOU Daen, WEI Zhihua, SUN Minglong, SONG Na, NI Lin
Format:	Article
Language:	English
Published:	POSTS&TELECOM PRESS Co., LTD 2024-10-01
Series:	网络与信息安全学报
Subjects:	financial large language model security prompt injection risk taxonomy large model dataset illegal risk detection
Online Access:	http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2024071
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841529529049284608
author	MOU Daen WEI Zhihua SUN Minglong SONG Na NI Lin
author_facet	MOU Daen WEI Zhihua SUN Minglong SONG Na NI Lin
author_sort	MOU Daen
collection	DOAJ
description	The large language models had a broad application prospect in the financial sector, and they were expected to play an important role in both asset management and wealth management. With the rapid development and wide application of large language models such as ChatGPT and GPT-4, attention to the security of large language models increased. The financial industry, characterized by strict regulations and supervision, demanded heightened security measures. Consequently, a comprehensive study on prompt injection attacks and a security defense framework was delved into in large language models within the financial sector. A risk taxonomy encompassing eight forms of input prompt injection attacks and five categories of safety scenarios on the output side was developed, and a financial domain large model prompt injection attack dataset, FIN-CSAPrompts, was collected using a human-machine adversarial approach. An end-to-end security defense framework against prompt injection attacks in large language models was proposed and tested, and comparative evaluations were performed using prevalent open-source large language models. The research indicated that in the financial industry, the application of the proposed security defense framework significantly enhanced the defensive capabilities of Chinese large language models, effectively reducing the generation of inappropriate content and improving their resilience against prompt injection attacks. This research provided a reference and foundation for further research on the security issues of Chinese large language models in the financial domain, offering datasets, evaluation metrics, and solutions for consideration and adaptation.
format	Article
id	doaj-art-6f136b7dace24418aff410bec522d874
institution	Kabale University
issn	2096-109X
language	English
publishDate	2024-10-01
publisher	POSTS&TELECOM PRESS Co., LTD
record_format	Article
series	网络与信息安全学报
spelling	doaj-art-6f136b7dace24418aff410bec522d8742025-01-15T03:17:21ZengPOSTS&TELECOM PRESS Co., LTD网络与信息安全学报2096-109X2024-10-011011913377772303Research and application of defense mechanism for prompt injection attack of large language model in financial industryMOU DaenWEI ZhihuaSUN MinglongSONG NaNI LinThe large language models had a broad application prospect in the financial sector, and they were expected to play an important role in both asset management and wealth management. With the rapid development and wide application of large language models such as ChatGPT and GPT-4, attention to the security of large language models increased. The financial industry, characterized by strict regulations and supervision, demanded heightened security measures. Consequently, a comprehensive study on prompt injection attacks and a security defense framework was delved into in large language models within the financial sector. A risk taxonomy encompassing eight forms of input prompt injection attacks and five categories of safety scenarios on the output side was developed, and a financial domain large model prompt injection attack dataset, FIN-CSAPrompts, was collected using a human-machine adversarial approach. An end-to-end security defense framework against prompt injection attacks in large language models was proposed and tested, and comparative evaluations were performed using prevalent open-source large language models. The research indicated that in the financial industry, the application of the proposed security defense framework significantly enhanced the defensive capabilities of Chinese large language models, effectively reducing the generation of inappropriate content and improving their resilience against prompt injection attacks. This research provided a reference and foundation for further research on the security issues of Chinese large language models in the financial domain, offering datasets, evaluation metrics, and solutions for consideration and adaptation.http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2024071financial large language model securityprompt injectionrisk taxonomylarge model datasetillegal risk detection
spellingShingle	MOU Daen WEI Zhihua SUN Minglong SONG Na NI Lin Research and application of defense mechanism for prompt injection attack of large language model in financial industry 网络与信息安全学报 financial large language model security prompt injection risk taxonomy large model dataset illegal risk detection
title	Research and application of defense mechanism for prompt injection attack of large language model in financial industry
title_full	Research and application of defense mechanism for prompt injection attack of large language model in financial industry
title_fullStr	Research and application of defense mechanism for prompt injection attack of large language model in financial industry
title_full_unstemmed	Research and application of defense mechanism for prompt injection attack of large language model in financial industry
title_short	Research and application of defense mechanism for prompt injection attack of large language model in financial industry
title_sort	research and application of defense mechanism for prompt injection attack of large language model in financial industry
topic	financial large language model security prompt injection risk taxonomy large model dataset illegal risk detection
url	http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2024071
work_keys_str_mv	AT moudaen researchandapplicationofdefensemechanismforpromptinjectionattackoflargelanguagemodelinfinancialindustry AT weizhihua researchandapplicationofdefensemechanismforpromptinjectionattackoflargelanguagemodelinfinancialindustry AT sunminglong researchandapplicationofdefensemechanismforpromptinjectionattackoflargelanguagemodelinfinancialindustry AT songna researchandapplicationofdefensemechanismforpromptinjectionattackoflargelanguagemodelinfinancialindustry AT nilin researchandapplicationofdefensemechanismforpromptinjectionattackoflargelanguagemodelinfinancialindustry

Research and application of defense mechanism for prompt injection attack of large language model in financial industry

Similar Items