Survey on large language models alignment research

With the rapid development of artificial intelligence technology, large language models have been widely applied in numerous fields. However, the potential of large language models to generate inaccurate, misleading, or even harmful contents has raised concerns about their reliability. Adopting alig...

Full description

Saved in:

Bibliographic Details
Main Authors:	LIU Kunlin, QU Xinji, TAN Fang, KANG Honghui, ZHAO Shaowei, SHI Rong
Format:	Article
Language:	zho
Published:	Beijing Xintong Media Co., Ltd 2024-06-01
Series:	Dianxin kexue
Subjects:	large language model alignment technique tune reinforcement learning
Online Access:	http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2024151/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841528841588178944
author	LIU Kunlin QU Xinji TAN Fang KANG Honghui ZHAO Shaowei SHI Rong
author_facet	LIU Kunlin QU Xinji TAN Fang KANG Honghui ZHAO Shaowei SHI Rong
author_sort	LIU Kunlin
collection	DOAJ
description	With the rapid development of artificial intelligence technology, large language models have been widely applied in numerous fields. However, the potential of large language models to generate inaccurate, misleading, or even harmful contents has raised concerns about their reliability. Adopting alignment techniques to ensure the behavior of large language models is consistent with human values has become an urgent issue to address. Recent research progress on alignment techniques for large language models were surveyed. Common methods for collecting instruction data and human preference datasets were introduced, research on supervised tuning and alignment adjustments was summarized, commonly used datasets and methods for model evaluation were discussed, and future research directions were concluded.
format	Article
id	doaj-art-44a90e4ec6b94a73a4fcf63e4aa892ab
institution	Kabale University
issn	1000-0801
language	zho
publishDate	2024-06-01
publisher	Beijing Xintong Media Co., Ltd
record_format	Article
series	Dianxin kexue
spelling	doaj-art-44a90e4ec6b94a73a4fcf63e4aa892ab2025-01-15T03:33:33ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012024-06-014017319464852434Survey on large language models alignment researchLIU KunlinQU XinjiTAN FangKANG HonghuiZHAO ShaoweiSHI RongWith the rapid development of artificial intelligence technology, large language models have been widely applied in numerous fields. However, the potential of large language models to generate inaccurate, misleading, or even harmful contents has raised concerns about their reliability. Adopting alignment techniques to ensure the behavior of large language models is consistent with human values has become an urgent issue to address. Recent research progress on alignment techniques for large language models were surveyed. Common methods for collecting instruction data and human preference datasets were introduced, research on supervised tuning and alignment adjustments was summarized, commonly used datasets and methods for model evaluation were discussed, and future research directions were concluded.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2024151/large language modelalignment techniquetunereinforcement learning
spellingShingle	LIU Kunlin QU Xinji TAN Fang KANG Honghui ZHAO Shaowei SHI Rong Survey on large language models alignment research Dianxin kexue large language model alignment technique tune reinforcement learning
title	Survey on large language models alignment research
title_full	Survey on large language models alignment research
title_fullStr	Survey on large language models alignment research
title_full_unstemmed	Survey on large language models alignment research
title_short	Survey on large language models alignment research
title_sort	survey on large language models alignment research
topic	large language model alignment technique tune reinforcement learning
url	http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2024151/
work_keys_str_mv	AT liukunlin surveyonlargelanguagemodelsalignmentresearch AT quxinji surveyonlargelanguagemodelsalignmentresearch AT tanfang surveyonlargelanguagemodelsalignmentresearch AT kanghonghui surveyonlargelanguagemodelsalignmentresearch AT zhaoshaowei surveyonlargelanguagemodelsalignmentresearch AT shirong surveyonlargelanguagemodelsalignmentresearch

Survey on large language models alignment research

Similar Items