Research on multimodal social media information popularity prediction based on large language model

To address the limitations of strong feature dependency, insufficient generalization, and inadequate performance in few-shot/cold-start settings in existing multimodal social media popularity prediction algorithms, a MultiSmpLLM model based on large language model with instruction fine-tuning and hu...

Full description

Saved in:

Bibliographic Details
Main Authors:	WANG Jie, WANG Zitong, PENG Yan, HAO Bowen
Format:	Article
Language:	zho
Published:	Editorial Department of Journal on Communications 2024-11-01
Series:	Tongxin xuebao
Subjects:	large language model popularity prediction instruction fine-tuning human alignment
Online Access:	http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024193/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841537125725503488
author	WANG Jie WANG Zitong PENG Yan HAO Bowen
author_facet	WANG Jie WANG Zitong PENG Yan HAO Bowen
author_sort	WANG Jie
collection	DOAJ
description	To address the limitations of strong feature dependency, insufficient generalization, and inadequate performance in few-shot/cold-start settings in existing multimodal social media popularity prediction algorithms, a MultiSmpLLM model based on large language model with instruction fine-tuning and human alignment was proposed. Firstly, the task of multimodal social media popularity prediction for cold-start users was defined. Secondly, multimodal fine-tuning instructions were constructed, and the large language model (Llama3) was instructionally fine-tuned using the low-rank adaptation (LoRA) and parameter freeze (Freeze) method. Finally, an improved direct preference optimization (DPO) algorithm IDPOP was developed by constructing preference data and adding a parameter-tuned penalty to the DPO loss function, resolving instability and non-convergence in RLHF and incorrect optimization in standard DPO for social media popularity prediction. Experiments show MultiSmpLLM outperforms conventional multimodal prediction models and multimodal large language models such as GPT-4o.
format	Article
id	doaj-art-bd05c145bb0c4603840c6eef2030bf29
institution	Kabale University
issn	1000-436X
language	zho
publishDate	2024-11-01
publisher	Editorial Department of Journal on Communications
record_format	Article
series	Tongxin xuebao
spelling	doaj-art-bd05c145bb0c4603840c6eef2030bf292025-01-14T08:46:18ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2024-11-014514115679134326Research on multimodal social media information popularity prediction based on large language modelWANG JieWANG ZitongPENG YanHAO BowenTo address the limitations of strong feature dependency, insufficient generalization, and inadequate performance in few-shot/cold-start settings in existing multimodal social media popularity prediction algorithms, a MultiSmpLLM model based on large language model with instruction fine-tuning and human alignment was proposed. Firstly, the task of multimodal social media popularity prediction for cold-start users was defined. Secondly, multimodal fine-tuning instructions were constructed, and the large language model (Llama3) was instructionally fine-tuned using the low-rank adaptation (LoRA) and parameter freeze (Freeze) method. Finally, an improved direct preference optimization (DPO) algorithm IDPOP was developed by constructing preference data and adding a parameter-tuned penalty to the DPO loss function, resolving instability and non-convergence in RLHF and incorrect optimization in standard DPO for social media popularity prediction. Experiments show MultiSmpLLM outperforms conventional multimodal prediction models and multimodal large language models such as GPT-4o.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024193/large language modelpopularity predictioninstruction fine-tuninghuman alignment
spellingShingle	WANG Jie WANG Zitong PENG Yan HAO Bowen Research on multimodal social media information popularity prediction based on large language model Tongxin xuebao large language model popularity prediction instruction fine-tuning human alignment
title	Research on multimodal social media information popularity prediction based on large language model
title_full	Research on multimodal social media information popularity prediction based on large language model
title_fullStr	Research on multimodal social media information popularity prediction based on large language model
title_full_unstemmed	Research on multimodal social media information popularity prediction based on large language model
title_short	Research on multimodal social media information popularity prediction based on large language model
title_sort	research on multimodal social media information popularity prediction based on large language model
topic	large language model popularity prediction instruction fine-tuning human alignment
url	http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024193/
work_keys_str_mv	AT wangjie researchonmultimodalsocialmediainformationpopularitypredictionbasedonlargelanguagemodel AT wangzitong researchonmultimodalsocialmediainformationpopularitypredictionbasedonlargelanguagemodel AT pengyan researchonmultimodalsocialmediainformationpopularitypredictionbasedonlargelanguagemodel AT haobowen researchonmultimodalsocialmediainformationpopularitypredictionbasedonlargelanguagemodel

Research on multimodal social media information popularity prediction based on large language model

Similar Items