Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs

IntroductionTraditional Chinese Medicine (TCM) case records encapsulate vast clinical experiences and theoretical insights, holding significant research and practical value. However, traditional case studies face challenges such as large data volumes, complex information, and difficulties in efficie...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuchen Duan, Qingqing Zhou, Yu Li, Chi Qin, Ziyang Wang, Hongxing Kan, Jili Hu
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Medicine
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fmed.2024.1512329/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841556728066342912
author Yuchen Duan
Qingqing Zhou
Yu Li
Chi Qin
Ziyang Wang
Hongxing Kan
Hongxing Kan
Jili Hu
Jili Hu
author_facet Yuchen Duan
Qingqing Zhou
Yu Li
Chi Qin
Ziyang Wang
Hongxing Kan
Hongxing Kan
Jili Hu
Jili Hu
author_sort Yuchen Duan
collection DOAJ
description IntroductionTraditional Chinese Medicine (TCM) case records encapsulate vast clinical experiences and theoretical insights, holding significant research and practical value. However, traditional case studies face challenges such as large data volumes, complex information, and difficulties in efficient retrieval and analysis. This study aimed to address these issues by leveraging modern data techniques to improve access and analysis of TCM case records.MethodsA total of 679 case records from Wang Zhongqi, a renowned physician of Xin’an Medicine, a branch of TCM, covering 41 diseases, were selected. The study involved four stages: pattern layer construction, knowledge extraction, integration, and data storage and visualization. A large language model (LLM) was employed to automatically extract key entities, including symptoms, pathogenesis, treatment principles, and prescriptions. These were structured into a TCM case knowledge graph.ResultsThe LLM successfully identified and extracted relevant entities, which were then organized into relational triples. A TCM case query system based on natural language input was developed. The system’s performance, evaluated using the RAGAS framework, achieved high scores: 0.9375 in faithfulness, 0.9686 in answer relevancy, and 0.9500 in context recall; In human evaluations, the levels of safety and usability are significantly higher than those of LLMs without using RAG.DiscussionThe results demonstrate that integrating LLMs with a knowledge graph significantly enhances the efficiency and accuracy of retrieving TCM case information. This approach could play a crucial role in modernizing TCM research and improving access to clinical insights. Future research may explore expanding the dataset and refining the query system for broader applications.
format Article
id doaj-art-7fcf42e7ac634a27bc8d36b76ca2761f
institution Kabale University
issn 2296-858X
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Medicine
spelling doaj-art-7fcf42e7ac634a27bc8d36b76ca2761f2025-01-07T06:43:34ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2025-01-011110.3389/fmed.2024.15123291512329Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphsYuchen Duan0Qingqing Zhou1Yu Li2Chi Qin3Ziyang Wang4Hongxing Kan5Hongxing Kan6Jili Hu7Jili Hu8School of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaCenter for Xin’an Medicine and Modernization of Traditional Chinese Medicine of IHM, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaCenter for Xin’an Medicine and Modernization of Traditional Chinese Medicine of IHM, Anhui University of Chinese Medicine, Hefei, ChinaIntroductionTraditional Chinese Medicine (TCM) case records encapsulate vast clinical experiences and theoretical insights, holding significant research and practical value. However, traditional case studies face challenges such as large data volumes, complex information, and difficulties in efficient retrieval and analysis. This study aimed to address these issues by leveraging modern data techniques to improve access and analysis of TCM case records.MethodsA total of 679 case records from Wang Zhongqi, a renowned physician of Xin’an Medicine, a branch of TCM, covering 41 diseases, were selected. The study involved four stages: pattern layer construction, knowledge extraction, integration, and data storage and visualization. A large language model (LLM) was employed to automatically extract key entities, including symptoms, pathogenesis, treatment principles, and prescriptions. These were structured into a TCM case knowledge graph.ResultsThe LLM successfully identified and extracted relevant entities, which were then organized into relational triples. A TCM case query system based on natural language input was developed. The system’s performance, evaluated using the RAGAS framework, achieved high scores: 0.9375 in faithfulness, 0.9686 in answer relevancy, and 0.9500 in context recall; In human evaluations, the levels of safety and usability are significantly higher than those of LLMs without using RAG.DiscussionThe results demonstrate that integrating LLMs with a knowledge graph significantly enhances the efficiency and accuracy of retrieving TCM case information. This approach could play a crucial role in modernizing TCM research and improving access to clinical insights. Future research may explore expanding the dataset and refining the query system for broader applications.https://www.frontiersin.org/articles/10.3389/fmed.2024.1512329/fulllarge language modelknowledge graphtraditional Chinese medicinequestion answering systeminterdisciplinary research
spellingShingle Yuchen Duan
Qingqing Zhou
Yu Li
Chi Qin
Ziyang Wang
Hongxing Kan
Hongxing Kan
Jili Hu
Jili Hu
Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
Frontiers in Medicine
large language model
knowledge graph
traditional Chinese medicine
question answering system
interdisciplinary research
title Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
title_full Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
title_fullStr Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
title_full_unstemmed Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
title_short Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
title_sort research on a traditional chinese medicine case based question answering system integrating large language models and knowledge graphs
topic large language model
knowledge graph
traditional Chinese medicine
question answering system
interdisciplinary research
url https://www.frontiersin.org/articles/10.3389/fmed.2024.1512329/full
work_keys_str_mv AT yuchenduan researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT qingqingzhou researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT yuli researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT chiqin researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT ziyangwang researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT hongxingkan researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT hongxingkan researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT jilihu researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs
AT jilihu researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs