Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
IntroductionTraditional Chinese Medicine (TCM) case records encapsulate vast clinical experiences and theoretical insights, holding significant research and practical value. However, traditional case studies face challenges such as large data volumes, complex information, and difficulties in efficie...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-01-01
|
Series: | Frontiers in Medicine |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fmed.2024.1512329/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841556728066342912 |
---|---|
author | Yuchen Duan Qingqing Zhou Yu Li Chi Qin Ziyang Wang Hongxing Kan Hongxing Kan Jili Hu Jili Hu |
author_facet | Yuchen Duan Qingqing Zhou Yu Li Chi Qin Ziyang Wang Hongxing Kan Hongxing Kan Jili Hu Jili Hu |
author_sort | Yuchen Duan |
collection | DOAJ |
description | IntroductionTraditional Chinese Medicine (TCM) case records encapsulate vast clinical experiences and theoretical insights, holding significant research and practical value. However, traditional case studies face challenges such as large data volumes, complex information, and difficulties in efficient retrieval and analysis. This study aimed to address these issues by leveraging modern data techniques to improve access and analysis of TCM case records.MethodsA total of 679 case records from Wang Zhongqi, a renowned physician of Xin’an Medicine, a branch of TCM, covering 41 diseases, were selected. The study involved four stages: pattern layer construction, knowledge extraction, integration, and data storage and visualization. A large language model (LLM) was employed to automatically extract key entities, including symptoms, pathogenesis, treatment principles, and prescriptions. These were structured into a TCM case knowledge graph.ResultsThe LLM successfully identified and extracted relevant entities, which were then organized into relational triples. A TCM case query system based on natural language input was developed. The system’s performance, evaluated using the RAGAS framework, achieved high scores: 0.9375 in faithfulness, 0.9686 in answer relevancy, and 0.9500 in context recall; In human evaluations, the levels of safety and usability are significantly higher than those of LLMs without using RAG.DiscussionThe results demonstrate that integrating LLMs with a knowledge graph significantly enhances the efficiency and accuracy of retrieving TCM case information. This approach could play a crucial role in modernizing TCM research and improving access to clinical insights. Future research may explore expanding the dataset and refining the query system for broader applications. |
format | Article |
id | doaj-art-7fcf42e7ac634a27bc8d36b76ca2761f |
institution | Kabale University |
issn | 2296-858X |
language | English |
publishDate | 2025-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Medicine |
spelling | doaj-art-7fcf42e7ac634a27bc8d36b76ca2761f2025-01-07T06:43:34ZengFrontiers Media S.A.Frontiers in Medicine2296-858X2025-01-011110.3389/fmed.2024.15123291512329Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphsYuchen Duan0Qingqing Zhou1Yu Li2Chi Qin3Ziyang Wang4Hongxing Kan5Hongxing Kan6Jili Hu7Jili Hu8School of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaCenter for Xin’an Medicine and Modernization of Traditional Chinese Medicine of IHM, Anhui University of Chinese Medicine, Hefei, ChinaSchool of Medical Information Engineering, Anhui University of Chinese Medicine, Hefei, ChinaCenter for Xin’an Medicine and Modernization of Traditional Chinese Medicine of IHM, Anhui University of Chinese Medicine, Hefei, ChinaIntroductionTraditional Chinese Medicine (TCM) case records encapsulate vast clinical experiences and theoretical insights, holding significant research and practical value. However, traditional case studies face challenges such as large data volumes, complex information, and difficulties in efficient retrieval and analysis. This study aimed to address these issues by leveraging modern data techniques to improve access and analysis of TCM case records.MethodsA total of 679 case records from Wang Zhongqi, a renowned physician of Xin’an Medicine, a branch of TCM, covering 41 diseases, were selected. The study involved four stages: pattern layer construction, knowledge extraction, integration, and data storage and visualization. A large language model (LLM) was employed to automatically extract key entities, including symptoms, pathogenesis, treatment principles, and prescriptions. These were structured into a TCM case knowledge graph.ResultsThe LLM successfully identified and extracted relevant entities, which were then organized into relational triples. A TCM case query system based on natural language input was developed. The system’s performance, evaluated using the RAGAS framework, achieved high scores: 0.9375 in faithfulness, 0.9686 in answer relevancy, and 0.9500 in context recall; In human evaluations, the levels of safety and usability are significantly higher than those of LLMs without using RAG.DiscussionThe results demonstrate that integrating LLMs with a knowledge graph significantly enhances the efficiency and accuracy of retrieving TCM case information. This approach could play a crucial role in modernizing TCM research and improving access to clinical insights. Future research may explore expanding the dataset and refining the query system for broader applications.https://www.frontiersin.org/articles/10.3389/fmed.2024.1512329/fulllarge language modelknowledge graphtraditional Chinese medicinequestion answering systeminterdisciplinary research |
spellingShingle | Yuchen Duan Qingqing Zhou Yu Li Chi Qin Ziyang Wang Hongxing Kan Hongxing Kan Jili Hu Jili Hu Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs Frontiers in Medicine large language model knowledge graph traditional Chinese medicine question answering system interdisciplinary research |
title | Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs |
title_full | Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs |
title_fullStr | Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs |
title_full_unstemmed | Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs |
title_short | Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs |
title_sort | research on a traditional chinese medicine case based question answering system integrating large language models and knowledge graphs |
topic | large language model knowledge graph traditional Chinese medicine question answering system interdisciplinary research |
url | https://www.frontiersin.org/articles/10.3389/fmed.2024.1512329/full |
work_keys_str_mv | AT yuchenduan researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT qingqingzhou researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT yuli researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT chiqin researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT ziyangwang researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT hongxingkan researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT hongxingkan researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT jilihu researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs AT jilihu researchonatraditionalchinesemedicinecasebasedquestionansweringsystemintegratinglargelanguagemodelsandknowledgegraphs |