CO39 | Artificial intelligence in hemophilia management: challenging the need for experts
Background and Aims: Hemophilia is a rare genetic disorder characterized by deficiencies in Factor VIII or Factor IX, which can lead to spontaneous and prolonged bleeding episodes, as well as bleeding caused by minor trauma. This occurs most frequently in patients with severe hemophilia who have no...
Saved in:
| Format: | Article |
|---|---|
| Language: | English |
| Published: |
PAGEPress Publications
2025-08-01
|
| Series: | Bleeding, Thrombosis and Vascular Biology |
| Subjects: | |
| Online Access: | https://www.btvb.org/btvb/article/view/246 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Background and Aims: Hemophilia is a rare genetic disorder characterized by deficiencies in Factor VIII or Factor IX, which can lead to spontaneous and prolonged bleeding episodes, as well as bleeding caused by minor trauma. This occurs most frequently in patients with severe hemophilia who have not initiated early prophylactic treatment or who do not receive prompt administration of clotting factor concentrates in response to bleeding events. Traditionally, specialized hematologists and dedicated Hemophilia Treatment Centers have been essential for accurate diagnosis and effective management of the disease. However, the recent advancement of artificial intelligence (AI) in healthcare has raised important questions regarding its potential to replace specialist physicians in disease management. This study aimed to evaluate the diagnostic and therapeutic accuracy of five AI models (ChatGPT (OpenAI), Gemini (Google), Copilot (Microsoft), DeepSeek AI and Perplexity AI) in responding to fundamental knowledge-based questionnaires on hemophilia and complex clinical scenarios.
Methods: Two sets of multiple-choice questionnaires were presented to five AI models. The first set of ten questions, generated by ChatGPT, covered essential aspects of hemophilia A and B, including diagnosis, traditional and novel treatments, disease incidence, and possible complications. The second set, independently created by an experienced hematologist, contained another ten questions on the same topics. Each question had four possible answers, with only one being correct. Following this, two complex clinical scenarios were proposed to assess the AI models’ ability to provide an accurate medical opinion. The first scenario involved the sudden loss of therapeutic response to prophylactic treatment in a severe hemophilia patient. The second scenario sought guidance on the optimal prophylactic regimen for an adolescent hemophiliac engaged in semi-professional sports activities.
Results: All five AI models demonstrated perfect accuracy, achieving a response rate of 100% (20/20 correct answers) across both questionnaire sets. To validate the consistency of the results, a chi-square statistical analysis was conducted comparing AI performance across the two sets of questions confirming no variance between responses (p>0.05). In response to the two complex clinical queries, each AI provided detailed and structured analyses, considering multiple possibilities, exploring differential diagnoses, and ultimately arriving at accurate and guideline-compliant conclusions. The AI-generated responses exhibited a level of depth and reasoning comparable to that of an experienced hematologist. Table 1.
Conclusions: The findings of this study, resumed in Table 1, highlight the impressive capabilities of AI in medical knowledge retrieval and clinical reasoning within hemophilia. Given the rarity of the disease and the historical necessity of specialized experts, this raises a profound question: has AI already reached a level where it could fully replace hematologists in hemophilia management? While AI demonstrates remarkable competency, further research and ethical considerations are required before determining whether it can autonomously guide patient care without human oversight. The integration of AI into clinical practice should be carefully assessed to ensure optimal patient outcomes while preserving the irreplaceable experience and judgment of medical specialists.
|
|---|---|
| ISSN: | 2785-5309 |