News|Articles|September 9, 2024

ChatGPT Outperforms Trainee Doctors in Assessing Pediatric Respiratory Illness

New findings highlight the potential role of artificial intelligence in supporting health care professionals, but thorough testing is needed before its integration into everyday clinical practice.

New research presented at the European Respiratory Society (ERS) Congress in Vienna, Austria, reveals that ChatGPT could assess complex cases of respiratory disease in children better than trainee doctors.¹

To come to this finding, 6 experts in pediatric respiratory medicine provided 6 clinical scenarios of cases such as cystic fibrosis, asthma, sleep-disordered breathing, breathlessness, and chest infections, that frequently occur in children. These scenarios were posed to 10 trainee doctors with less than 4 months of pediatric clinical experience, and they were given 1 hour to solve each case using internet resources, but not chatbots. These cases did not have an immediately clear diagnosis, and existing guidelines or evidence did not provide a definitive answer.

The 6 scenarios were also presented to 3 large language models (LLMs): ChatGPT version 3.5, Google’s Bard, and Microsoft Bing’s chatbot. The 6 experts then gave all responses a score out of 9 based on their correctness, comprehensiveness, usefulness, plausibility, and coherence, and answered whether they thought each response was generated by a human or chatbot.

Manjith Narayanan, MD, PhD, a consultant in pediatric pulmonology at the Royal Hospital for Children and Young People, presented the study’s findings at the ERS Congress 2024. He noted his motivation for the study was to determine how well LLMs can help clinicians in the real world.²

The results were intriguing. Trainee doctors scored a median (IQR) of 4 (3-6) points, the same as Bing (3-5), while Bard scored higher at 6 (5-7) and scored better than trainee doctors in coherence specifically (P < .05).

ChatGPT scored the highest overall with 7 of 9 points (6-8.25) and outperformed trainee doctors in all criteria (P < .001). Experts also believed ChatGPT had more human-like responses than responses from the other chatbots, as they correctly identified Bard and Bing responses as being nonhuman.

Notably, none of the chatbots showed signs of hallucination, a phenomenon where LLMs generate seemingly accurate but false information. However, there were occasional irrelevant responses from the chatbots and the trainee doctors, and experts should be aware of the potential of hallucinations.

According to Narayanan, this is the first study to test LLMs against trainee doctors in scenarios reflecting real-life clinical practice, and these results imply artificial intelligence (AI) could play a crucial role in alleviating pressure put on health care systems, although more research is needed.

“We have not directly tested how LLMs would work in patient facing roles,” Narayanan noted. “However, it could be used by triage nurses, trainee doctors, and primary care physicians, who are often the first to review a patient.”

Future studies will focus on comparing chatbot performance with that of more experienced doctors and exploring the capabilities of newer LLMs. The research team is also considering investigating how chatbots can assist with more complex cases and further testing for accuracy and safety in real-world clinical environments.

Hilary Pinnock, MD, chair of the ERS Education Council and professor of primary care respiratory medicine at The University of Edinburgh, called the study “fascinating” while also expressing caution.

“It is encouraging, but maybe also a bit scary, to see how a widely available AI tool like ChatGPT can provide solutions to complex cases of respiratory illness in children,” she said. “It certainly points the way to a brave new world of AI-supported care.”

However, as the researchers highlighted, it is crucial to ensure these chatbots and other generative AI tools do not cause errors before they can be implemented in everyday clinical practice. These mistakes can include fabricated or hallucinated information, and can be due to the AI being trained on data that inadequately represent the diverse populations it is meant to serve.

“As the researchers have demonstrated, AI holds out the promise of a new way of working, but we need extensive testing of clinical accuracy and safety, pragmatic assessment of organizational efficiency, and exploration of the societal implications before we embed this technology in routine care,” she added.

As AI continues to advance, this study signals a potential shift in the future of health care, where LLMs could become integrated into the clinical workflow, aiding professionals in delivering faster and more accurate diagnoses. However, the journey toward full adoption will require careful evaluation of clinical accuracy, organizational efficiency, and ethical considerations.

References

1. Juan J, Duverger K, Armstrong D, et al. Clinical scenarios in paediatric pulmonology: can large language models fare better than trainee doctors? Presented at: ERS Congress; September 7-11, 2024; Vienna, Austria. https://k4.ersnet.org/prod/v2/Front/Program/Session?e=549&session=17916

2. ChatGPT outperformed trainee doctors in assessing complex respiratory illness in children. News release. ERS. September 9, 2024. Accessed September 9, 2024. https://www.ersnet.org/news-releases/chatgpt-outperformed-trainee-doctors-in-assessing-complex-respiratory-illness-in-children/

Stay ahead of policy, cost, and value—subscribe to AJMC for expert insights at the intersection of clinical care and health economics.

Subscribe Now!

Latest CME

Online Article

New and Approved: FDA’s 2024 Drug Lineup

1.5 Credits / General Pharmacy

Online Article

Innovations in Medicine: 2024 Lineup of New and Approved Specialty Drugs

2.0 Credits / General Pharmacy

AJMC Supplement

Revolutionizing Acute Pain Relief: Emerging Nonopioid Therapies and the Essential Role of Managed Care

2.0 Credits / Pain Management

On-Demand Virtual Symposium

Advancing Immunotherapy in Endometrial Cancer: A Managed Care Perspective on Personalized Care

1.5 Credits / Gynecologic Cancer, Health Equity, Diversity & Inclusion, Oncology, Women's Health

On-Demand Virtual Symposium

Leveraging Managed Care to Optimize Patient Outcomes: Integrating Novel Treatments in Schizophrenia

1.5 Credits / Neurology, Psychiatry

On-Demand Virtual Symposium

Overcoming Operational and Clinical Barriers in Multiple Myeloma: Managed Care Strategies for Antibody-Based Regimens

1.5 Credits / Hematologic Cancer, Hematology, Oncology

On-Demand Virtual Symposium

Exploring Therapeutic Advances in Myelofibrosis and Key Considerations for Effective Management

1.5 Credits / Oncology, Hematology

On-Demand Virtual Symposium

Driving Better Outcomes in Hypertrophic Cardiomyopathy: A Managed Care Imperative

1.5 Credits / Cardiology

On-Demand Virtual Symposium

Optimizing the Uptake of Long-Acting Injectables in HIV Treatment and Prevention: Considerations for Managed Care

1.5 Credits / HIV/AIDS, Infectious Disease

On-Demand Virtual Symposium

Navigating the Advancements in Oral Therapy Options in HR+ Metastatic Breast Cancer

1.5 Credits / Breast Cancer, Oncology, Women's Health

On-Demand Virtual Symposium

Navigating the Changing Treatment Paradigm of Metabolic Dysfunction-Associated Steatohepatitis: A Guide for Managed Care Pharmacists

1.5 Credits / Endocrinology, Diabetes & Metabolism, Gastroenterology

On-Demand Virtual Symposium

Harnessing Data-Driven Insights and Innovations to Enhance AMD and DME Management: Strategic Approaches for Managed Care

1.5 Credits / Ophthalmology/Optometry

On-Demand Virtual Symposium

Evaluating the Effectiveness and Value of Novel Nonhormonal Treatments in the Management of Menopause-Related Vasomotor Symptoms

1.5 Credits / Women's Health

On-Demand Virtual Symposium

Managed Care Approaches and Models for Equitable Access to Innovations in Major Depressive Disorder Treatment

1.5 Credits / Psychiatry

On-Demand Virtual Symposium

Updated Guidance and Managed Care Strategies to Optimize Respiratory Syncytial Virus Vaccination Coverage

1.5 Credits / Immunization, Infectious Disease, Pulmonology

On-Demand Virtual Symposium

Leveraging Biologics and Immunotherapies in the Management of Food Allergies

1.5 Credits / Immunology, Allergy

On-Demand Virtual Symposium

Advancing Care in Neovascular Age-Related Macular Degeneration and Diabetic Macular Edema: Optimizing Outcomes With Emerging Therapies

1.5 Credits / Ophthalmology/Optometry

On-Demand Virtual Symposium

The Evolving Landscape of Transthyretin Amyloidosis Cardiomyopathy: New Therapies and Treatment Strategies

1.5 Credits / Cardiology, Rare Diseases

On-Demand Virtual Symposium

Reflecting on the Real-World Use of Biologic Therapy in Asthma Management

1.5 Credits / Immunology, Pulmonology

On-Demand Virtual Symposium

Advancing Cystic Fibrosis Management: The Evolving Role of Specialty and Managed Care Pharmacists

1.5 Credits / Pulmonology

On-Demand Virtual Symposium

Pulmonary Arterial Hypertension: Real-World Applications of New Therapies and Management Strategies

1.5 Credits / Pulmonology, Cardiology, Rare Diseases

On-Demand Virtual Symposium

Targeting Chronic Rhinosinusitis With Nasal Polyps With Biologics: Optimizing Outcomes and Reducing Health Care Burden

1.5 Credits / Immunology

On-Demand Virtual Symposium

Emerging Treatment Options for Inflammatory Bowel Disease: A Focus on IL-23 Pathway Inhibition

1.5 Credits / Gastroenterology, Immunology

On-Demand Virtual Symposium

Transforming Metabolic Dysfunction-Associated Steatohepatitis Management: Pharmacist Considerations for the Evolving Treatment Landscape

1.5 Credits / Gastroenterology

ChatGPT Outperforms Trainee Doctors in Assessing Pediatric Respiratory Illness

Newsletter

Related Content

CDC Reduces US Childhood Immunization Schedule From 17 to 11 Diseases

Adequately Addressing SDOH Could Alter Risk of Pediatric Long COVID

Sotorasib More Cost-Effective in KRAS G12C NSCLC

Incidence Declines, Survival Improves for Synchronous CRLM

GoodRx to Match Novo Nordisk Price for Oral Semaglutide

Latest CME

New and Approved: FDA’s 2024 Drug Lineup

Innovations in Medicine: 2024 Lineup of New and Approved Specialty Drugs

Revolutionizing Acute Pain Relief: Emerging Nonopioid Therapies and the Essential Role of Managed Care

Advancing Immunotherapy in Endometrial Cancer: A Managed Care Perspective on Personalized Care

Leveraging Managed Care to Optimize Patient Outcomes: Integrating Novel Treatments in Schizophrenia

Overcoming Operational and Clinical Barriers in Multiple Myeloma: Managed Care Strategies for Antibody-Based Regimens

Exploring Therapeutic Advances in Myelofibrosis and Key Considerations for Effective Management

Driving Better Outcomes in Hypertrophic Cardiomyopathy: A Managed Care Imperative

Optimizing the Uptake of Long-Acting Injectables in HIV Treatment and Prevention: Considerations for Managed Care

Navigating the Advancements in Oral Therapy Options in HR+ Metastatic Breast Cancer

Navigating the Changing Treatment Paradigm of Metabolic Dysfunction-Associated Steatohepatitis: A Guide for Managed Care Pharmacists

Harnessing Data-Driven Insights and Innovations to Enhance AMD and DME Management: Strategic Approaches for Managed Care

Evaluating the Effectiveness and Value of Novel Nonhormonal Treatments in the Management of Menopause-Related Vasomotor Symptoms

Managed Care Approaches and Models for Equitable Access to Innovations in Major Depressive Disorder Treatment

Updated Guidance and Managed Care Strategies to Optimize Respiratory Syncytial Virus Vaccination Coverage

Leveraging Biologics and Immunotherapies in the Management of Food Allergies

Advancing Care in Neovascular Age-Related Macular Degeneration and Diabetic Macular Edema: Optimizing Outcomes With Emerging Therapies

The Evolving Landscape of Transthyretin Amyloidosis Cardiomyopathy: New Therapies and Treatment Strategies

Reflecting on the Real-World Use of Biologic Therapy in Asthma Management

Advancing Cystic Fibrosis Management: The Evolving Role of Specialty and Managed Care Pharmacists

Pulmonary Arterial Hypertension: Real-World Applications of New Therapies and Management Strategies

Targeting Chronic Rhinosinusitis With Nasal Polyps With Biologics: Optimizing Outcomes and Reducing Health Care Burden

Emerging Treatment Options for Inflammatory Bowel Disease: A Focus on IL-23 Pathway Inhibition

Transforming Metabolic Dysfunction-Associated Steatohepatitis Management: Pharmacist Considerations for the Evolving Treatment Landscape

Innovations in Inflammatory Bowel Disease Therapy: How IL-23 Inhibitors Are Shaping Treatment and Managed Care Approaches

The Expanding Therapeutic Landscape in IgA Nephropathy: Applying Evidence-Based Strategies and Guideline Updates for Managed Care

Leveraging Novel Therapies to Transform Demodex Blepharitis Care (Pharmacy Technician Credit)

Leveraging Novel Therapies to Transform Demodex Blepharitis Care

The Changing Paradigm in Pain Management and Supporting Access to Novel Therapies

The Expanding Therapeutic Landscape in IgA Nephropathy: A Case-Based Exploration of Optimized Patient Outcomes

The Impact of Pharmacists and Pharmacy Technicians in Recognizing and Responding to Human Trafficking (Pharmacist Credit)

Innovations in Lymphoma Treatment and the Growing Impact of Bispecific Antibodies

Updated Guidance and Managed Care Strategies to Optimize Care in EGFR Mutated NSCLC

Payment for Pharmacist Services: 2025 Update

From Treatment to Prevention: Navigating the Expanding Hereditary Angioedema Treatment Landscape

Innovations in Hidradenitis Suppurativa Treatment: Navigating the Evolving Landscape

Paroxysmal Nocturnal Hemoglobinuria: Managed Care Strategies to Mitigate Burden and Enhance Outcomes

Optimizing Outcomes in Myasthenia Gravis: Therapeutic Advances and Value-Based Care Models

The Pharmacist's Role in Palliative and End of Life Symptom Management

Optimizing Lipid Management in Statin-Intolerant Populations: Payer Strategies for Evidence-Based Access and Risk Reduction

IL-23 Inhibitors in Psoriasis: Optimizing Access and Patient Outcomes Across Integrated Systems

The Expanding Therapeutic Landscape in IgA Nephropathy: Translating New Clinical Evidence and Updated Guidelines Into Managed Care Strategies

Bridging Innovation and Access in HR-Positive/HER2-Negative Metastatic Breast Cancer: Implications for Managed Care

Navigating Advanced Prostate Cancer Treatment: Optimizing Novel Therapeutic Strategies for Managed Care Pharmacists

Innovations in Retinal Therapies: A Managed Care Perspective on Anti-VEGF Advancements

New Horizons in ATTR-CM: Therapeutic Advances and Strategic Insights

Cardiorenal Protection With SGLT2 Inhibitors: Perspectives for Managed Care

Addressing Gaps in Care for the Rapid and Long-Term Management of Hyperkalemia With Novel Oral Potassium Binding Agents: Insights for Managed Care Professionals

Minimizing Injection Burden: Anti-VEGF Innovation for Retinal Disease Management

Bridging Clinical and Access Gaps in Phenylketonuria: A Managed Care Perspective

HER2-Positive Metastatic Breast Cancer: A Managed Care Perspective on Emerging Therapies and Clinical Data

An American Journal of Managed Care Forum: Bridging Evidence and Access in Advanced Small Cell Lung Cancer

Navigating the Complexities of Personalized Prostate Cancer Care: Insights for Oncology Managed Care

Opioids, Pain Management, and Substance Use Disorder: A Practical Overview

Trending on AJMC

CDC Reduces US Childhood Immunization Schedule From 17 to 11 Diseases