PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Study shows ChatGPT failed when challenging ESCMID guideline for treating brain abscesses

Study highlights limits of AI tool as a medical device.

2024-04-29
(Press-News.org) With artificial intelligence (AI) poised to become a fundamental part of clinical research and decision making, many still question the accuracy of ChatGPT, a sophisticated AI language model, to support complex diagnostic and treatment processes.

Now a new study, being presented at this year’s ESCMID Global Congress (formerly ECCMID) in Barcelona, Spain (27-30 April), which pitted ChatGPT against the ESCMID guideline for the management of brain abscesses, found that while ChatGPT seems able to give recommendations on key questions about diagnosis and treatment in most cases, some of the AI model’s responses could put patients at risk.

The study was conducted by members of the ESCMID Study Group for Infectious Diseases of the Brain (ESGIB), and is published in The Journal of Neurology.

“Anything less than 100% is a failure when you’re dealing with patient safety”, says author Dr Susanne Dyckhoff-Shen from LMU University Hospital Munich in Germany and a member of ESCMID. “While we are amazed by ChatGPT’s knowledge on the management of brain abscesses, there are some key limitations when it comes to using the AI model as a medical device, including potential patient harm and the lack of transparency about which data are used to provide responses.”

The ability of AI to rapidly assimilate, process, and interpret vast data sets offers tantalising prospects. But are time-consuming processes to create medical guidelines still necessary, or could AI models trained on a wealth of scientific medical literature rival clinical experts in answering complex clinical questions?

Brain abscesses are a potentially life-threatening central nervous system (CNS) infection that require immediate identification and treatment to prevent severe neurological complications and even death.

Historically, the management of brain abscesses has been largely guided by clinical experience and limited studies, but in 2023 ESCMID fulfilled the need for a standardised approach by developing an international guideline [1].

To find out whether ChatGPT is able to professionally evaluate medical research and give scientifically valid recommendations, a European team of researchers tested the AI model to see whether it could accurately provide answers to 10 key questions on brain abscess diagnostics and treatment in comparison to the ESCMID guideline.

First, the researchers asked ChatGPT (version 4) to answer 10 questions that had been developed and appraised by the ESCMID committee for their brain abscess guideline without any additional information.

Then, ChatGPT was additionally primed with the text of the same scientific research articles that were used to develop the guideline before asking the same questions. This was done to see if ChatGPT could provide more aligned recommendations when given the same data used for guideline development.

The AI-generated responses were then compared to the recommendations of the ESCMID guideline by three independent infectious CNS disease experts for their clarity, alignment with the guideline, and patient risk.

Clear responses to most key questions

The researchers found that overall, for 17 out of 20 questions asked (with and without data input), ChatGPT’s responses were clear on the management of patients with brain abscess, including grade of evidence and strength of recommendation, with clarity assessed at 80-90% (see link to poster in notes to editors).

However, the AI model did not provide clear enough answers to guide physicians on treatment decisions on withholding microbials until surgery and prophylactic antiepileptic treatment (questions 2 and 10).

More incorrect advice and risk to patients with data prompting

Without additional data input, ChatGPT’s responses to 70% (7/10) of questions were very similar to the guideline recommendation. However, the AI model failed to come up with the correct advice on three questions relating to withholding microbials, consolidation therapy, and prophylactic antiepileptic treatment (questions 2, 8, and 10). Importantly, however, these incorrect responses would not have harmed patients (see link to poster in notes to editors).

Surprisingly, data input resulted in fewer correct answers (40%) including two recommendations that directly contradicted the guideline, that could have put patients at risk.

Question 6 about duration of antimicrobial therapy for bacterial brain abscess was answered by ChatGPT after data entry as “intravenous administration for about 4 weeks, followed by 12 weeks oral medication”, but the ESCMID guideline recommends “a total duration of 6-8 weeks of intravenous antimicrobials…”.

For question 7 about early transition to oral antimicrobials, after data input ChatGPT recommended that “an early switch to oral antibiotics during the first 14 days of treatment…seems to be associated with favorable outcomes in selected patients”. However, the ESCMID guideline committee judged that there was insufficient evidence to provide a recommendation for this question.

In both cases, following ChatGPT’s advice could have potentially led to patient harm.

“The fact that ChatGPT’s recommendations were inferior after data entry might be due to an overvaluation of the few observational studies provided for key questions 6 and 7. For one of those, even the guideline committee was not able to give a recommendation as the evidence was insufficient to answer the question”, says senior author Professor Mattias Klein from LMU University Hospital Munich in Germany and a member of the ESCMID committee which established the guideline.

“As the exact operating procedures of ChatGPT remain unclear, we speculate that while the AI model can process large amounts of data quickly, it may lack the ability to correctly classify and weigh the data based on their scientific quality. Moreover, it remains unclear which data are used for ChatGPT’s responses as it does not disclose the sources of its answers, which risks dubious literature being used.”

Dr Dyckhoff-Shen adds, “It is alarming to think that patients could have come to harm if ChatGPT’s advice on two key questions had been followed. The nuanced expertise of expert committees remains essential, especially to answer complex clinical queries. Blindly relying on AI could put patients at risk.”

Nevertheless, the authors note that ChatGPT’s knowledge was from before September 2021 and the questions in the study covered some extremely complex medical issues, some of which are controversial even among experts and for which hardly any robust data are available. However, even when primed with the same research articles that were used to develop the ESCMID guideline, ChatGPT’s advice aligned even less with the guideline. They recommend that the quality of ChatGPT should be reviewed on an ongoing basis following its evolution and further development.

The authors explain further that ChatGPT, like many AI models, has a cutoff date for the information it can access. For the current version as of today, it was last trained on data up until January 2022. “This means that while it can provide responses based on a wide range of information, it doesn't have access to real-time data or events occurring after that date,” says Dr Dyckhoff-Shen. “When we used ChatGPT for our study before August 2023, it was only trained on data up until September 2021. There was no possibility for us to get a more up-to-date version at that time because ChatGPT was not trained further yet. This is also the reason why we used a second approach by prompting ChatGPT with relevant scientific articles that were used by the ESGIB group to give recommendations in the ESCMID guideline so that we sort of ‘manually’ tried to get it more up-to-date.

“In the future, it would be interesting to re-assess ChatGPT’s knowledge in the future after internal optimisation processes. However, once the chatbot has access to the ESCMID guideline itself, it could just use the recommendations from the guideline thus rendering a comparison no longer useful.”

 

END


ELSE PRESS RELEASES FROM THIS DATE:

Study finds resistance to critically important antibiotics in uncooked meat sold for human and animal consumption

2024-04-29
**Note: the release below is from the ESCMID Global Congress (formerly ECCMID, Barcelona, Spain, 27-30 April). Please credit the congress if you use this story** New research presented at the ESCMID Global Congress (formerly ECCMID) in Barcelona, Spain (27-30 April) has found substantial levels of resistance to critically important antibiotics in meat sold for human and animal consumption. The study is by Dr Jordan Sealey, Professor Matthew Avison and colleagues from the University of Bristol, UK. Meat sold for consumption by humans and companion animals in the UK is regulated by the UK Government Food Standards Agency (FSA) to ensure it falls within bacterial limits deemed ...

Global cervical cancer vaccine roll-out shows it to be very effective in reducing cervical cancer and other HPV-related disease, but huge variations between countries in coverage

2024-04-28
With the creation of safe and efficacious vaccines to target human papillomavirus in the first decade of this century, WHO has an ambitious target to lower cervical cancer incidence (mostly caused by HPV) and mortality by 30% by 2030, meaning each country has a target of vaccinating 90% of girls by age 15, 70% of women receiving a high precision screening test at least at age 35 and 45 years of age, and 90% of all women requiring treatment at any age to receive it. The targets are aspirational and as yet, it appears no country has been ...

Negativity about vaccines surged on Twitter after COVID-19 jabs become available

2024-04-28
It’s time to take a new approach to addressing negative messaging about vaccines, including avoiding the use of the term “anti-vaxxers”, say the researchers. **ECCMID has now changed name to ESCMID Global, please credit ESCMID Global Congress in all future stories** There was a marked increase in negativity about vaccines on Twitter after COVID-19 vaccines became available, the ESCMID Global Congress (formerly ECCMID) in Barcelona, Spain (27-30 April) will hear. The analysis also found that spikes in the number of negative tweets coincided with announcements ...

Global measles cases almost double in a year

2024-04-28
Provisional global total of 321,582 cases for 2023 is almost double (88% higher) than 2022 figure of 171,153 **ECCMID has now changed name to ESCMID Global, please credit ESCMID Global Congress in all future stories** Measles outbreaks are still occurring and in some cases increasing, among a wide variety of countries, raising concerns of an acceleration similar to just before the COVID pandemic. Dr Patrick O’Connor, of WHO Headquarters, Geneva, Switzerland will give an overview of the global measles situation at this year’s ...

Lower dose of mpox vaccine is safe and generates six-week antibody response equivalent to standard regimen

Lower dose of mpox vaccine is safe and generates six-week antibody response equivalent to standard regimen
2024-04-27
WHAT:  A dose-sparing intradermal mpox vaccination regimen was safe and generated an antibody response equivalent to that induced by the standard regimen at six weeks (two weeks after the second dose), according to findings presented today at the European Society of Clinical Microbiology and Infectious Diseases Global Congress in Barcelona. The results suggest that antibody responses contributed to the effectiveness of dose-sparing mpox vaccine regimens used during the 2022 U.S. outbreak.  The ...

Personalised “cocktails” of antibiotics, probiotics and prebiotics hold great promise in treating a common form of irritable bowel syndrome, pilot study finds

2024-04-27
Personalised “cocktails” of antibiotics, probiotics and prebiotics hold great promise in the treatment of a common form of irritable bowel syndrome (IBS), the ESCMID Global Congress (formerly ECCMID) in Barcelona, Spain (27-30 April) will hear. Post-infection IBS (PI-IBS) is a form of irritable bowel syndrome that occurs after gastroenteritis or food poisoning.  Lead researcher Professor Maurizio Sanguinetti, of the Università Cattolica del Sacro Cuore, Rome, Italy, says: “Estimates vary, but research ...

Experts developing immune-enhancing therapies to target tuberculosis

2024-04-27
Some immune-enhancing therapies already exist, and their use could be investigated while other immunomodulatory drugs specific to different sub-groups of TB disease are being developed Experts are working on novel immune-enhancing therapies called host-directed therapies to use the body’s own immune system to target tuberculosis, with hopes that they could tackle even the drug-resistant forms of the disease. In a presentation at this year’s ESCMID Global Congress (formerly ECCMID) in Barcelona (27-30 April), Associate Professor Susanna Brighenti, Center for Infectious Medicine (CIM), ANA Futura at the Karolinska ...

Making transfusion-transmitted malaria in Europe a thing of the past

2024-04-27
Individuals requiring blood transfusion are a vulnerable population, often with debilitating conditions like cancer, and transfusion-transmitted malaria is often fatal. Current serological tests used to identify “at risk” donors are not sensitive enough to completely eliminate malaria transfusion risk. The current challenges are to optimise the testing strategy to minimise the loss of blood products (especially those rare blood phenotypes) and to enhance screening sensitivity for infectious immuno-tolerant donors who are difficult to identify. **ECCMID has now changed its name to ESCMID Global, please credit ...

Experts developing way to harness Nobel Prize winning CRISPR technology to deal with antimicrobial resistance (AMR)

2024-04-27
**ECCMID has now changed its name to ESCMID Global, please credit ESCMID Global Congress (formerly ECCMID, Barcelona, Spain, 27-30 April) in all future stories** Antimicrobial resistance (AMR) is continuing to increase globally, with rates of AMR in most pathogens increasing and threatening a future in which every day medical procedures may no longer be possible and infections thought long dealt with could kill regularly again. As such, new tools to battle AMR are vitally needed. In a new research review at this year’s ESCMID Global Congress (formerly ECCMID – Barcelona 27-30 April) shows ...

CRISPR is promising to tackle antimicrobial resistance, but remember bacteria can fight back

2024-04-27
In the second new research review on this subject, Assistant Prof. Ibrahim Bitar, Department of Microbiology, Faculty of Medicine and University Hospital in Plzen, Charles University in Prague, Plzen, Czech Republic, will give an overview of the molecular biology of CRISPR technology in explaining how it can used to tackle antimicrobial resistance. Clustered regularly interspaced short palindromic repeats (CRISPRs) and CRISPR associated genes (cas) are widespread in the genome of many bacteria and are a defence mechanism against foreign ...

LAST 30 PRESS RELEASES:

Innovative risk score accurately calculates which kidney transplant candidates are also at risk for heart attack or stroke, new study finds

Kidney outcomes in transthyretin amyloid cardiomyopathy

Partial cardiac denervation to prevent postoperative atrial fibrillation after coronary artery bypass grafting

Finerenone in women and men with heart failure with mildly reduced or preserved ejection fraction

Finerenone, serum potassium, and clinical outcomes in heart failure with mildly reduced or preserved ejection fraction

Hormone therapy reshapes the skeleton in transgender individuals who previously blocked puberty

Evaluating performance and agreement of coronary heart disease polygenic risk scores

Heart failure in zero gravity— external constraint and cardiac hemodynamics

Amid record year for dengue infections, new study finds climate change responsible for 19% of today’s rising dengue burden

New study finds air pollution increases inflammation primarily in patients with heart disease

AI finds undiagnosed liver disease in early stages

The American Society of Tropical Medicine and Hygiene and the Bill & Melinda Gates Foundation announce new research fellowship in malaria genomics in honor of professor Dominic Kwiatkowski

Excessive screen time linked to early puberty and accelerated bone growth

First nationwide study discovers link between delayed puberty in boys and increased hospital visits

Traditional Mayan practices have long promoted unique levels of family harmony. But what effect is globalization having?

New microfluidic device reveals how the shape of a tumour can predict a cancer’s aggressiveness

Speech Accessibility Project partners with The Matthew Foundation, Massachusetts Down Syndrome Congress

Mass General Brigham researchers find too much sitting hurts the heart

New study shows how salmonella tricks gut defenses to cause infection

Study challenges assumptions about how tuberculosis bacteria grow

NASA Goddard Lidar team receives Center Innovation Award for Advancements

Can AI improve plant-based meats?

How microbes create the most toxic form of mercury

‘Walk this Way’: FSU researchers’ model explains how ants create trails to multiple food sources

A new CNIC study describes a mechanism whereby cells respond to mechanical signals from their surroundings

Study uncovers earliest evidence of humans using fire to shape the landscape of Tasmania

Researchers uncover Achilles heel of antibiotic-resistant bacteria

Scientists uncover earliest evidence of fire use to manage Tasmanian landscape

Interpreting population mean treatment effects in the Kansas City Cardiomyopathy Questionnaire

Targeting carbohydrate metabolism in colorectal cancer: Synergy of therapies

[Press-News.org] Study shows ChatGPT failed when challenging ESCMID guideline for treating brain abscesses
Study highlights limits of AI tool as a medical device.