
Research identifies blind spots in AI medical triage

First independent evaluation of ChatGPT Health raises questions about safety of consumer AI tools for urgent medical decisions

2026-02-24
(Press-News.org) New York, NY [February 24, 2026] — ChatGPT Health, a widely used consumer artificial intelligence (AI) tool that provides health guidance directly to the public—including advice about how urgently to seek medical care—may fail to direct users appropriately to emergency care in a significant number of serious cases, according to researchers at the Icahn School of Medicine at Mount Sinai.

The study, published on a fast track in the February 23, 2026, online issue of Nature Medicine [https://doi.org/10.1038/s41591-026-04297-7], is the first independent safety evaluation of the large language model (LLM)-based tool since its January 2026 launch. It also identified serious concerns with the tool’s suicide-crisis safeguards.

“LLMs have become patients’ first stop for medical advice—but in 2026 they are least safe at the clinical extremes, where judgment separates missed emergencies from needless alarm,” says Isaac S. Kohane, MD, PhD, Chair, Department of Biomedical Informatics at Harvard Medical School, who was not involved with the research. “When millions of people are using an AI system to decide whether they need emergency care, the stakes are extraordinarily high. Independent evaluation should be routine, not optional.”

Within weeks of its release, ChatGPT Health’s maker, OpenAI, reported that about 40 million people were using the tool daily to seek health information and guidance, including advice about whether to seek urgent or emergency care. At the same time, say the investigators, there was little independent evidence about how safe or reliable its advice actually was.

“That gap motivated our study,” says lead author Ashwin Ramaswamy, MD, Instructor of Urology at the Icahn School of Medicine at Mount Sinai. “We wanted to answer a very basic but critical question: if someone is experiencing a real medical emergency and turns to ChatGPT Health for help, will it clearly tell them to go to the emergency room?”

With respect to suicide-risk alerts, ChatGPT Health was designed to direct users to the 988 Suicide and Crisis Lifeline in high-risk situations. However, the investigators found that these alerts appeared inconsistently, sometimes triggering in lower-risk scenarios while—alarmingly—failing to appear when users described specific plans for self-harm.

“This was a particularly surprising and concerning finding,” says senior and co-corresponding study author Girish N. Nadkarni, MD, MPH, Barbara T. Murphy Chair of the Windreich Department of Artificial Intelligence and Human Health, Director of the Hasso Plattner Institute for Digital Health, and Irene and Dr. Arthur M. Fishberg Professor of Medicine at the Icahn School of Medicine at Mount Sinai, and Chief AI Officer of the Mount Sinai Health System. “While we expected some variability, what we observed went beyond inconsistency. The system’s alerts were inverted relative to clinical risk, appearing more reliably for lower-risk scenarios than for cases when someone shared how they intended to hurt themselves. In real life, when someone talks about exactly how they would harm themselves, that’s a sign of more immediate and serious danger, not less.”

As part of the evaluation, the research team created 60 structured clinical scenarios spanning 21 medical specialties. Cases ranged from minor conditions appropriate for home care to true medical emergencies. Three independent physicians determined the correct level of urgency for each case using guidelines from 56 medical societies.

Each scenario was tested under 16 different contextual conditions, including variations in race, gender, social dynamics (such as someone minimizing symptoms), and barriers to care like lack of insurance or transportation. In total, the team conducted 960 interactions with ChatGPT Health and compared its recommendations with physician consensus.
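
The interaction count follows directly from crossing every scenario with every contextual condition. A minimal sketch of that arithmetic, assuming placeholder identifiers (the study's actual scenario and condition labels are not given in this release):

```python
from itertools import product

N_SCENARIOS = 60   # clinical scenarios, as stated in the release
N_CONDITIONS = 16  # contextual conditions per scenario, as stated in the release

# Hypothetical placeholder IDs; the real labels are not listed here.
scenarios = [f"scenario_{i:02d}" for i in range(1, N_SCENARIOS + 1)]
conditions = [f"condition_{j:02d}" for j in range(1, N_CONDITIONS + 1)]

# Every scenario paired with every condition gives the full test grid.
interactions = list(product(scenarios, conditions))
print(len(interactions))  # 960
```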

Across these scenarios, the researchers found that although the tool generally handled clear-cut emergencies correctly, it under-triaged (that is, recommended a lower level of care than warranted) more than half of the cases that physicians determined required emergency care.
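
An under-triage rate of this kind can be computed by comparing each recommendation with the physician reference. The sketch below is illustrative only: the four-level urgency scale, the function name, and the toy data are assumptions, not the study's actual coding scheme or results.

```python
from enum import IntEnum


class Urgency(IntEnum):
    # Hypothetical ordered scale; the release does not specify the study's levels.
    HOME_CARE = 0
    ROUTINE = 1
    URGENT = 2
    EMERGENCY = 3


def under_triage_rate(pairs, level=Urgency.EMERGENCY):
    """Share of cases at `level` (per the reference) where the tool recommended less."""
    relevant = [(ref, rec) for ref, rec in pairs if ref == level]
    if not relevant:
        return None  # no cases at this level, so the rate is undefined
    missed = sum(1 for ref, rec in relevant if rec < ref)
    return missed / len(relevant)


# Made-up (reference, recommendation) pairs, for illustration only.
toy = [
    (Urgency.EMERGENCY, Urgency.EMERGENCY),
    (Urgency.EMERGENCY, Urgency.URGENT),
    (Urgency.EMERGENCY, Urgency.ROUTINE),
    (Urgency.HOME_CARE, Urgency.HOME_CARE),
]
rate = under_triage_rate(toy)
print(rate)  # two of the three emergency cases were under-triaged
```

Restricting the denominator to cases the reference labels as emergencies keeps the rate from being diluted by low-acuity cases the tool handled correctly.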

The investigators were also struck by how the system failed in emergency medical cases. The tool’s own explanations often showed that it had recognized dangerous findings, yet it still reassured the patient.

“ChatGPT Health performed well in textbook emergencies such as stroke or severe allergic reactions,” says Dr. Ramaswamy. “But it struggled in more nuanced situations where the danger is not immediately obvious, and those are often the cases where clinical judgment matters most. In one asthma scenario, for example, the system identified early warning signs of respiratory failure in its explanation but still advised waiting rather than seeking emergency treatment.”

The study authors advise that for worsening or concerning symptoms, including chest pain, shortness of breath, severe allergic reactions, or changes in mental status, people should seek medical care directly rather than relying solely on chatbot guidance. In cases involving thoughts of self-harm, individuals should contact the 988 Suicide and Crisis Lifeline or go to an emergency department.

Still, the researchers emphasize that the findings do not suggest consumers should abandon AI health tools altogether.

“As a medical student training at a time when AI health tools are already in the hands of millions, I see them as technologies we must learn to integrate thoughtfully into care rather than substitutes for clinical judgment,” says Alvira Tyagi, a first-year medical student at the Icahn School of Medicine at Mount Sinai and second author of the study. “These systems are changing quickly, so part of our training now must consider learning how to understand their outputs critically, identify where they fall short, and use them in ways that protect patients.”

The study assessed the system at a single point in time. Because AI models are frequently updated, performance may change over time, underscoring the need for independent evaluation, the researchers say.

“Starting medical training alongside tools that are evolving in real time makes it clear that today’s results are not set in stone,” Ms. Tyagi says. “That reality calls for ongoing review to ensure that improvements in technology translate into safer care.”

The team plans to continue evaluating updated versions of ChatGPT Health and other consumer-facing AI tools, expanding future research into areas such as pediatric care, medication safety, and non-English-language use.

The paper is titled “ChatGPT Health performance in a structured test of triage recommendations.” 

The study’s authors, as listed in the journal, are Ashwin Ramaswamy, MD, MPP; Alvira Tyagi, BA; Hannah Hugo, MD; Joy Jiang, PhD; Pushkala Jayaraman, PhD; Mateen Jangda, MSc; Alexis E. Te, MD; Steven A. Kaplan, MD; Joshua Lampert, MD; Robert Freeman, MSN, MS; Nicholas Gavin, MD, MBA; Ashutosh K. Tewari, MBBS, MCh; Ankit Sakhuja, MBBS, MS; Bilal Naved, PhD; Alexander W. Charney, MD, PhD; Mahmud Omar, MD; Michael A. Gorin, MD; Eyal Klang, MD; Girish N. Nadkarni, MD, MPH.

For more Mount Sinai artificial intelligence news, visit: https://icahn.mssm.edu/about/artificial-intelligence.  

 

About Mount Sinai's Windreich Department of AI and Human Health   

Led by Girish N. Nadkarni, MD, MPH—an international authority on the safe, effective, and ethical use of AI in health care—Mount Sinai’s Windreich Department of AI and Human Health is the first of its kind at a U.S. medical school, pioneering transformative advancements at the intersection of artificial intelligence and human health.  

The Department is committed to leveraging AI in a responsible, effective, ethical, and safe manner to transform research, clinical care, education, and operations. By bringing together world-class AI expertise, cutting-edge infrastructure, and unparalleled computational power, the department is advancing breakthroughs in multi-scale, multimodal data integration while streamlining pathways for rapid testing and translation into practice.  

The Department benefits from dynamic collaborations across Mount Sinai, including with the Hasso Plattner Institute for Digital Health at Mount Sinai—a partnership between the Hasso Plattner Institute for Digital Engineering in Potsdam, Germany, and the Mount Sinai Health System—which complements its mission by advancing data-driven approaches to improve patient care and health outcomes.  

At the heart of this innovation is the renowned Icahn School of Medicine at Mount Sinai, which serves as a central hub for learning and collaboration. This unique integration enables dynamic partnerships across institutes, academic departments, hospitals, and outpatient centers, driving progress in disease prevention, improving treatments for complex illnesses, and elevating quality of life on a global scale.  

In 2024, the Department's innovative NutriScan AI application, developed by the Mount Sinai Health System Clinical Data Science team in partnership with Department faculty, earned Mount Sinai Health System the prestigious Hearst Health Prize. NutriScan is designed to facilitate faster identification and treatment of malnutrition in hospitalized patients. This machine learning tool improves malnutrition diagnosis rates and resource utilization, demonstrating the impactful application of AI in health care.  

For more information on Mount Sinai's Windreich Department of AI and Human Health, visit: ai.mssm.edu  

 

About the Hasso Plattner Institute at Mount Sinai  

At the Hasso Plattner Institute for Digital Health at Mount Sinai, the tools of data science, biomedical and digital engineering, and medical expertise are used to improve and extend lives. The Institute represents a collaboration between the Hasso Plattner Institute for Digital Engineering in Potsdam, Germany, and the Mount Sinai Health System.   

Girish Nadkarni, MD, MPH, who directs the Institute, and Professor Lothar Wieler, a globally recognized expert in public health and digital transformation, jointly oversee the partnership, driving innovations that positively impact patient lives while transforming how people think about personal health and health systems.

The Hasso Plattner Institute for Digital Health at Mount Sinai receives generous support from the Hasso Plattner Foundation. Current research programs and machine learning efforts focus on improving the ability to diagnose and treat patients.  

 

About the Icahn School of Medicine at Mount Sinai 

The Icahn School of Medicine at Mount Sinai is internationally renowned for its outstanding research, educational, and clinical care programs. It is the sole academic partner for the seven member hospitals* of the Mount Sinai Health System, one of the largest academic health systems in the United States, providing care to New York City’s large and diverse patient population.   

The Icahn School of Medicine at Mount Sinai offers highly competitive MD, PhD, MD-PhD, and master’s degree programs, with enrollment of more than 1,200 students. It has the largest graduate medical education program in the country, with more than 2,600 clinical residents and fellows training throughout the Health System. Its Graduate School of Biomedical Sciences offers 13 degree-granting programs, conducts innovative basic and translational research, and trains more than 560 postdoctoral research fellows.  

Ranked 11th nationwide in National Institutes of Health (NIH) funding, the Icahn School of Medicine at Mount Sinai is in the 99th percentile in research dollars per investigator, according to the Association of American Medical Colleges. More than 4,500 scientists, educators, and clinicians work within and across dozens of academic departments and multidisciplinary institutes with an emphasis on translational research and therapeutics. Through Mount Sinai Innovation Partners (MSIP), the Health System facilitates the real-world application and commercialization of medical breakthroughs made at Mount Sinai.

-------------------------------------------------------  

* Mount Sinai Health System member hospitals: The Mount Sinai Hospital; Mount Sinai Brooklyn; Mount Sinai Morningside; Mount Sinai Queens; Mount Sinai South Nassau; Mount Sinai West; and New York Eye and Ear Infirmary of Mount Sinai 

 

 

 

END


