PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Medical AI models rely on 'shortcuts' that could lead to misdiagnosis of COVID-19

2021-05-31
(Press-News.org) Artificial intelligence promises to be a powerful tool for improving the speed and accuracy of medical decision-making to improve patient outcomes. From diagnosing disease, to personalizing treatment, to predicting complications from surgery, AI could become as integral to patient care in the future as imaging and laboratory tests are today.

But as University of Washington researchers discovered, AI models -- like humans -- have a tendency to look for shortcuts. In the case of AI-assisted disease detection, these shortcuts could lead to diagnostic errors if deployed in clinical settings.

In a new paper published May 31 in Nature Machine Intelligence, UW researchers examined multiple models recently put forward as potential tools for accurately detecting COVID-19 from chest radiography, otherwise known as chest X-rays. The team found that, rather than learning genuine medical pathology, these models rely instead on shortcut learning to draw spurious associations between medically irrelevant factors and disease status. Here, the models ignored clinically significant indicators and relied instead on characteristics such as text markers or patient positioning that were specific to each dataset to predict whether someone had COVID-19.

"A physician would generally expect a finding of COVID-19 from an X-ray to be based on specific patterns in the image that reflect disease processes," said co-lead author Alex DeGrave, who is pursuing his doctorate in the Paul G. Allen School of Computer Science & Engineering and a medical degree as part of the UW's Medical Scientist Training Program. "But rather than relying on those patterns, a system using shortcut learning might, for example, judge that someone is elderly and thus infer that they are more likely to have the disease because it is more common in older patients. The shortcut is not wrong per se, but the association is unexpected and not transparent. And that could lead to an inappropriate diagnosis."

Shortcut learning is less robust than genuine medical pathology and usually means the model will not generalize well outside of the original setting, the team said.

"A model that relies on shortcuts will often only work in the hospital in which it was developed, so when you take the system to a new hospital, it fails -- and that failure can point doctors toward the wrong diagnosis and improper treatment," DeGrave said.

Combine that lack of robustness with the typical opacity of AI decision-making, and such a tool could go from a potential life-saver to a liability.

The lack of transparency is one of the factors that led the team to focus on explainable AI techniques for medicine and science. Most AI is regarded as a "black box" -- the model is trained on massive datasets and it spits out predictions without anyone knowing precisely how the model came up with a given result. With explainable AI, researchers and practitioners are able to understand, in detail, how various inputs and their weights contributed to a model's output.

The team used these same techniques to evaluate the trustworthiness of models recently touted for appearing to accurately identify cases of COVID-19 from chest X-rays. Despite a number of published papers heralding the results, the researchers suspected that something else may have been happening inside the black box that led to the models' predictions.

Specifically, the team reasoned that these models would be prone to a condition known as "worst-case confounding," owing to the lack of training data available for such a new disease. This scenario increased the likelihood that the models would rely on shortcuts rather than learning the underlying pathology of the disease from the training data.

"Worst-case confounding is what allows an AI system to just learn to recognize datasets instead of learning any true disease pathology," said co-lead author Joseph Janizek, who is also a doctoral student in the Allen School and earning a medical degree at the UW. "It's what happens when all of the COVID-19 positive cases come from a single dataset while all of the negative cases are in another. And while researchers have come up with techniques to mitigate associations like this in cases where those associations are less severe, these techniques don't work in situations where you have a perfect association between an outcome such as COVID-19 status and a factor like the data source."

The team trained multiple deep convolutional neural networks on X-ray images from a dataset that replicated the approach used in the published papers. First they tested each model's performance on an internal set of images from that initial dataset that had been withheld from the training data. Then the researchers tested how well the models performed on a second, external dataset meant to represent new hospital systems.

While the models maintained their high performance when tested on images from the internal dataset, their accuracy was reduced by half on the second set. The researchers referred to this as a "generalization gap" and cited it as strong evidence that confounding factors were responsible for the models' predictive success on the initial dataset.

The team then applied explainable AI techniques, including generative adversarial networks and saliency maps, to identify which image features were most important in determining the models' predictions.

The researchers trained the models on a second dataset, which contained positive and negative COVID-19 cases drawn from similar sources, and was therefore presumed to be less prone to confounding. But even those models exhibited a corresponding drop in performance when tested on external data.

These results upend the conventional wisdom that confounding poses less of an issue when datasets are derived from similar sources. They also reveal the extent to which high-performance medical AI systems could exploit undesirable shortcuts rather than the desired signals.

"My team and I are still optimistic about the clinical viability of AI for medical imaging. I believe we will eventually have reliable ways to prevent AI from learning shortcuts, but it's going to take some more work to get there," said senior author Su-In Lee, a professor in the Allen School. "Going forward, explainable AI is going to be an essential tool for ensuring these models can be used safely and effectively to augment medical decision-making and achieve better outcomes for patients."

Despite the concerns raised by the team's findings, it is unlikely that the models the team studied have been deployed widely in the clinical setting, DeGrave said. While there is evidence that at least one of the faulty models - COVID-Net - was deployed in multiple hospitals, it is unclear whether it was used for clinical purposes or solely for research.

"Complete information about where and how these models have been deployed is unavailable, but it's safe to assume that clinical use of these models is rare or nonexistent," DeGrave said. "Most of the time, healthcare providers diagnose COVID-19 using a laboratory test, PCR, rather than relying on chest radiographs. And hospitals are averse to liability, making it even less likely that they would rely on a relatively untested AI system."

Researchers looking to apply AI to disease detection will need to revamp their approach before such models can be used to make actual treatment decisions for patients, Janizek said. "Our findings point to the importance of applying explainable AI techniques to rigorously audit medical AI systems," Janizek said. "If you look at a handful of X-rays, the AI system might appear to behave well. Problems only become clear once you look at many images. Until we have methods to more efficiently audit these systems using a greater sample size, a more systematic application of explainable AI could help researchers avoid some of the pitfalls we identified with the COVID-19 models." This group has already demonstrated the value of explainable AI for a range of medical applications beyond imaging. These include tools for assessing patient risk factors for complications during surgery and targeting cancer therapies based on an individual's molecular profile.

INFORMATION:

This paper is one of two studies from this group to appear in the current issue of Nature Machine Intelligence. Lee is also the senior and corresponding author on the second paper, "Improving performance of deep learning models with axiomatic attribution priors and expected gradients," for which she teamed up with Janizek, his fellow M.D.-Ph.D. student Gabriel Erion, Ph.D. student Pascal Sturmfels, and affiliate professor Scott Lundberg of Microsoft Research.

This research was funded by the National Science Foundation and the National Institutes of Health.

For more information, contact DeGrave at degrave@uw.edu, Janizek at jjanizek@uw.edu and Lee at suinlee@uw.edu.



ELSE PRESS RELEASES FROM THIS DATE:

Isolating an elusive missing link

Isolating an elusive missing link
2021-05-31
The Water Oxidation Reaction (WOR) is one of the most important reactions on the planet since it is the source of nearly all the atmosphere's oxygen. Understanding its intricacies can hold the key to improve the efficiency of the reaction. Unfortunately, the reaction's mechanisms are complex and the intermediates highly unstable, thus making their isolation and characterisation extremely challenging. To overcome this, scientists are using molecular catalysts as models to understand the fundamental aspects of water oxidation - particularly the oxygen-oxygen bond-forming reaction. For the first time, scientists in ICIQ's ...

Global warming already responsible for one in three heat-related deaths

2021-05-31
Between 1991 and 2018, more than a third of all deaths in which heat played a role were attributable to human-induced global warming, according to a new article in Nature Climate Change. The study, the largest of its kind, was led by the London School of Hygiene & Tropical Medicine (LSHTM) and the University of Bern within the Multi-Country Multi-City (MCC) Collaborative Research Network. Using data from 732 locations in 43 countries around the world it shows for the first time the actual contribution of man-made climate change in increasing mortality risks due to heat. Overall, ...

Scientists discover a new genetic form of ALS in children

Scientists discover a new genetic form of ALS in children
2021-05-31
In a study of 11 medical-mystery patients, an international team of researchers led by scientists at the National Institutes of Health and the Uniformed Services University (USU) discovered a new and unique form of amyotrophic lateral sclerosis (ALS). Unlike most cases of ALS, the disease began attacking these patients during childhood, worsened more slowly than usual, and was linked to a gene, called SPTLC1, that is part of the body's fat production system. Preliminary results suggested that genetically silencing SPTLC1 activity would be an effective strategy for combating this type of ALS. "ALS is a paralyzing ...

Lundquist investigators in global study expanding genomic research of different ancestries

Lundquist investigators in global study expanding genomic research of different ancestries
2021-05-31
LOS ANGELES (May 31, 2021) -- Today The Lundquist Institute announced that its investigators contributed data from several studies, including data on Hispanics, African-Americans and East Asians, to the international MAGIC collaboration, composed of more than 400 global academics, who conducted a genome-wide association meta-analysis led by the University of Exeter. Now published in Nature Genetics, their findings demonstrate that expanding research into different ancestries yields more and better results, as well as ultimately benefitting global patient care. Up to now nearly 87 percent of genomic research of this type has been conducted in Europeans. ...

The price is right: Modeling economic growth in a zero-emission society

2021-05-31
Pollution from manufacturing is now widespread, affecting all regions in the world, with serious ecological, economic, and political consequences. Heightened public concern and scrutiny have led to numerous governments considering policies that aim to lower pollution and improve environmental qualities. Inter-governmental agreements such as the Paris Agreement and the United Nations' Sustainable Development Goals all focus on lowering emissions of pollution. Specifically, they aim to achieve a "zero-emission society," which means that pollution is cleaned up as it is produced, while also reducing pollution (This idea of dealing with pollution is referred to as the "kindergarten rule.") Of course, any efforts to achieve this ...

Beer byproduct mixed with manure proves an excellent pesticide

Beer byproduct mixed with manure proves an excellent pesticide
2021-05-31
The use of many chemical fumigants in agriculture have been demonstrated to be harmful to human health and the environment and therefore banned from use. Now, in an effort to reduce waste from the agricultural industry and reduce the amounts of harmful chemicals used, researchers have investigated using organic byproducts from beer production and farming as a potential method to disinfest soils, preserve healthy soil microorganisms and increase crop yields. In this study published to Frontiers in Sustainable Food Systems, researchers from the Neiker Basque Institute for Agricultural Research and Development in Spain investigated using agricultural by-products rapeseed cake and beer bagasse (spent beer grains), along with fresh cow manure as two organic biodisinfestation ...

Oncotarget: Activation of plasmacytoid dendritic cells promotes AML-cell fratricide

Oncotarget: Activation of plasmacytoid dendritic cells promotes AML-cell fratricide
2021-05-31
Oncotarget published "Activation of plasmacytoid dendritic cells promotes AML-cell fratricide" which reported that Interferons have been previously shown to aid in the clearance of AML cells. Type I interferons are produced primarily by plasmacytoid dendritic cells. However, these cells exist in a quiescent state in AML. In addition, the authors showed increased expression of the immune-stimulatory receptor CD40. Then they next tested whether IFNβ would influence antibody-mediated fratricide among AML cells, as our recent work showed that AML cells could undergo cell-to cell killing in the presence of the CD38 antibody daratumumab. These Oncotarget findings suggest that the tolerogenic phenotype ...

Oncotarget: Progression in high-risk non-muscle invasive bladder cancer

Oncotarget: Progression in high-risk non-muscle invasive bladder cancer
2021-05-31
Oncotarget published "A higher De Ritis ratio (AST/ALT) is a risk factor for progression in high-risk non-muscle invasive bladder cancer" which reported that a recent study revealed that a high De Ritis ratio was a risk factor in some solid malignancies. This Oncotarget study examined the importance of the De Ritis ratio as a prognostic marker in high-risk NMIBC. This Oncotarget study examined the importance of the De Ritis ratio as a prognostic marker in high-risk NMIBC Among these patients, 32 patients developed recurrent disease and 15 patients showed progression. A multivariate analysis revealed that non-BCG treatment was an independent risk factor ...

Oncotarget: Piperlongumine promotes death of retinoblastoma cancer cells

Oncotarget: Piperlongumine promotes death of retinoblastoma cancer cells
2021-05-31
Oncotarget published "Piperlongumine promotes death of retinoblastoma cancer cells" which reported that while retinoblastoma initiation is triggered by the inactivation of both alleles of the retinoblastoma tumor suppressor gene in the developing retina, tumor progression requires additional epigenetic changes, retinoblastoma genomes being quite stable. In this report, the authors analyzed the pro-death effect of piperlongumine, a natural compound isolated from Piper longum L., on two human retinoblastoma cell lines, WERI-Rb and Y79. The effects of PL on cell proliferation, cell death and cell cycle were investigated. PL effectively inhibited cell growth, impacted ...

Gender stereotypes still hold true for youth and types of political participation

2021-05-31
Gender roles absorbed at an early age seem to have shaped today's youth regarding their involvement in politics, in line with traditional stereotypes, concludes a new study, conducted amongst adolescents and young adults aged between 15 and 30 in Italy, within the Horizon 2020 project: "CATCH-EyoU. Processes in Youth's Construction of Active EU Citizenship". In their research article, published in the peer-reviewed, open-access scientific journal Social Psychological Bulletin, the research team from the University of Bologna report that it is young males that would more often engage directly with politics, like enrolling in a political party, acting to influence government policy, contacting a politician or taking part in a protest. On ...

LAST 30 PRESS RELEASES:

How crickets co-exist with hostile ant hosts

Tapered polymer fibers enhance light delivery for neuroscience research

Syracuse University’s Fran Brown named Paul “Bear” Bryant Newcomer Coach of the Year Award recipient

DARPA-ABC program supports Wyss Institute-led collaboration toward deeper understanding of anesthesia and safe drugs enabling anesthesia without the need for extensive monitoring

The Offshore Wind Innovation Hub 2025 call for innovators opens today

Aligning Science Across Parkinson’s (ASAP) launches a new funding opportunity to join the Collaborative Research Network

State-of-the-art fusion simulation leads three scientists to the 2024 Kaul Foundation Prize

Davos Alzheimer's Collaborative launches innovative brain health navigator program for intuitive coordination between patients and providers

Media registration now open: ATS 2025 in San Francisco

New study shows that corn-soybean crop rotation benefits are extremely sensitive to climate

From drops to data: Advancing global precipitation estimates with the LETKF algorithm

SeoulTech researchers propose a novel method to shed light on PFOS-induced neurotoxicity

Large-scale TMIST breast cancer screening trial achieves enrollment goal, paving the way for data that provides a precision approach to screeninge

Study published in NEJM Catalyst finds patients cared for by MedStar Health’s Safe Babies Safe Moms program have better outcomes in pregnancy, delivery, and postpartum

Octopus arms have segmented nervous systems to power extraordinary movements

Protein shapes can help untangle life’s ancient history

Memory systems in the brain drive food cravings that could influence body weight

Indigenous students face cumbersome barriers to attaining post-secondary education

Not all Hot Jupiters orbit solo

Study shows connection between childhood maltreatment and disease in later life

Discovery of two planets sheds new light on the formation of planetary systems

New West Health-Gallup survey finds incoming Trump administration faces high public skepticism over plans to lower healthcare costs

Reading signs: New method improves AI translation of sign language

Over 97 million US residents exposed to unregulated contaminants in their drinking water

New large-scale study suggests no link between common brain malignancy and hormone therapy

AI helps to identify subjective cognitive decline during the menopause transition

Machine learning assisted plasmonic absorbers

Healthy lifestyle changes shown to help low back pain

Waking up is not stressful, study finds

Texas A&M AgriLife Research aims for better control of widespread tomato spotted wilt virus

[Press-News.org] Medical AI models rely on 'shortcuts' that could lead to misdiagnosis of COVID-19