PRESS-NEWS.org - Press Release Distribution

Fine-tuned LLMs boost error detection in radiology reports

2025-05-20
(Press-News.org) OAK BROOK, Ill. – A type of artificial intelligence called fine-tuned large language models (LLMs) greatly enhances error detection in radiology reports, according to a new study published today in Radiology, a journal of the Radiological Society of North America (RSNA). Researchers said the findings point to an important role for this technology in medical proofreading.

Radiology reports are crucial for optimal patient care. Their accuracy can be compromised by factors like errors in speech recognition software, variability in perceptual and interpretive processes and cognitive biases. These errors can lead to incorrect diagnoses or delayed treatments, making the need for accurate reports urgent.

LLMs like ChatGPT are advanced generative AI models that are trained on vast amounts of text to generate human language. While they offer great potential in proofreading, their application in the medical field, particularly in detecting errors within radiology reports, remains underexplored.

To bridge this gap in knowledge, researchers evaluated fine-tuned LLMs for detecting errors in radiology reports during medical proofreading. A fine-tuned LLM is a pre-trained language model that is further trained on domain-specific data.

“Initially, LLMs are trained on large-scale public data to learn general language patterns and knowledge,” said study senior author Yifan Peng, Ph.D., from the Department of Population Health Sciences at Weill Cornell Medicine in New York City. “Fine-tuning occurs as the next step, where the model undergoes additional training using smaller, targeted datasets relevant to particular tasks.”

To test the model, Dr. Peng and colleagues built a dataset with two parts. The first consisted of 1,656 synthetic reports, including 828 error-free reports and 828 reports with errors. The second part comprised 614 reports, including 307 error-free reports from MIMIC-CXR, a large, publicly available database of chest X-rays, and 307 synthetic reports with errors.
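The composition of the two-part dataset can be tallied in a few lines. The sketch below uses only the counts given above; the class and function names are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DatasetPart:
    """One part of the evaluation dataset (names here are illustrative)."""
    name: str
    error_free: int
    with_errors: int

    @property
    def total(self) -> int:
        return self.error_free + self.with_errors


# Counts as reported in the press release.
parts = [
    DatasetPart("synthetic reports", error_free=828, with_errors=828),
    DatasetPart("MIMIC-CXR plus synthetic error reports", error_free=307, with_errors=307),
]


def summarize(dataset_parts):
    """Return the overall report count and the fraction containing errors."""
    total = sum(p.total for p in dataset_parts)
    with_errors = sum(p.with_errors for p in dataset_parts)
    return {
        "total": total,
        "with_errors": with_errors,
        "error_fraction": with_errors / total,
    }


summary = summarize(parts)
```

Both parts are balanced, so half of the combined reports contain errors.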

The researchers used the synthetic reports to increase the amount of training data, since LLM fine-tuning requires large volumes of examples.

“Synthetic reports can also increase the coverage and diversity, balance out the cases and reduce the annotation costs,” said the study’s first author, Cong Sun, Ph.D., from Dr. Peng’s lab. “In radiology, or more broadly, the clinical domain, synthetic reports allow safe data-sharing without compromising patient privacy.”

The researchers found that the fine-tuned model outperformed both GPT-4 and BiomedBERT, a natural language processing tool for biomedical research.

“The LLM that was fine-tuned on both MIMIC-CXR and synthetic reports demonstrated strong performance in the error detection tasks,” Dr. Sun said. “It meets our expectations and highlights the potential for developing lightweight, fine-tuned LLM specifically for medical proofreading applications.”

The study provided evidence that LLMs can assist in detecting various types of errors, including transcription errors and left/right errors, which refer to misidentification or misinterpretation of directions or sides in text or images.
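To make the idea of a left/right error concrete, here is a minimal sketch of how such an error could be introduced into a report sentence to create a labeled example. This is an assumption for illustration only: the press release does not describe how the study generated its synthetic errors, and the function names and example sentences below are hypothetical.

```python
import re

# Map each side to its opposite (illustrative; not the study's method).
_SWAP = {"left": "right", "right": "left"}
_SIDE = re.compile(r"\b(left|right)\b", re.IGNORECASE)


def swap_first_side(sentence: str) -> str:
    """Flip the first standalone 'left' or 'right', preserving its casing."""

    def _replace(match: re.Match) -> str:
        word = match.group(0)
        swapped = _SWAP[word.lower()]
        if word.isupper():
            return swapped.upper()
        if word[0].isupper():
            return swapped.capitalize()
        return swapped

    return _SIDE.sub(_replace, sentence, count=1)


def make_labeled_pair(sentence: str):
    """Return (text, label) pairs: 0 = error-free, 1 = contains an injected error."""
    corrupted = swap_first_side(sentence)
    if corrupted == sentence:
        # No side mentioned, so no left/right error can be injected.
        return [(sentence, 0)]
    return [(sentence, 0), (corrupted, 1)]


pairs = make_labeled_pair("Small left pleural effusion.")
```

A detector would then be asked to flag the second sentence of each pair. Real-world laterality errors are likely more varied than this simple word swap.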

The use of synthetic data in AI model building has raised concerns of bias in the data. Dr. Peng and colleagues took steps to minimize this by using diverse and representative samples of real-world data to generate the synthetic data. However, they acknowledged that synthetic errors may not fully capture the complexity of real-world errors in radiology reports. Future work could include a systematic evaluation of how bias introduced by synthetic errors affects model performance.

The researchers hope to study whether fine-tuning can reduce radiologists’ cognitive load and enhance patient care, and to find out whether it degrades the model’s ability to generate reasoning explanations.

“We are excited to keep exploring innovative strategies to enhance the reasoning capabilities of fine-tuned LLMs in medical proofreading tasks,” Dr. Peng said. “Our goal is to develop transparent and understandable models that radiologists can confidently trust and fully embrace.”

###

“Generative Large Language Models Trained for Detecting Errors in Radiology Reports.” Collaborating with Drs. Peng and Sun were Kurt Teichman, M.S., Yiliang Zhou, M.S., Brian Critelli, B.S., David Nauheim, M.D., Graham Keir, M.D., Xindi Wang, Ph.D., Judy Zhong, Ph.D., Adam E. Flanders, M.D., and George Shih, M.D.

Radiology is edited by Linda Moy, M.D., New York University, New York, N.Y., and owned and published by the Radiological Society of North America, Inc. (https://pubs.rsna.org/journal/radiology)

RSNA is an association of radiologists, radiation oncologists, medical physicists and related scientists promoting excellence in patient care and health care delivery through education, research and technologic innovation. The Society is based in Oak Brook, Illinois. (RSNA.org)

For patient-friendly information on how to read a radiology report, visit RadiologyInfo.org.

END


