PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Fine-tuned LLMs boost error detection in radiology reports

2025-05-20
(Press-News.org) OAK BROOK, Ill. – A type of artificial intelligence called fine-tuned large language models (LLMs) greatly enhances error detection in radiology reports, according to a new study published today in Radiology, a journal of the Radiological Society of North America (RSNA). Researchers said the findings point to an important role for this technology in medical proofreading.

Radiology reports are crucial for optimal patient care. Their accuracy can be compromised by factors like errors in speech recognition software, variability in perceptual and interpretive processes and cognitive biases. These errors can lead to incorrect diagnoses or delayed treatments, making the need for accurate reports urgent.

LLMs like ChatGPT are advanced generative AI models that are trained on vast amounts of text to generate human language. While they offer great potential in proofreading, their application in the medical field, particularly in detecting errors within radiology reports, remains underexplored.

To bridge this gap in knowledge, researchers evaluated fine-tuned LLMs for detecting errors in radiology reports during medical proofreading. A fine-tuned LLM is a pre-trained language model that is further trained on domain-specific data.

“Initially, LLMs are trained on large-scale public data to learn general language patterns and knowledge,” said study senior author Yifan Peng, Ph.D., from the Department of Population Health Sciences at Weill Cornell Medicine in New York City. “Fine-tuning occurs as the next step, where the model undergoes additional training using smaller, targeted datasets relevant to particular tasks.”

To test the model, Dr. Peng and colleagues built a dataset with two parts. The first consisted of 1,656 synthetic reports, including 828 error-free reports and 828 reports with errors. The second part comprised 614 reports, including 307 error-free reports from MIMIC-CXR, a large, publicly available database of chest X-rays, and 307 synthetic reports with errors.

The researchers used the synthetic reports to boost the amount of training data and fulfill the data-hungry needs of LLM fine-tuning.

“Synthetic reports can also increase the coverage and diversity, balance out the cases and reduce the annotation costs,” said the study’s first author, Cong Sun, Ph.D., from Dr. Peng’s lab. “In radiology, or more broadly, the clinical domain, synthetic reports allow safe data-sharing without compromising patient privacy.”

The researchers found that the fine-tuned model outperformed both GPT-4 and BiomedBERT, a natural language processing tool for biomedical research.

“The LLM that was fine-tuned on both MIMIC-CXR and synthetic reports demonstrated strong performance in the error detection tasks,” Dr. Sun said. “It meets our expectations and highlights the potential for developing lightweight, fine-tuned LLM specifically for medical proofreading applications.”

The study provided evidence that LLMs can assist in detecting various types of errors, including transcription errors and left/right errors, which refer to misidentification or misinterpretation of directions or sides in text or images.

The use of synthetic data in AI model building has raised concerns of bias in the data. Dr. Peng and colleagues took steps to minimize this by using diverse and representative samples of real-world data to generate the synthetic data. However, they acknowledged that synthetic errors may not fully capture the complexity of real-world errors in radiology reports. Future work could include a systematic evaluation of how bias introduced by synthetic errors affects model performance.

The researchers hope to study fine-tuning’s ability to reduce radiologists’ cognitive load and enhance patient care and find out if fine-tuning would degrade the model’s ability to generate reasoning explanations.

“We are excited to keep exploring innovative strategies to enhance the reasoning capabilities of fine-tuned LLMs in medical proofreading tasks,” Dr. Peng said. “Our goal is to develop transparent and understandable models that radiologists can confidently trust and fully embrace.”

###

“Generative Large Language Models Trained for Detecting Errors in Radiology Reports.” Collaborating with Drs. Peng and Sun were Kurt Teichman, M.S., Yiliang Zhou, M.S., Brian Critelli, B.S., David Nauheim, M.D.,

Graham Keir, M.D., Xindi Wang, Ph.D., Judy Zhong, Ph.D., Adam E. Flanders, M.D., and George Shih, M.D.

Radiology is edited by Linda Moy, M.D., New York University, New York, N.Y., and owned and published by the Radiological Society of North America, Inc. (https://pubs.rsna.org/journal/radiology)

RSNA is an association of radiologists, radiation oncologists, medical physicists and related scientists promoting excellence in patient care and health care delivery through education, research and technologic innovation. The Society is based in Oak Brook, Illinois. (RSNA.org)

For patient-friendly information on how to read a radiology report, visit RadiologyInfo.org.

END



ELSE PRESS RELEASES FROM THIS DATE:

Climate change emerges as third major threat to global wildlife, scientists warn

2025-05-20
New research published in BioScience reveals that climate change is rapidly emerging as a third major threat to Earth's wild animals, joining habitat alteration and overexploitation in what scientists call a shift from "twin to triple threats." The research team, led by William J. Ripple of Oregon State University, analyzed data for 70,814 animal species from 35 classes, using two publicly available biodiversity datasets to assess climate change vulnerability among the world's wild animal populations. Their ...

New blood test developed at Mass General Brigham shows superior sensitivity in detecting HPV-associated head and neck cancers

2025-05-20
A new liquid biopsy blood test could help detect cases of human papillomavirus (HPV)-associated head and neck cancers with significantly higher accuracy than currently used methods, including before patients develop symptoms, according to new Mass General Brigham research. The researchers at Mass Eye and Ear, a member of the Mass General Brigham healthcare system, found that the blood-based diagnostic test they developed called HPV-DeepSeek achieved 99% sensitivity and 99% specificity for diagnosing cancer at the time of first clinical presentation, including ...

The hidden drivers of aging: microbial influence on genomic stability and telomere dynamics

2025-05-20
Aging is a multifaceted process driven by interconnected biological mechanisms, among which genomic instability and telomere attrition stand as primary hallmarks. Emerging research underscores the pivotal role of the human microbiome in modulating these processes, offering novel insights into aging and age-related diseases. This review synthesizes current evidence on how microbial dysbiosis accelerates aging by disrupting genomic integrity and telomere dynamics, while also exploring therapeutic strategies to promote healthy ...

Neurosymbolic AI could be leaner and smarter

2025-05-20
Could AI that thinks more like a human be more sustainable than today’s LLMs? The AI industry is dominated by large companies with deep pockets and a gargantuan appetite for energy to power their models’ mammoth computing needs. Data centers supporting AI already account for up to 3.7% of global greenhouse emissions. In a Perspective, Alvaro Velasquez and colleagues propose an alternative model: neurosymbolic AI, which would require far less computing power, creating opportunities for smaller players to enter the field and allowing society to enjoy the benefits of AI without the environmental costs. Neurosymbolic AI is built on data-driven neural ...

Intuition-guided reinforcement learning for soft tissue manipulation with unknown constraints

2025-05-20
A research paper by scientists at Hefei University of Technology presented an intuition-guided deep reinforcement learning framework for soft tissue manipulation under unknown constraints. The research paper, published on Apr. 14, 2025 in the journal Cyborg and Bionic Systems. Intraoperative soft tissue manipulation is a critical challenge in autonomous robotic surgery. Furthermore, the intricate in vivo environment surrounding the target soft tissues poses additional hindrances to autonomous robotic decision-making. Previous studies assumed the grasping point was known and the target deformation could be achieved. The constraints were assumed to be constant during the ...

Mount Sinai surgeons perform first heart-liver-kidney transplants in New York State

2025-05-20
A team of Mount Sinai surgeons has performed the first heart-liver-kidney triple organ transplants in New York. They successfully completed two of these complex surgeries on patients from Westchester County, who have since returned home and are making full recoveries. Heart-liver-kidney transplants are extremely rare—only 58 have been done across the country since the United Network for Organ Sharing, the government agency that oversees transplantation, started tracking cases in 1987. The two procedures at The Mount Sinai Hospital, which took place on January 10 and March 8, were among only four to date in the ...

‘Sharkitecture:’ A nanoscale look inside a blacktip shark’s skeleton

2025-05-20
Sharks have been evolving for more than 450 million years, developing skeletons not from bone, but from a tough, mineralized form of cartilage. These creatures are more than just fast swimmers – they’re built for efficiency. Their spines act like natural springs, storing and releasing energy with each tailbeat, allowing them to move through the water with smooth, powerful grace. Now, scientists are peering inside shark skeletons at the nanoscale, revealing a microscopic “sharkitecture” that helps these ancient apex predators withstand extreme physical demands of constant motion. Using synchrotron X-ray nanotomography with detailed ...

Public opinion on who should do content moderation

2025-05-20
Americans perceive small juries of content experts as the most legitimate moderators of potentially misleading content on social media, according to a survey, but perceive large, nationally representative or politically balanced juries with minimum knowledge qualifications as comparably legitimate. Social media content moderation policies tend to attract criticism, with some calling for more aggressive removal of harmful and misleading content and others decrying moderation as censorship and accusing expert moderators of being politically biased. Less clear is what the general public would like to see in terms of content ...

Accounting for marine ecosystems in China promises greater environmental and economic sustainability

2025-05-20
A Perspective proposes a pathway to improvements in sustainability of marine ecosystems and resources in China. Based on environmental accounting used in China’s terrestrial ecosystems, the approach would implement policy and governance to ensure accountability for sustainable use of marine systems. Laurence J. McCook and colleagues argue that the ecosystem goods and services provided to the nation by oceans and coastal ecosystems—including seagrass beds, salt marshes, coral reefs, and mangrove forests—are ...

Diabetes drug gives hope for new treatment for prostate cancer

2025-05-20
A drug used to treat type 2 diabetes may also be effective in slowing the progression of prostate cancer. This is shown by an international study in which researchers at Umeå University, Sweden, have participated. The researchers have found that drugs that regulate a particular protein have a key role in reducing prostate cancer recurrence among diabetic patients. "This is a significant discovery. For the first time, we have clinical observations showing that prostate cancer patients with diabetes who received drugs targeting the protein remained relapse-free during the period we followed them," ...

LAST 30 PRESS RELEASES:

Brain cells drive endurance gains after exercise

Same-day hospital discharge is safe in selected patients after TAVI

Why do people living at high altitudes have better glucose control? The answer was in plain sight

Red blood cells soak up sugar at high altitude, protecting against diabetes

A new electrolyte points to stronger, safer batteries

Environment: Atmospheric pollution directly linked to rocket re-entry

Targeted radiation therapy improves quality of life outcomes for patients with multiple brain metastases

Cardiovascular events in women with prior cervical high-grade squamous intraepithelial lesion

Transplantation and employment earnings in kidney transplant recipients

Brain organoids can be trained to solve a goal-directed task

Treatment can protect extremely premature babies from lung disease

Roberto Morandotti wins prestigious Max Born Award for pioneering research in quantum photonics

Scientists map brain's blood pressure control center

Acute coronary events registry provides insights into sex-specific differences

Bar-Ilan University and NVIDIA researchers improve AI’s ability to understand spatial instructions

New single-cell transcriptomic clock reveals intrinsic and systemic T cell aging in COVID-19 and HIV

Smaller fish and changing food webs – even where species numbers stay the same

Missed opportunity to protect pregnant women and newborns: Study shows low vaccination rates among expectant mothers in Norway against COVID-19 and influenza

Emotional memory region of aged brain is sensitive to processed foods

Neighborhood factors may lead to increased COPD-related emergency department visits, hospitalizations

Food insecurity impacts employees’ productivity

Prenatal infection increases risk of heavy drinking later in life

‘The munchies’ are real and could benefit those with no appetite

FAU researchers discover novel bacteria in Florida’s stranded pygmy sperm whales

DEGU debuts with better AI predictions and explanations

‘Giant superatoms’ unlock a new toolbox for quantum computers

Jeonbuk National University researchers explore metal oxide electrodes as a new frontier in electrochemical microplastic detection

Cannabis: What is the profile of adults at low risk of dependence?

Medical and materials innovations of two women engineers recognized by Sony and Nature

Blood test “clocks” predict when Alzheimer’s symptoms will start

[Press-News.org] Fine-tuned LLMs boost error detection in radiology reports