PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Fine-tuned LLMs boost error detection in radiology reports

2025-05-20
(Press-News.org) OAK BROOK, Ill. – A type of artificial intelligence called fine-tuned large language models (LLMs) greatly enhances error detection in radiology reports, according to a new study published today in Radiology, a journal of the Radiological Society of North America (RSNA). Researchers said the findings point to an important role for this technology in medical proofreading.

Radiology reports are crucial for optimal patient care. Their accuracy can be compromised by factors like errors in speech recognition software, variability in perceptual and interpretive processes and cognitive biases. These errors can lead to incorrect diagnoses or delayed treatments, making the need for accurate reports urgent.

LLMs like ChatGPT are advanced generative AI models that are trained on vast amounts of text to generate human language. While they offer great potential in proofreading, their application in the medical field, particularly in detecting errors within radiology reports, remains underexplored.

To bridge this gap in knowledge, researchers evaluated fine-tuned LLMs for detecting errors in radiology reports during medical proofreading. A fine-tuned LLM is a pre-trained language model that is further trained on domain-specific data.

“Initially, LLMs are trained on large-scale public data to learn general language patterns and knowledge,” said study senior author Yifan Peng, Ph.D., from the Department of Population Health Sciences at Weill Cornell Medicine in New York City. “Fine-tuning occurs as the next step, where the model undergoes additional training using smaller, targeted datasets relevant to particular tasks.”

To test the model, Dr. Peng and colleagues built a dataset with two parts. The first consisted of 1,656 synthetic reports, including 828 error-free reports and 828 reports with errors. The second part comprised 614 reports, including 307 error-free reports from MIMIC-CXR, a large, publicly available database of chest X-rays, and 307 synthetic reports with errors.

The researchers used the synthetic reports to boost the amount of training data and fulfill the data-hungry needs of LLM fine-tuning.

“Synthetic reports can also increase the coverage and diversity, balance out the cases and reduce the annotation costs,” said the study’s first author, Cong Sun, Ph.D., from Dr. Peng’s lab. “In radiology, or more broadly, the clinical domain, synthetic reports allow safe data-sharing without compromising patient privacy.”

The researchers found that the fine-tuned model outperformed both GPT-4 and BiomedBERT, a natural language processing tool for biomedical research.

“The LLM that was fine-tuned on both MIMIC-CXR and synthetic reports demonstrated strong performance in the error detection tasks,” Dr. Sun said. “It meets our expectations and highlights the potential for developing lightweight, fine-tuned LLM specifically for medical proofreading applications.”

The study provided evidence that LLMs can assist in detecting various types of errors, including transcription errors and left/right errors, which refer to misidentification or misinterpretation of directions or sides in text or images.

The use of synthetic data in AI model building has raised concerns of bias in the data. Dr. Peng and colleagues took steps to minimize this by using diverse and representative samples of real-world data to generate the synthetic data. However, they acknowledged that synthetic errors may not fully capture the complexity of real-world errors in radiology reports. Future work could include a systematic evaluation of how bias introduced by synthetic errors affects model performance.

The researchers hope to study fine-tuning’s ability to reduce radiologists’ cognitive load and enhance patient care and find out if fine-tuning would degrade the model’s ability to generate reasoning explanations.

“We are excited to keep exploring innovative strategies to enhance the reasoning capabilities of fine-tuned LLMs in medical proofreading tasks,” Dr. Peng said. “Our goal is to develop transparent and understandable models that radiologists can confidently trust and fully embrace.”

###

“Generative Large Language Models Trained for Detecting Errors in Radiology Reports.” Collaborating with Drs. Peng and Sun were Kurt Teichman, M.S., Yiliang Zhou, M.S., Brian Critelli, B.S., David Nauheim, M.D.,

Graham Keir, M.D., Xindi Wang, Ph.D., Judy Zhong, Ph.D., Adam E. Flanders, M.D., and George Shih, M.D.

Radiology is edited by Linda Moy, M.D., New York University, New York, N.Y., and owned and published by the Radiological Society of North America, Inc. (https://pubs.rsna.org/journal/radiology)

RSNA is an association of radiologists, radiation oncologists, medical physicists and related scientists promoting excellence in patient care and health care delivery through education, research and technologic innovation. The Society is based in Oak Brook, Illinois. (RSNA.org)

For patient-friendly information on how to read a radiology report, visit RadiologyInfo.org.

END



ELSE PRESS RELEASES FROM THIS DATE:

Climate change emerges as third major threat to global wildlife, scientists warn

2025-05-20
New research published in BioScience reveals that climate change is rapidly emerging as a third major threat to Earth's wild animals, joining habitat alteration and overexploitation in what scientists call a shift from "twin to triple threats." The research team, led by William J. Ripple of Oregon State University, analyzed data for 70,814 animal species from 35 classes, using two publicly available biodiversity datasets to assess climate change vulnerability among the world's wild animal populations. Their ...

New blood test developed at Mass General Brigham shows superior sensitivity in detecting HPV-associated head and neck cancers

2025-05-20
A new liquid biopsy blood test could help detect cases of human papillomavirus (HPV)-associated head and neck cancers with significantly higher accuracy than currently used methods, including before patients develop symptoms, according to new Mass General Brigham research. The researchers at Mass Eye and Ear, a member of the Mass General Brigham healthcare system, found that the blood-based diagnostic test they developed called HPV-DeepSeek achieved 99% sensitivity and 99% specificity for diagnosing cancer at the time of first clinical presentation, including ...

The hidden drivers of aging: microbial influence on genomic stability and telomere dynamics

2025-05-20
Aging is a multifaceted process driven by interconnected biological mechanisms, among which genomic instability and telomere attrition stand as primary hallmarks. Emerging research underscores the pivotal role of the human microbiome in modulating these processes, offering novel insights into aging and age-related diseases. This review synthesizes current evidence on how microbial dysbiosis accelerates aging by disrupting genomic integrity and telomere dynamics, while also exploring therapeutic strategies to promote healthy ...

Neurosymbolic AI could be leaner and smarter

2025-05-20
Could AI that thinks more like a human be more sustainable than today’s LLMs? The AI industry is dominated by large companies with deep pockets and a gargantuan appetite for energy to power their models’ mammoth computing needs. Data centers supporting AI already account for up to 3.7% of global greenhouse emissions. In a Perspective, Alvaro Velasquez and colleagues propose an alternative model: neurosymbolic AI, which would require far less computing power, creating opportunities for smaller players to enter the field and allowing society to enjoy the benefits of AI without the environmental costs. Neurosymbolic AI is built on data-driven neural ...

Intuition-guided reinforcement learning for soft tissue manipulation with unknown constraints

2025-05-20
A research paper by scientists at Hefei University of Technology presented an intuition-guided deep reinforcement learning framework for soft tissue manipulation under unknown constraints. The research paper, published on Apr. 14, 2025 in the journal Cyborg and Bionic Systems. Intraoperative soft tissue manipulation is a critical challenge in autonomous robotic surgery. Furthermore, the intricate in vivo environment surrounding the target soft tissues poses additional hindrances to autonomous robotic decision-making. Previous studies assumed the grasping point was known and the target deformation could be achieved. The constraints were assumed to be constant during the ...

Mount Sinai surgeons perform first heart-liver-kidney transplants in New York State

2025-05-20
A team of Mount Sinai surgeons has performed the first heart-liver-kidney triple organ transplants in New York. They successfully completed two of these complex surgeries on patients from Westchester County, who have since returned home and are making full recoveries. Heart-liver-kidney transplants are extremely rare—only 58 have been done across the country since the United Network for Organ Sharing, the government agency that oversees transplantation, started tracking cases in 1987. The two procedures at The Mount Sinai Hospital, which took place on January 10 and March 8, were among only four to date in the ...

‘Sharkitecture:’ A nanoscale look inside a blacktip shark’s skeleton

2025-05-20
Sharks have been evolving for more than 450 million years, developing skeletons not from bone, but from a tough, mineralized form of cartilage. These creatures are more than just fast swimmers – they’re built for efficiency. Their spines act like natural springs, storing and releasing energy with each tailbeat, allowing them to move through the water with smooth, powerful grace. Now, scientists are peering inside shark skeletons at the nanoscale, revealing a microscopic “sharkitecture” that helps these ancient apex predators withstand extreme physical demands of constant motion. Using synchrotron X-ray nanotomography with detailed ...

Public opinion on who should do content moderation

2025-05-20
Americans perceive small juries of content experts as the most legitimate moderators of potentially misleading content on social media, according to a survey, but perceive large, nationally representative or politically balanced juries with minimum knowledge qualifications as comparably legitimate. Social media content moderation policies tend to attract criticism, with some calling for more aggressive removal of harmful and misleading content and others decrying moderation as censorship and accusing expert moderators of being politically biased. Less clear is what the general public would like to see in terms of content ...

Accounting for marine ecosystems in China promises greater environmental and economic sustainability

2025-05-20
A Perspective proposes a pathway to improvements in sustainability of marine ecosystems and resources in China. Based on environmental accounting used in China’s terrestrial ecosystems, the approach would implement policy and governance to ensure accountability for sustainable use of marine systems. Laurence J. McCook and colleagues argue that the ecosystem goods and services provided to the nation by oceans and coastal ecosystems—including seagrass beds, salt marshes, coral reefs, and mangrove forests—are ...

Diabetes drug gives hope for new treatment for prostate cancer

2025-05-20
A drug used to treat type 2 diabetes may also be effective in slowing the progression of prostate cancer. This is shown by an international study in which researchers at Umeå University, Sweden, have participated. The researchers have found that drugs that regulate a particular protein have a key role in reducing prostate cancer recurrence among diabetic patients. "This is a significant discovery. For the first time, we have clinical observations showing that prostate cancer patients with diabetes who received drugs targeting the protein remained relapse-free during the period we followed them," ...

LAST 30 PRESS RELEASES:

Synergistic effects of single-crystal HfB2 nanorods: Simultaneous enhancement of mechanical properties and ablation resistance

Mysterious X-ray variability of the strongly magnetized neutron star NGC 7793 P13

The key to increasing patients’ advance care medical planning may be automatic patient outreach

Palaeontology: Ancient tooth suggests ocean predator could hunt in rivers

Polar bears may be adapting to survive warmer climates, says study

Canadian wildfire smoke worsened pediatric asthma in US Northeast: UVM study

New UBCO research challenges traditional teen suicide prevention models

Diversity language in US medical research agency grants declined 25% since 2024

Concern over growing use of AI chatbots to stave off loneliness

Biomedical authors often call a reference “recent” — even when it is decades old, analysis shows

The Lancet: New single dose oral treatment for gonorrhoea effectively combats drug-resistant infections, trial finds

Proton therapy shows survival benefit in Phase III trial for patients with head and neck cancers

Blood test reveals prognosis after cardiac arrest

UBCO study finds microdosing can temporarily improve mood, creativity

An ECOG-ACRIN imaging study solves a long-standing gap in metastatic breast cancer research and care: accurately measuring treatment response in patients with bone metastases

Cleveland Clinic presents final results of phase 1 clinical trial of preventive breast cancer vaccine study

Nationally renowned anesthesiology physician-scientist and clinical operations leader David Mintz, MD, PhD, named Chair of the Department of Anesthesiology at the UM School of Medicine

Clean water access improves child health in Mozambique, study shows

Study implicates enzyme in neurodegenerative conditions

Tufts professor named Fellow of the National Academy of Inventors

Tiny new device could enable giant future quantum computers

Tracing a path through photosynthesis to food security

First patient in Arizona treated with new immune-cell therapy at HonorHealth Research Institute

Studies investigate how AI can aid clinicians in analyzing medical images

Researchers pitch strategies to identify potential fraudulent participants in online qualitative research

Sweeping study shows similar genetic factors underlie multiple psychiatric disorders

How extreme weather events affect agricultural trade between US states

Smallholder farms maintain strong pollinator diversity – even when far from forests

Price of a bot army revealed across hundreds of online platforms worldwide – from TikTok to Amazon

Warblers borrow color-related genes from evolutionary neighbors, study finds

[Press-News.org] Fine-tuned LLMs boost error detection in radiology reports