PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Fine-tuned LLMs boost error detection in radiology reports

2025-05-20
(Press-News.org) OAK BROOK, Ill. – A type of artificial intelligence called fine-tuned large language models (LLMs) greatly enhances error detection in radiology reports, according to a new study published today in Radiology, a journal of the Radiological Society of North America (RSNA). Researchers said the findings point to an important role for this technology in medical proofreading.

Radiology reports are crucial for optimal patient care. Their accuracy can be compromised by factors like errors in speech recognition software, variability in perceptual and interpretive processes and cognitive biases. These errors can lead to incorrect diagnoses or delayed treatments, making the need for accurate reports urgent.

LLMs like ChatGPT are advanced generative AI models that are trained on vast amounts of text to generate human language. While they offer great potential in proofreading, their application in the medical field, particularly in detecting errors within radiology reports, remains underexplored.

To bridge this gap in knowledge, researchers evaluated fine-tuned LLMs for detecting errors in radiology reports during medical proofreading. A fine-tuned LLM is a pre-trained language model that is further trained on domain-specific data.

“Initially, LLMs are trained on large-scale public data to learn general language patterns and knowledge,” said study senior author Yifan Peng, Ph.D., from the Department of Population Health Sciences at Weill Cornell Medicine in New York City. “Fine-tuning occurs as the next step, where the model undergoes additional training using smaller, targeted datasets relevant to particular tasks.”

To test the model, Dr. Peng and colleagues built a dataset with two parts. The first consisted of 1,656 synthetic reports, including 828 error-free reports and 828 reports with errors. The second part comprised 614 reports, including 307 error-free reports from MIMIC-CXR, a large, publicly available database of chest X-rays, and 307 synthetic reports with errors.

The researchers used the synthetic reports to boost the amount of training data and fulfill the data-hungry needs of LLM fine-tuning.

“Synthetic reports can also increase the coverage and diversity, balance out the cases and reduce the annotation costs,” said the study’s first author, Cong Sun, Ph.D., from Dr. Peng’s lab. “In radiology, or more broadly, the clinical domain, synthetic reports allow safe data-sharing without compromising patient privacy.”

The researchers found that the fine-tuned model outperformed both GPT-4 and BiomedBERT, a natural language processing tool for biomedical research.

“The LLM that was fine-tuned on both MIMIC-CXR and synthetic reports demonstrated strong performance in the error detection tasks,” Dr. Sun said. “It meets our expectations and highlights the potential for developing lightweight, fine-tuned LLM specifically for medical proofreading applications.”

The study provided evidence that LLMs can assist in detecting various types of errors, including transcription errors and left/right errors, which refer to misidentification or misinterpretation of directions or sides in text or images.

The use of synthetic data in AI model building has raised concerns of bias in the data. Dr. Peng and colleagues took steps to minimize this by using diverse and representative samples of real-world data to generate the synthetic data. However, they acknowledged that synthetic errors may not fully capture the complexity of real-world errors in radiology reports. Future work could include a systematic evaluation of how bias introduced by synthetic errors affects model performance.

The researchers hope to study fine-tuning’s ability to reduce radiologists’ cognitive load and enhance patient care and find out if fine-tuning would degrade the model’s ability to generate reasoning explanations.

“We are excited to keep exploring innovative strategies to enhance the reasoning capabilities of fine-tuned LLMs in medical proofreading tasks,” Dr. Peng said. “Our goal is to develop transparent and understandable models that radiologists can confidently trust and fully embrace.”

###

“Generative Large Language Models Trained for Detecting Errors in Radiology Reports.” Collaborating with Drs. Peng and Sun were Kurt Teichman, M.S., Yiliang Zhou, M.S., Brian Critelli, B.S., David Nauheim, M.D.,

Graham Keir, M.D., Xindi Wang, Ph.D., Judy Zhong, Ph.D., Adam E. Flanders, M.D., and George Shih, M.D.

Radiology is edited by Linda Moy, M.D., New York University, New York, N.Y., and owned and published by the Radiological Society of North America, Inc. (https://pubs.rsna.org/journal/radiology)

RSNA is an association of radiologists, radiation oncologists, medical physicists and related scientists promoting excellence in patient care and health care delivery through education, research and technologic innovation. The Society is based in Oak Brook, Illinois. (RSNA.org)

For patient-friendly information on how to read a radiology report, visit RadiologyInfo.org.

END



ELSE PRESS RELEASES FROM THIS DATE:

Climate change emerges as third major threat to global wildlife, scientists warn

2025-05-20
New research published in BioScience reveals that climate change is rapidly emerging as a third major threat to Earth's wild animals, joining habitat alteration and overexploitation in what scientists call a shift from "twin to triple threats." The research team, led by William J. Ripple of Oregon State University, analyzed data for 70,814 animal species from 35 classes, using two publicly available biodiversity datasets to assess climate change vulnerability among the world's wild animal populations. Their ...

New blood test developed at Mass General Brigham shows superior sensitivity in detecting HPV-associated head and neck cancers

2025-05-20
A new liquid biopsy blood test could help detect cases of human papillomavirus (HPV)-associated head and neck cancers with significantly higher accuracy than currently used methods, including before patients develop symptoms, according to new Mass General Brigham research. The researchers at Mass Eye and Ear, a member of the Mass General Brigham healthcare system, found that the blood-based diagnostic test they developed called HPV-DeepSeek achieved 99% sensitivity and 99% specificity for diagnosing cancer at the time of first clinical presentation, including ...

The hidden drivers of aging: microbial influence on genomic stability and telomere dynamics

2025-05-20
Aging is a multifaceted process driven by interconnected biological mechanisms, among which genomic instability and telomere attrition stand as primary hallmarks. Emerging research underscores the pivotal role of the human microbiome in modulating these processes, offering novel insights into aging and age-related diseases. This review synthesizes current evidence on how microbial dysbiosis accelerates aging by disrupting genomic integrity and telomere dynamics, while also exploring therapeutic strategies to promote healthy ...

Neurosymbolic AI could be leaner and smarter

2025-05-20
Could AI that thinks more like a human be more sustainable than today’s LLMs? The AI industry is dominated by large companies with deep pockets and a gargantuan appetite for energy to power their models’ mammoth computing needs. Data centers supporting AI already account for up to 3.7% of global greenhouse emissions. In a Perspective, Alvaro Velasquez and colleagues propose an alternative model: neurosymbolic AI, which would require far less computing power, creating opportunities for smaller players to enter the field and allowing society to enjoy the benefits of AI without the environmental costs. Neurosymbolic AI is built on data-driven neural ...

Intuition-guided reinforcement learning for soft tissue manipulation with unknown constraints

2025-05-20
A research paper by scientists at Hefei University of Technology presented an intuition-guided deep reinforcement learning framework for soft tissue manipulation under unknown constraints. The research paper, published on Apr. 14, 2025 in the journal Cyborg and Bionic Systems. Intraoperative soft tissue manipulation is a critical challenge in autonomous robotic surgery. Furthermore, the intricate in vivo environment surrounding the target soft tissues poses additional hindrances to autonomous robotic decision-making. Previous studies assumed the grasping point was known and the target deformation could be achieved. The constraints were assumed to be constant during the ...

Mount Sinai surgeons perform first heart-liver-kidney transplants in New York State

2025-05-20
A team of Mount Sinai surgeons has performed the first heart-liver-kidney triple organ transplants in New York. They successfully completed two of these complex surgeries on patients from Westchester County, who have since returned home and are making full recoveries. Heart-liver-kidney transplants are extremely rare—only 58 have been done across the country since the United Network for Organ Sharing, the government agency that oversees transplantation, started tracking cases in 1987. The two procedures at The Mount Sinai Hospital, which took place on January 10 and March 8, were among only four to date in the ...

‘Sharkitecture:’ A nanoscale look inside a blacktip shark’s skeleton

2025-05-20
Sharks have been evolving for more than 450 million years, developing skeletons not from bone, but from a tough, mineralized form of cartilage. These creatures are more than just fast swimmers – they’re built for efficiency. Their spines act like natural springs, storing and releasing energy with each tailbeat, allowing them to move through the water with smooth, powerful grace. Now, scientists are peering inside shark skeletons at the nanoscale, revealing a microscopic “sharkitecture” that helps these ancient apex predators withstand extreme physical demands of constant motion. Using synchrotron X-ray nanotomography with detailed ...

Public opinion on who should do content moderation

2025-05-20
Americans perceive small juries of content experts as the most legitimate moderators of potentially misleading content on social media, according to a survey, but perceive large, nationally representative or politically balanced juries with minimum knowledge qualifications as comparably legitimate. Social media content moderation policies tend to attract criticism, with some calling for more aggressive removal of harmful and misleading content and others decrying moderation as censorship and accusing expert moderators of being politically biased. Less clear is what the general public would like to see in terms of content ...

Accounting for marine ecosystems in China promises greater environmental and economic sustainability

2025-05-20
A Perspective proposes a pathway to improvements in sustainability of marine ecosystems and resources in China. Based on environmental accounting used in China’s terrestrial ecosystems, the approach would implement policy and governance to ensure accountability for sustainable use of marine systems. Laurence J. McCook and colleagues argue that the ecosystem goods and services provided to the nation by oceans and coastal ecosystems—including seagrass beds, salt marshes, coral reefs, and mangrove forests—are ...

Diabetes drug gives hope for new treatment for prostate cancer

2025-05-20
A drug used to treat type 2 diabetes may also be effective in slowing the progression of prostate cancer. This is shown by an international study in which researchers at Umeå University, Sweden, have participated. The researchers have found that drugs that regulate a particular protein have a key role in reducing prostate cancer recurrence among diabetic patients. "This is a significant discovery. For the first time, we have clinical observations showing that prostate cancer patients with diabetes who received drugs targeting the protein remained relapse-free during the period we followed them," ...

LAST 30 PRESS RELEASES:

Research reveals unexpected roles of TEAD proteins in neurodevelopment

UTA ATLAS team shares Breakthrough Prize in physics

New research on ALS opens up for early treatment

Molecules in blood and urine could reveal how much ultra-processed food you eat

Language isn’t just for communication — it also shapes how sensory experiences are stored in the brain

Reducing underwater noise when installing subsea structures #ASA188

How membranes may have brought about the chemistry of life on earth

NIH researchers develop biomarker score for predicting diets high in ultra-processed foods

AI and partnerships are vital to tackling food contamination - study

Fluridone widens Palmer pigweed control options for rice growers, but stick to the label

Christopher Kane appointed President of American Board of Urology

SwRI breaks pressure and temperature record for sCO2 materials testing

Native turtles return to Yosemite after removal of invasive bullfrogs

Maternal air pollution exposure worsens asthma severity for offspring

Post-intensive care syndrome linked to long-term deficits

ICU delirium tests misclassify Spanish-speakers

Terrence Sejnowski elected to the Royal Society and the American Philosophical Society

Commercially available peroxide binds incompatible polymers for recycling

Depression linked to physical pain years later

Beyond ‘one size fits all’: Study reveals ethnic differences in breast cancer development and outcomes, demanding tailored care approaches

New flammable gas research facility under construction at Southwest Research Institute

Planning grants awarded for competitive proposals testing efficacy of food is medicine

Substance use screening, brief intervention, and referral to treatment among youth-serving clinicians

LJI scientists uncover key clues to how a viral infection can lead to arthritis-like disease

Aging and DNA damage: investigating the microbiome’s stealthy impact – a perspective

Updated economic geography model incorporates heterogeneity in firm productivity and environmental pollution

Magnetic shaftless propeller millirobot with multimodal motion for small-scale fluidic manipulation

Green tea, turmeric, and berries may help reverse epigenetic aging in men

The Online Journal of Public Health Informatics invites submissions on opportunities and challenges in the applications of AI in public health informatics

Thousands of animal species threatened by climate change, novel analysis finds

[Press-News.org] Fine-tuned LLMs boost error detection in radiology reports