(Press-News.org) The drug development pipeline is a costly and lengthy process. Identifying high-quality “hit” compounds—those with high potency, selectivity, and favorable metabolic properties—at the earliest stages is important for reducing cost and accelerating the path to clinical trials. For the last decade, scientists have looked to machine learning to make this initial screening process more efficient.
Computer-aided drug design is used to computationally screen for compounds that interact with a target protein. However, the ability to accurately and rapidly estimate the strength of these interactions remains a challenge.
“Machine learning promised to bridge the gap between the accuracy of gold-standard, physics-based computational methods and the speed of simpler empirical scoring functions,” said Dr. Benjamin P. Brown, an assistant professor of pharmacology at the Vanderbilt University School of Medicine Basic Sciences. “Unfortunately, its potential has so far been unrealized because current ML methods can unpredictably fail when they encounter chemical structures that they were not exposed to during their training, which limits their usefulness for real-world drug discovery.”
Brown is the single author on a recent Proceedings of the National Academy of Sciences paper that addresses this “generalizability gap.” In the paper, he proposes a targeted approach: Instead of learning from the entire 3D structure of a protein and a drug molecule, Brown proposes a task-specific model architecture that is intentionally restricted to learn only from a representation of their interaction space, which captures the distance-dependent physicochemical interactions between atom pairs.
“By constraining the model to this view, it is forced to learn the transferable principles of molecular binding rather than structural shortcuts present in the training data that fail to generalize to new molecules,” Brown said.
A key aspect of Brown’s work was the rigorous evaluation protocol he developed. “We set up our training and testing runs to simulate a real-world scenario: ‘If a novel protein family were discovered tomorrow, would our model be able to make effective predictions for it?’” he said. To do this, he left out entire protein superfamilies and all their associated chemical data from the training set, creating a challenging and realistic test of the model’s ability to generalize.
Brown’s work provides several key insights for the field:
Task-specific specialized architectures provide a clear avenue for building generalizable models using today’s publicly available datasets. By designing a model with a specific “inductive bias” that forces it to learn from a representation of molecular interactions rather than from raw chemical structures, it generalizes more effectively.
Rigorous, realistic benchmarks are critical. The paper’s validation protocol revealed that contemporary ML models performing well on standard benchmarks can show a significant drop in performance when faced with novel protein families. This highlights the need for more stringent evaluation practices in the field to accurately gauge real-world utility.
Current performance gains over conventional scoring functions are modest, but the work establishes a clear, reliable baseline for a modeling strategy that doesn't fail unpredictably, which is a critical step toward building trustworthy AI for drug discovery.
Brown, a core faculty member of the Center for AI in Protein Dynamics, knows that there is more work to be done. His current project focused exclusively on scoring—ranking compounds based on the strength of their interaction with the target protein—which is only part of the structure-based drug discovery equation. “My lab is fundamentally interested in modeling challenges related to scalability and generalizability in molecular simulation and computer-aided drug design. Hopefully soon we can share some additional work that aims to advance these principles,” Brown said.
For now, significant challenges remain, but Brown’s work on building a more dependable approach for machine learning in structure-based computer-aided drug design has clarified the path forward.
Go deeper
The paper “A Generalizable Deep Learning Framework for Structure-Based Protein-Ligand Affinity Ranking” was published in PNAS in October 2025.
Funding
This research used funds from the National Institute on Drug Abuse.
School of Medicine Basic Sciences shared resources
This research was supported by the Center for AI in Protein Dynamics and the Center for Structural Biology.
END
Vanderbilt scientist tackles key roadblock for AI in drug discovery
Vanderbilt’s Dr. Benjamin P. Brown is improving the way the field of drug discovery creates machine learning algorithms to predict a protein’s interactions with a small molecule. These improvements bring ML closer to fulfilling its potential in the fi
2025-10-16
ELSE PRESS RELEASES FROM THIS DATE:
Overheating bat boxes place bats in mortal danger during heatwaves
2025-10-16
Staying cool during heatwaves is challenging for small creatures, but the problem could be even more extreme for nocturnal creatures that are unable to move to cooler locations while slumbering. ‘Roosting bats may face lethally high body temperatures during extremely hot days’, says Ruvinda de Mel, from the University of New England, Australia. And bat boxes are often designed to retain heat to keep bats cozy, which could place the animals at even greater risk during heatwaves, depending on the box’s position ...
Study shows medical-legal partnerships aid recovery for patients with violent injuries
2025-10-16
Researchers at the University of Chicago have found that patients with violent injuries often face legal and financial needs that can have an impact on their recovery—and that providing legal help at the bedside can make a measurable difference.
The study, published in JAMA Network Open, evaluated the Recovery Legal Care program at the University of Chicago Medical Center, the nation’s first medical-legal partnership embedded in a trauma center.
The team of UChicago investigators, led by ...
Learning the language of lasso peptides to improve peptide engineering
2025-10-16
In the hunt for new therapeutics for cancer and infectious diseases, lasso peptides prove to be a catch. Their knot-like structures afford these molecules high stability and diverse biological activities, making them a promising avenue for new therapeutics. To better unleash their clinical potential, a team from the Carl R. Woese Institute for Genomic Biology developed LassoESM, a new large language model for predicting lasso peptide properties.
The collaborative study was recently published in Nature Communications.
Lasso peptides are natural products made by bacteria. To produce these peptides, bacteria use ribosomes to build chains of amino acids that are then folded by biosynthetic ...
Social conflict among strongest predictors of teen mental health concerns
2025-10-16
A new study from researchers at Washington University School of Medicine in St. Louis provides some answers. Published Sept. 15 in Nature Mental Health, it mined an enormous set of data collected from pre-teens and teens across the U.S. and found that social conflicts — particularly family fighting and reputational damage or bullying from peers — were the strongest predictors of near- and long-term mental health issues. The research also revealed sex differences in how boys and girls experience stress from ...
New framework can improve the planning stage of surgical quality improvement projects
2025-10-16
Key Takeaways
An evaluation of 50 surgical QI projects found that only one scored above 70% on criteria for a well-conducted effort, with major deficits in the critical early "front-end" planning stage.
The new EPoSSI framework provides a structured, nine-step guide and checklist to help clinicians systematically plan projects before launch.
In testing, using the full EPoSSI tool (diagram and guidance table) led to an increase in planning comprehensiveness, with participants meeting 100% of scoring criteria compared to just 24% without the framework.
CHICAGO ...
Research shows anger, not fear, shifts political beliefs
2025-10-16
Political attitudes and opinions can and do shift, sometimes drastically. Recent psychological research from Washington University in St. Louis offers insight into how emotional responses to threats contribute to shifts in political attitudes.
One striking example of how emotions drive political shifts is that people tend to become more supportive of conservative views during times of external, or foreign, threat.
Immediately after the 9/11 attacks, for example, national polls showed that support for President George W. Bush — a moderately conservative Republican — soared by 39 points to a record-breaking ...
Gale and Ira Drukier Prize in Children’s Health Research awarded to pediatric rheumatologist at Boston Children’s Hospital
2025-10-16
Dr. Lauren Henderson, a physician-scientist whose research focuses on children with difficult-to-treat juvenile idiopathic arthritis and other autoimmune disorders, has been awarded the 10th annual Gale and Ira Drukier Prize in Children’s Health Research, Weill Cornell Medicine announced today.
The Drukier Prize honors an early-career pediatrician whose research promises to make important contributions toward improving the health of children and adolescents. Dr. Henderson is an associate professor of pediatrics at Harvard Medical School and a pediatric rheumatologist ...
UNF chemistry professor awarded NSF Grant to advance laser-based measurement technology
2025-10-16
The University of North Florida has been awarded a National Science Foundation (NSF) grant to advance laser-based measurement technology to find more accurate and reliable chemical measurements across diverse scientific fields.
Dr. Willis Jones, assistant professor of chemistry and biochemistry, will lead the study that will pursue groundbreaking advances in laser-induced breakdown spectroscopy (LIBS), a powerful but often limited analytical technique.
LIBS uses a high-powered laser to create a small plasma that reveals the elemental compositions of solids, liquids and gases with minimal preparation. While powerful, the method is hindered by ...
Research shows how Dust Bowl-type drought causes unprecedented productivity loss
2025-10-16
EMBARGO: THIS CONTENT IS UNDER EMBARGO UNTIL 2 P.M. U.S. EASTERN STANDARD TIME ON OCT. 16, 2025. INTERESTED MEDIA MAY RECIVE A PREVIEW COPY OF THE JOURNAL ARTICLE IN ADVANCE OF THAT DATE OR CONDUCT INTERVIEWS, BUT THE INFORMATION MAY NOT BE PUBLISHED, BROADCAST, OR POSTED ONLINE UNTIL AFTER THE RELEASE WINDOW.
A global research effort led by Colorado State University shows that extreme, prolonged drought conditions in grasslands and shrublands would greatly limit the long-term health of crucial ecosystems that cover nearly half the planet. ...
Non-hibernating pikas' protein restriction tweaks their gut microbiome to help them survive the winter, when winter-active herbivores often struggle to find dietary protein
2025-10-16
Non-hibernating pikas' protein restriction tweaks their gut microbiome to help them survive the winter, when winter-active herbivores often struggle to find dietary protein
In your coverage, please use this URL to provide access to the freely available paper in PLOS Biology: https://plos.io/4nI13TV
Article title: Increased urea nitrogen salvaging by a remodeled gut microbiota helps nonhibernating pikas maintain protein homeostasis during winter
Author countries: China, Israel
Funding: see manuscript END ...
LAST 30 PRESS RELEASES:
AI analysis of world’s largest heart attack datasets opens way to new treatment strategies
Decoding dangers of Arctic sea ice with seismic, radar method
Counting bites with AI might one day help prevent childhood obesity
Utah chemists discover enzyme that could help build next-generation GLP-1 drugs
Surprising bacteria discovery links Hawaiʻi’s groundwater to the ocean
New grants for schools offer CPR training and resources to make campuses safer
30 NFL players urge fans to join Nation of Lifesavers, learn lifesaving CPR
Study finds humans outweigh climate in depleting Arizona's water supply
Old-school material could power quantum computing, cut data center energy use
Vanderbilt scientist tackles key roadblock for AI in drug discovery
Overheating bat boxes place bats in mortal danger during heatwaves
Study shows medical-legal partnerships aid recovery for patients with violent injuries
Learning the language of lasso peptides to improve peptide engineering
Social conflict among strongest predictors of teen mental health concerns
New framework can improve the planning stage of surgical quality improvement projects
Research shows anger, not fear, shifts political beliefs
Gale and Ira Drukier Prize in Children’s Health Research awarded to pediatric rheumatologist at Boston Children’s Hospital
UNF chemistry professor awarded NSF Grant to advance laser-based measurement technology
Research shows how Dust Bowl-type drought causes unprecedented productivity loss
Non-hibernating pikas' protein restriction tweaks their gut microbiome to help them survive the winter, when winter-active herbivores often struggle to find dietary protein
Not for hearing but for symbiosis
Disconnected cerebral hemisphere in epilepsy patients shows sleep-like state during wakefulness
Incentivizing risk to inspire investments in clean innovation for aviation
Stinkbug leg organ contains symbiotic fungi to shield eggs from parasitic wasps
Extreme, multi-year droughts drive cumulative collapse in terrestrial productivity
Researchers chart path for investors to build a cleaner aviation industry
USTC scientists uncover mystery of neurotransmission with time-resolved cryo-ET
New study finds large fluctuations in sea level occurred throughout the last ice age, a significant shift in understanding of past climate
Study reveals how bacteria in tumors drive treatment resistance in cancer
Language barriers in health care have fallen – but not online, study shows
[Press-News.org] Vanderbilt scientist tackles key roadblock for AI in drug discoveryVanderbilt’s Dr. Benjamin P. Brown is improving the way the field of drug discovery creates machine learning algorithms to predict a protein’s interactions with a small molecule. These improvements bring ML closer to fulfilling its potential in the fi