PRESS-NEWS.org - Press Release Distribution

Faster and more reliable crystal structure prediction of organic molecules

Researchers develop a machine learning-based workflow for crystal structure prediction of organic molecules

2025-10-29

Prediction of crystal structures of organic molecules is a critical task in many industries, especially in pharmaceuticals and design of functional materials. In pharmaceuticals, crystal structures directly influence a drug’s solubility and stability. In functional materials, like organic semiconductors, controlling crystal structures is crucial for achieving desired electronic properties. However, crystal structure prediction (CSP) is an inherently challenging task due to the weak and diverse intra- and inter-molecular interactions unique to organic crystals. Even minor variations can result in entirely different packing arrangements.

CSP is typically conducted in two stages: structure exploration and structure relaxation. In the first stage, a large number of candidate structures are generated, often at random; various search algorithms have been developed for this purpose. In the second stage, these structures are refined by energy minimization to identify the most stable configurations. However, random structure generation often produces many low-density, unstable structures, while conventional density functional theory (DFT)-based methods for structure relaxation are computationally expensive and time-consuming.
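The two-stage pattern described above can be illustrated with a deliberately small sketch. Everything in it is an assumption made for illustration: a "structure" is reduced to a single coordinate, toy_energy is an invented double-well function standing in for a DFT or neural network potential energy, and relaxation is plain gradient descent. It is not the researchers' code.

```python
import random


def toy_energy(x: float) -> float:
    """Invented double-well energy; stands in for a real lattice-energy model."""
    return (x * x - 1.0) ** 2 + 0.3 * x


def toy_gradient(x: float) -> float:
    """Analytical derivative of toy_energy."""
    return 4.0 * x * (x * x - 1.0) + 0.3


def explore(n_candidates: int, rng: random.Random) -> list[float]:
    """Stage 1 (exploration): generate candidate 'structures' at random."""
    return [rng.uniform(-2.0, 2.0) for _ in range(n_candidates)]


def relax(x: float, step: float = 0.01, n_steps: int = 2000) -> float:
    """Stage 2 (relaxation): move a candidate downhill to a local energy minimum."""
    for _ in range(n_steps):
        x -= step * toy_gradient(x)
    return x


def predict(n_candidates: int = 20, seed: int = 0) -> tuple[float, float]:
    """Run both stages and return the lowest-energy relaxed candidate."""
    rng = random.Random(seed)
    relaxed = [relax(x) for x in explore(n_candidates, rng)]
    best = min(relaxed, key=toy_energy)
    return best, toy_energy(best)


if __name__ == "__main__":
    best_x, best_e = predict()
    print(f"lowest-energy candidate: x = {best_x:.3f}, E = {best_e:.3f}")
```

In a real workflow each call to the relaxation step is the expensive part, which is why discarding poor candidates before relaxation saves time.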

To address these challenges, Associate Professor Takuya Taniguchi from the Center for Data Science and Ryo Fukasawa from the Graduate School of Advanced Science and Engineering at Waseda University, Japan, developed a breakthrough machine learning (ML)-based CSP workflow called SPaDe-CSP that leverages space group (SP) and packing density (PD) predictors. “Our workflow employs a unique strategy where machine learning models first predict the most probable space groups and crystal densities, filtering out unstable, low-density candidates before computationally intensive relaxation steps,” explains Taniguchi. “Together with an efficient neural network potential for structure relaxation, this method enables a more direct and reliable path to identifying experimentally observed crystal arrangements.” Their study was published in the journal Digital Discovery on 13 October 2025.

SPaDe-CSP narrows the search space for organic crystals by first predicting probable space group candidates and crystal densities using ML models. For training and testing, the researchers extracted a dataset from the Cambridge Structural Database (CSD) comprising 169,656 entries across 32 candidate space groups. Both prediction models used MACCSKeys as the molecular fingerprint and LightGBM as the prediction function. The researchers also interpreted the trained models using Shapley additive explanations (SHAP) analysis to identify the structural characteristics that matter most for effective predictions.

After lattice sampling, the generated unrelaxed structures are relaxed using an efficient neural network potential (NNP) pretrained on DFT data, ultimately producing the energy-density diagram of the target molecule. Two hyperparameters control the SPaDe-CSP process: the probability threshold for filtering space groups and the tolerance window for the crystal density.
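How the two hyperparameters might act as filters can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the published implementation: the function names, the "probability at or above threshold" rule, the absolute plus-or-minus density window in g/cm^3, and all numbers in the demo are hypothetical. Only the density formula (cell contents divided by cell volume, with the unit conversion 1e24 / Avogadro's number) is standard.

```python
# Converts (g/mol) / Å^3 to g/cm^3: 1e24 / 6.02214e23
G_PER_MOL_PER_A3_TO_G_PER_CM3 = 1.66054


def crystal_density(z: int, molar_mass: float, cell_volume: float) -> float:
    """Density in g/cm^3 for z molecules of molar_mass (g/mol) in a cell of cell_volume (Å^3)."""
    return z * molar_mass * G_PER_MOL_PER_A3_TO_G_PER_CM3 / cell_volume


def select_space_groups(probabilities: dict[str, float], threshold: float) -> list[str]:
    """Keep space groups whose predicted probability reaches the threshold (assumed rule)."""
    ranked = sorted(probabilities.items(), key=lambda item: item[1], reverse=True)
    return [name for name, p in ranked if p >= threshold]


def within_density_window(density: float, predicted: float, tolerance: float) -> bool:
    """Accept a sampled lattice only if its density is close to the predicted one (assumed rule)."""
    return abs(density - predicted) <= tolerance


if __name__ == "__main__":
    # Hypothetical model outputs for one molecule.
    probs = {"P2_1/c": 0.45, "P-1": 0.25, "P2_1 2_1 2_1": 0.15, "C2/c": 0.10, "Pbca": 0.05}
    predicted_density = 1.40  # g/cm^3, hypothetical

    print(select_space_groups(probs, threshold=0.10))

    # Two hypothetical sampled cells with Z = 4 and molar mass 180.16 g/mol.
    for volume in (850.0, 1500.0):
        rho = crystal_density(4, 180.16, volume)
        keep = within_density_window(rho, predicted_density, tolerance=0.10)
        print(f"V = {volume:.0f} Å^3, density = {rho:.3f} g/cm^3, keep = {keep}")
```

Under these assumptions, raising the threshold or shrinking the tolerance leaves fewer candidates for the relaxation step, at the risk of excluding the correct structure when a prediction is off.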

The researchers tested the workflow first on a model molecule from the CSD dataset to investigate the dependence of success rate on the hyperparameters, and then on 20 different organic molecules, including the model molecule, to test generalizability. The results were successfully validated against the known experimental crystal structures of the molecules, and also compared against the results obtained from conventional random-CSP.

The results revealed that the probability of success increases with a higher space group probability threshold and a smaller density tolerance window. For 80% of the tested compounds, SPaDe-CSP successfully predicted the experimental crystal structures, achieving twice the success rate of random-CSP. Notably, the researchers also identified a key structural descriptor that correlates linearly with success rate, indicating both crystal- and molecule-level structural influences.

“Our strategy can significantly accelerate the design and discovery pipeline for new molecules within the pharmaceutical and materials science industries,” says Taniguchi. “This will enable faster, more reliable identification of most stable, effective physical form of a new drug, important for maintaining solubility, shelf life, and overall efficacy, and allow computational screening of novel functional materials with optimal electronic properties.”

By making CSP faster and more reliable, this research marks an important step towards accelerating the discovery of life-saving medications and next-generation technologies.

***

Reference
Authors: Takuya Taniguchi*(a) and Ryo Fukasawa(b)
DOI: 10.1039/d5dd00304k
Affiliations:
(a) Center for Data Science, Waseda University, Japan
(b) Graduate School of Advanced Science and Engineering, Waseda University, Japan

About Waseda University
Located in the heart of Tokyo, Waseda University is a leading private research university that has been dedicated to academic excellence, innovative research, and civic engagement at both the local and global levels since 1882. The University has produced many changemakers in its history, including eight prime ministers and many leaders in business, science and technology, literature, sports, and film. Waseda has strong collaborations with overseas research institutions and is committed to advancing cutting-edge research and developing leaders who can contribute to the resolution of complex, global social issues. The University has set a target of achieving a zero-carbon campus by 2032, in line with the Sustainable Development Goals (SDGs) adopted by the United Nations in 2015.
To learn more about Waseda University, visit https://www.waseda.jp/top/en


About Associate Professor Takuya Taniguchi
Dr. Takuya Taniguchi is an Associate Professor at the Center for Data Science at Waseda University, Japan. He received a Doctor of Engineering degree from the Department of Advanced Science and Engineering, Graduate School of Advanced Science and Engineering, Waseda University, in 2019. His research areas of interest include structural organic chemistry, physical organic chemistry, organic functional materials, materials informatics, and materials science. His publications have received over 500 citations.

END


