(Press-News.org) A team of engineers at the University of California San Diego is making it easier for researchers from a broad range of backgrounds to understand how different species are evolutionarily related, and support the transformative biological and medical applications that rely on these species trees. The researchers developed a scalable, automated and user-friendly tool called ROADIES that allows scientists to infer species trees directly from raw genome data, with less reliance on the domain expertise and computational resources currently required.
Species trees are critical to solidifying our understanding of how species evolved on a broad scale, but can also help find functional regions of the genome that could serve as drug targets; link physical traits to genomic changes; predict and respond to zoonotic outbreaks; and even guide conservation efforts.
In a new paper published in the journal Proceedings of the National Academy of Sciences on May 2, the researchers, led by UC San Diego electrical and computer engineering professor Yatish Turakhia, showed that ROADIES infers species trees that are comparable in quality with the state-of-the-art studies, but in a fraction of the time and effort. This paper focused on four diverse life forms — placental mammals, pomace flies, birds and budding yeasts — though ROADIES can be used for any species.
“Rapid advances in high-throughput sequencing and computational tools have enabled genome assemblies to be produced at scale,” said Anshu Gupta, a computer science PhD student at the Jacobs School of Engineering and the study’s first author. “However, accurately inferring species trees is still beyond the reach of many researchers.”
“ROADIES is a timely and transformative solution to this problem,” added Turakhia. “With its speed, accuracy, and automation, ROADIES has the potential to vastly simplify species tree inference, making it accessible to a broader range of scientists and applications.”
A truly automated process
ROADIES— which stands for “Reference-free, Orthology-free, Annotation-free, Discordance-aware Estimation of Species Trees”-- stands apart from existing phylogenetic tools because it uses a completely automated pipeline yet produces highly accurate results.
One of ROADIES’ key innovations is that instead of using predefined genomic regions with specific characteristics, such as protein-coding genes, ROADIES is based on a random sampling of loci from input genomes. This eliminates the need for genome annotation prior to species tree inference.
"It may seem surprising that reconstructing species trees from randomly selected loci can yield highly accurate results,” said Turakhia. “But our results show that this simple approach is effective, and we believe it can even offer unique benefits, including better adherence to models of sequence evolution.”
Another strategy that proved key to automation is that ROADIES, unlike many existing methods, is able to take advantage of genes that are present in multiple copies across the genome, a phenomenon that is prevalent for many species. ROADIES does this by integrating methods developed at UC San Diego in the lab of Siavash Mirarab, a professor of electrical and computer engineering and co-author of this PNAS study. This strategy allows ROADIES to eliminate the need to infer orthology, or determine the correspondence of individual gene copies in different species.
By removing the need for two cumbersome steps (genome annotation and orthology inference), ROADIES not only overcomes a major barrier to building reliable, fully automated pipelines, but it also requires significantly less computing power than existing tools. The study highlights ROADIES’ scalability to datasets with hundreds of genomes, inferring phylogenies that are concordant with expert-led, large-scale studies, yet requiring a fraction of the time and effort.
The researchers are continuing to improve the capability of ROADIES, including the placement of new taxa on existing species trees and the potential use of GPUs to allow for the processing of tens of thousands of genomes and beyond.
“Large-scale initiatives are already underway to sequence thousands of species—and eventually, potentially every extant eukaryotic species on Earth,” said Turakhia. “We want to ensure ROADIES is ready to meet that scale.”
Full study: Accurate, scalable, and fully automated inference of species trees from raw genome assemblies using ROADIES
This work is supported by an Amazon Research Award (Fall 2022 Call for Proposals), NIH grant 1R35GM142725, and funding from the Hellman Fellowship.
END
A fully automated tool for species tree inference
2025-05-05
ELSE PRESS RELEASES FROM THIS DATE:
Text-to-video AI blossoms with new metamorphic video capabilities
2025-05-05
While text-to-video artificial intelligence models like OpenAI’s Sora are rapidly metamorphosing in front of our eyes, they have struggled to produce metamorphic videos. Simulating a tree sprouting or a flower blooming is harder for AI systems than generating other types of videos because it requires the knowledge of the physical world and can vary widely.
But now, these models have taken an evolutionary step.
Computer scientists at the University of Rochester, Peking University, University of California, Santa Cruz, and National University of Singapore ...
Using age, sex, and race-specific standards could reclassify many thyroid disease diagnoses
2025-05-05
Embargoed for release until 5:00 p.m. ET on Monday 5 May 2025
Follow @Annalsofim on X, Facebook, Instagram, threads, and Linkedin
Below please find summaries of new articles that will be published in the next issue of Annals of Internal Medicine. The summaries are not intended to substitute for the full articles as a source of information. This information is under strict embargo and by taking it into possession, media representatives are committing to the terms of the embargo not only on their own behalf, but also on ...
A Big Data approach for battery electrolytes
2025-05-05
Discovering new, powerful electrolytes is one of the major bottlenecks for designing next-generation batteries for electric vehicles, phones, laptops and grid-scale energy storage.
The most stable electrolytes are not always the most conductive. The most efficient batteries are not always the most stable. And so on.
“The electrodes have to satisfy very different properties at the same time. They always conflict with each other,” said Ritesh Kumar, an Eric and Wendy Schimdt AI in Science Postdoctoral Fellow working in the Amanchukwu Lab at the University of Chicago Pritzker School of ...
Moffitt study finds structural barriers may prevent cancer care for people living with HIV
2025-05-05
TAMPA, Fla. (May 5, 2025) — People living with HIV are less likely to receive potentially lifesaving cancer treatment if they live in communities with lower income levels and educational attainment, according to a new national study led by researchers from Moffitt Cancer Center.
In the study, published in Cancer, researchers looked at cancer treatment records for more than 31,000 adults with HIV who were diagnosed with one of 14 common cancers between 2004 and 2020. They found that 16.5% of them did not receive the recommended first line curative treatment for ...
Min proteins for max efficiency during cell division
2025-05-05
The Min protein system prevents abnormal cell division in bacteria by forming oscillating patterns between the ends of a cell (“poles”). Despite decades of theoretical work, predicting the protein concentrations at which oscillations start and whether cells can maintain them under different conditions has been a challenge. Understanding these thresholds is important because they reveal how efficient this self-organizing system is in guiding division to the right place.
UC San Diego researchers have engineered ...
How tiny particles coordinate energy transfer inside cells uncovered
2025-05-05
Protons are the basis of bioenergetics. We know them, in our everyday life, from the pH values we see on various soaps and lotions. But the ability to move them through biological systems is essential for life. A new study shows for the first time that proton transfer is directly influenced by the spin of electrons, when measured in chiral biological environments such as proteins. In other words, proton movement in living systems is not purely chemical; it is also a quantum process involving electron spin and molecular chirality. The quantum process directly affects ...
Gorilla study reveals complex pros and cons of friendship
2025-05-05
Friendship comes with complex pros and cons – possibly explaining why some individuals are less sociable, according to a new study of gorillas.
Scientists examined over 20 years of data on 164 wild mountain gorillas, to see how their social lives affected their health.
Costs and benefits changed depending on the size of gorilla groups, and differed for males and females.
For example, friendly females in small groups didn’t get ill very often but had fewer offspring – while those in large groups got ill more but had higher birth rates.
Meanwhile, males with strong social bonds tended to get ill more – ...
Ancient Andes society used hallucinogens to strengthen social order
2025-05-05
Two thousand years before the Inca empire dominated the Andes, a lesser-known society known as the Chavín Phenomenon shared common art, architecture, and materials throughout modern-day Peru. Through agricultural innovations, craft production, and trade, Chavín shaped a growing social order and laid the foundations for hierarchical society among the high peaks.
But one of their most powerful tools wasn’t farming. It was access to altered states of consciousness.
That’s according to a new study that uncovered the earliest-known direct evidence of the use of psychoactive plants in the Peruvian Andes. A team ...
Biological ‘clocks’ key to muscle health and accelerated ageing in shift workers
2025-05-05
Muscle cells contain their own circadian clocks and disrupting them with shift work can have a profound impact on ageing, according to new research.
A study published today in Proceedings of the National Academy of Sciences (PNAS) contributes to the growing evidence of the damage shift work has on health.
The King’s College London team revealed how muscle cells have an intrinsic timekeeping mechanism that regulates protein turnover, modulating muscle growth and function. At night, the muscle clock activates the breakdown of defective proteins, replenishing muscles while the body rests.
Altering this intrinsic ...
Physical cloaking works like a disappearing act for structural defects
2025-05-05
Whether designing a window in an airliner or a cable conduit for an engine, manufacturers devote a lot of effort to reinforcing openings for structural integrity. But the reinforcement is rarely perfect and often creates structural weaknesses elsewhere.
Now, engineers at Princeton and Georgia Institute of Technology have developed a technique that can maintain structural integrity by essentially hiding the opening from the surrounding forces. Rather than reinforcing the opening to protect against a few select forces, the new approach reorganizes nearly any set of forces that could affect the surrounding material to avoid the opening.
In a May 5 article in the ...