Faster, smarter, more open: a new way to accelerate AI models

Algorithms developed by Weizmann Institute and Intel Labs researchers enable AI developers around the world to combine the power of different AI models “thinking” as one

2025-07-16
(Press-News.org) Just as people from different countries speak different languages, each AI model creates its own internal “language” – a unique set of tokens understood only by that model. Until recently, there was no way for models developed by different companies to communicate directly, collaborate or combine their strengths to improve performance. This week, at the International Conference on Machine Learning (ICML) in Vancouver, Canada, scientists from the Weizmann Institute of Science and Intel Labs are presenting a new set of algorithms that overcome this barrier, enabling users to benefit from the combined computational power of AI models working together. The new algorithms, already available to millions of AI developers around the world, speed up the performance of large language models (LLMs) – today’s leading models of generative AI – by 1.5 times, on average.

LLMs, such as ChatGPT and Gemini, are powerful tools, but they come with significant drawbacks: They are slow and consume large amounts of computing power. In 2022, major tech companies realized that AI models, like people, could benefit from collaboration and division of labor. This led to the development of a method called speculative decoding, in which a small, fast model, possessing relatively limited knowledge, makes a first guess while answering a user’s query, and a larger, more powerful but slower model reviews and corrects the answer if needed. Speculative decoding was quickly adopted by tech giants because it maintains 100-percent accuracy – unlike most acceleration techniques, which reduce output quality. But it had one big limitation: Both models had to “speak” the exact same digital language, which meant that models developed by different companies could not be combined.
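The draft-and-verify loop described above can be illustrated with a toy sketch. It is not the researchers' implementation: the two "models" are hypothetical scripted functions that return the next word from a fixed sentence, and the names `make_model`, `greedy_decode` and `speculative_decode` are invented for this example. The sketch assumes greedy decoding and counts each verification round as one pass of the large model, which real systems run as a single parallel pass.

```python
from typing import Callable, List, Tuple

EOS = "<eos>"
Model = Callable[[List[str]], str]  # maps a context to its next token


def make_model(script: List[str]) -> Model:
    """Hypothetical toy model: emits the scripted word for each position."""
    def model(context: List[str]) -> str:
        i = len(context)
        return script[i] if i < len(script) else EOS
    return model


def greedy_decode(model: Model, prompt: List[str], max_new: int = 16) -> List[str]:
    """Baseline: the large model generates every token by itself."""
    out = list(prompt)
    for _ in range(max_new):
        tok = model(out)
        out.append(tok)
        if tok == EOS:
            break
    return out


def speculative_decode(target: Model, draft: Model, prompt: List[str],
                       k: int = 4, max_new: int = 16) -> Tuple[List[str], int]:
    """Draft k tokens with the small model, then let the large model verify."""
    out = list(prompt)
    limit = len(prompt) + max_new
    rounds = 0  # verification rounds (one large-model pass each in a real system)
    done = False
    while not done and len(out) < limit:
        # 1. The small model guesses the next k tokens.
        proposal: List[str] = []
        for _ in range(k):
            proposal.append(draft(out + proposal))
        # 2. The large model checks the guesses in order.
        rounds += 1
        for guess in proposal:
            if len(out) >= limit:
                break
            verified = target(out)
            out.append(verified)  # only the large model's token is ever kept
            if verified == EOS:
                done = True
                break
            if verified != guess:
                break  # first wrong guess: discard the rest of the draft
    return out, rounds


# The draft model gets one word wrong ("cat" instead of "fox").
target = make_model("the quick brown fox jumps over the lazy dog".split())
draft = make_model("the quick brown cat jumps over the lazy dog".split())

if __name__ == "__main__":
    text, rounds = speculative_decode(target, draft, [], k=4)
    print(" ".join(text), "| verification rounds:", rounds)
```

Because every kept token comes from the large model, the output matches what the large model would have produced alone; the saving comes from needing fewer verification rounds than generated tokens.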

“Tech giants adopted speculative decoding, benefiting from faster performance and saving billions of dollars a year in cost of processing power, but they were the only ones to have access to small, faster models that speak the same language as larger models,” explains Nadav Timor, a PhD student in Prof. David Harel’s research team in Weizmann’s Computer Science and Applied Mathematics Department, who led the new development. “In contrast, a startup seeking to benefit from speculative decoding had to train its own small model that matched the language of the big one, and that takes a great deal of expertise and costly computational resources.”

The new algorithms developed by Weizmann and Intel researchers allow developers to pair any small model with any large model, causing them to work as a team. To overcome the language barrier, the researchers came up with two solutions.


First, they designed an algorithm that allows an LLM to translate its output from its internal token language into a shared format that all models can understand. Second, they created another algorithm that prompts such models to rely mainly, in their collaborative work, on tokens that have the same meaning across models, much like words such as “banana” or “internet” that are nearly identical across human languages.
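A rough sketch of the two ideas, under stated assumptions: the release does not specify the shared format, so plain text is assumed here, and the two vocabularies, the longest-match tokenizer and the function names (`encode`, `decode`, `translate`, `shared_tokens`) are all hypothetical. Real tokenizers work with integer IDs that differ between models; strings are used here for readability.

```python
from typing import List

# Hypothetical toy vocabularies for a small (draft) and a large (target) model.
DRAFT_VOCAB = ["ban", "ana", "inter", "net", " ", "s"]
TARGET_VOCAB = ["banana", "inter", "net", " ", "s", "b", "a", "n"]


def encode(text: str, vocab: List[str]) -> List[str]:
    """Greedy longest-match tokenization over a toy vocabulary."""
    pieces = sorted(vocab, key=len, reverse=True)
    tokens: List[str] = []
    i = 0
    while i < len(text):
        for piece in pieces:
            if text.startswith(piece, i):
                tokens.append(piece)
                i += len(piece)
                break
        else:
            raise ValueError(f"cannot tokenize {text[i:]!r}")
    return tokens


def decode(tokens: List[str]) -> str:
    """Turn tokens back into plain text (the assumed shared format)."""
    return "".join(tokens)


def translate(tokens: List[str], target_vocab: List[str]) -> List[str]:
    """Idea 1: re-express one model's tokens in another model's vocabulary."""
    return encode(decode(tokens), target_vocab)


def shared_tokens(vocab_a: List[str], vocab_b: List[str]) -> List[str]:
    """Idea 2: find the tokens both models have in common."""
    return sorted(set(vocab_a) & set(vocab_b))


if __name__ == "__main__":
    draft_output = ["ban", "ana", " ", "inter", "net"]
    print(translate(draft_output, TARGET_VOCAB))
    print(shared_tokens(DRAFT_VOCAB, TARGET_VOCAB))
```

In this toy setup, translation lets the large model read a draft written in a vocabulary it does not share, while restricting the draft to shared tokens avoids the translation step at the cost of a smaller working vocabulary.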

“At first, we worried that too much information would be ‘lost in translation’ and that different models wouldn’t be able to collaborate effectively,” says Timor. “But we were wrong. Our algorithms speed up the performance of LLMs by up to 2.8 times, leading to massive savings in spending on processing power.”

The significance of this research has been recognized by ICML organizers, who selected the study for public presentation – a distinction granted to only about 1 percent of the 15,000 submissions received this year. “We have solved a core inefficiency in generative AI,” says Oren Pereg, a senior researcher at Intel Labs and co-author of the study. “This isn’t just a theoretical improvement; these are practical tools that are already helping developers build faster and smarter applications.” 

Over the past several months, the team has released their algorithms on the open-source AI platform Hugging Face Transformers, making them freely available to developers around the world. The algorithms have since become part of standard tools for running efficient AI processes.

“This new development is especially important for edge devices, from phones and drones to autonomous cars, which must rely on limited computing power when not connected to the internet,” Timor adds. “Imagine, for example, a self-driving car that is guided by an AI model. In this case, a faster model can make the difference between a safe decision and a dangerous error.”

Also participating in the study were Dr. Jonathan Mamou, Daniel Korat, Moshe Berchansky and Moshe Wasserblat from Intel Labs and Gaurav Jain from d-Matrix.

 

Prof. David Harel is the incumbent of the William Sussman Professorial Chair of Mathematics.

END

