PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Faster, smarter, more open: a new way to accelerate AI models

Algorithms developed by Weizmann Institute and Intel Labs researchers enable AI developers around the world to combine the power of different AI models “thinking” as one

2025-07-16
(Press-News.org) Just as people from different countries speak different languages, AI models also create various internal “languages” – a unique set of tokens understood only by each model. Until recently, there was no way for models developed by different companies to communicate directly, collaborate or combine their strengths to improve performance. This week, at the International Conference on Machine Learning (ICML) in Vancouver, Canada, scientists from the Weizmann Institute of Science and Intel Labs are presenting a new set of algorithms that overcome this barrier, enabling users to benefit from combined computational power of AI models working together. The new algorithms, already available to millions of AI developers around the world, speed up the performance of large language models (LLMs) – today’s leading models of generative AI – by 1.5 times, on average.

LLMs, such as ChatGPT and Gemini, are powerful tools, but they come with significant drawbacks: They are slow and consume large amounts of computing power. In 2022, major tech companies realized that AI models, like people, could benefit from collaboration and division of labor. This led to the development of a method called speculative decoding, in which a small, fast model, possessing relatively limited knowledge, makes a first guess while answering a user’s query, and a larger, more powerful but slower model reviews and corrects the answer if needed. Speculative decoding was quickly adopted by tech giants because it maintains 100-percent accuracy – unlike most acceleration techniques, which reduce output quality. But it had one big limitation: Both models had to “speak” the exact same digital language, which meant that models developed by different companies could not be combined.

“Tech giants adopted speculative decoding, benefiting from faster performance and saving billions of dollars a year in cost of processing power, but they were the only ones to have access to small, faster models that speak the same language as larger models,” explains Nadav Timor, a PhD student in Prof. David Harel’s research team in Weizmann’s Computer Science and Applied Mathematics Department, who led the new development. “In contrast, a startup seeking to benefit from speculative decoding had to train its own small model that matched the language of the big one, and that takes a great deal of expertise and costly computational resources.”

The new algorithms developed by Weizmann and Intel researchers allow developers to pair any small model with any large model, causing them to work as a team. To overcome the language barrier, the researchers came up with two solutions.


First, they designed an algorithm that allows an LLM to translate its output from its internal token language into a shared format that all models can understand. Second, they created another algorithm that prompts such models to mainly rely in their collaborative work on tokens that have the same meaning across models, similarly to words like “banana” or “internet” that are nearly identical across human languages.

“At first, we worried that too much information would be ‘lost in translation’ and that different models wouldn’t be able to collaborate effectively,” says Timor. “But we were wrong. Our algorithms speed up the performance of LLMs by up to 2.8 times, leading to massive savings in spending on processing power.”

The significance of this research has been recognized by ICML organizers, who selected the study for public presentation – a distinction granted to only about 1 percent of the 15,000 submissions received this year. “We have solved a core inefficiency in generative AI,” says Oren Pereg, a senior researcher at Intel Labs and co-author of the study. “This isn’t just a theoretical improvement; these are practical tools that are already helping developers build faster and smarter applications.” 

In the past several months, the team released their algorithms on the open-source AI platform Hugging Face Transformers, making them freely available to developers around the world. The algorithms have since become part of standard tools for running efficient AI processes.

“This new development is especially important for edge devices, from phones and drones to autonomous cars, which must rely on limited computing power when not connected to the internet,” Timor adds. “Imagine, for example, a self-driving car that is guided by an AI model. In this case, a faster model can make the difference between a safe decision and a dangerous error.”

Also participating in the study were Dr. Jonathan Mamou, Daniel Korat, Moshe Berchansky and Moshe Wasserblat from Intel Labs and Gaurav Jain from d-Matrix.

 

Prof. David Harel is the incumbent of the William Sussman Professorial Chair of Mathematics.

END


ELSE PRESS RELEASES FROM THIS DATE:

What does it cost an animal to fight?

2025-07-16
How do animals decide when to fight and when to walk, fly, slither, or swim away? Most research on animal conflict has focused on the short-term costs of single interactions, but a pair of behavioral ecologists argue that these one-time events might paint an incomplete picture. In an opinion paper publishing July 16 in the Cell Press journal Trends in Ecology & Evolution, the researchers say that to really understand the consequences of animal conflict, we need to also consider its long-term and cumulative impact on an individual’s longevity and reproduction.  “By linking individual contests to lifetime reproductive success, we ...

Discovery could battle Alzheimer’s by boosting blood flow to brain

2025-07-16
New University of Virginia School of Medicine research suggests an unexpected way doctors may be able to improve blood flow to the brain to battle Alzheimer’s and other neurodegenerative diseases. Scientists led by Ukpong B. Eyo, PhD, of UVA’s Department of Neuroscience, found that immune cells called microglia play an essential role in determining how well tiny capillaries deliver blood and essential nourishment to our brains. The scientists believe problems with these microglia could be contributing to failing brain health, and targeting them could help us prevent or reverse memory-stealing diseases caused or worsened by lack of adequate blood flow. This could include Alzheimer’s, ...

New antibody selectively targets immune cells that suppress anti-tumor responses

2025-07-16
“Taken together, our studies suggest that 2B010 represents an anti-CD25 mAb with unique properties in that it deleted Treg from an inflammatory environment (GVHD) as well as from the TME.” BUFFALO, NY – July 16, 2025 – A new research paper was published in Volume 16 of Oncotarget on July 9, 2025, titled “A novel anti-human CD25 mAb with preferential reactivity to activated T regulatory cells depletes them from the tumor microenvironment.” In this study, researchers from the National Institute of Allergy and Infectious Diseases, led by first author Maja ...

OHSU scientists develop tool that improves tissue cancer analysis

2025-07-16
Researchers have developed a powerful new tool that makes it easier to study the mix of cell types in human tissue, which is crucial for understanding diseases such as cancer. Developed by researchers at Oregon Health & Science University’s Knight Cancer Institute, the tool, dubbed OmicsTweezer, uses advanced machine learning techniques to analyze biological data at a scale large enough to estimate the composition of cell types in a sample of tissue that may be taken from a biopsy. This process allows scientists to map the cellular makeup of tumors and surrounding tissues — an area ...

The 2025 World Cultural Council’s award winner is announced

2025-07-16
The 2025 World Cultural Council’s award winner is announced The winner of the 2025 “Albert Einstein” World Award of Science is Professor Mercouri G. Kanatzidis, Charles E. and Emma H. Morrison Professor in the Department of Chemistry and the Department of Materials Science and Engineering at Northwestern University, USA. He is also a Senior Scientist at Argonne National Laboratory. Professor Kanatzidis is recognized for his groundbreaking contributions as a pioneer in shaping the field solar photovoltaic materials through his seminal work on halide perovskite semiconductors. He has made fundamental contributions for creating materials enabling ...

Stephenson Global Scholar Grants Program awards $5.3 million to drive breakthroughs in pancreatic cancer research

2025-07-16
The significant philanthropic support comes at a time of uncertainty for federal research funding The grants will support new approaches to the deadliest cancer, from novel early detection methods, using AI to identify those with higher risk, and new immunotherapy treatments LOS ANGELES, July 16, 2025 — The Stephenson Global Pancreatic Cancer Research Institute and its partner City of Hope, one of the country’s largest and most advanced cancer research and treatment organizations, today announced the six inaugural recipients of the prestigious Stephenson Scholar Grants, awarding $5.25 million to support high-impact research aimed at transforming the understanding, ...

A statement from the Global Virus Network (GVN) on the rapidly escalating measles crisis in the U.S. and worldwide

2025-07-16
Tampa, FL, USA - The Global Virus Network (GVN), a coalition of leading human and animal virologists from 80+ Centers of Excellence and Affiliates in more than 40 countries, is sounding the alarm over a sharp resurgence of measles cases in the United States and globally. This resurgence, fueled by falling vaccination rates, threatens to erode decades of public health progress. Measles is one of the most contagious viruses known to humans and is entirely preventable through routine vaccination. The U.S. is now experiencing its highest ...

Restored wetlands reap benefits for climate, drought-resilience after just one year: study

2025-07-16
Reviving floodplain wetlands slashes carbon emissions by 39% and restores critical ecosystem functions in one year – without the methane spike typically seen in restored peatlands, a new study has found. Peatlands are known as top carbon sinks, but can produce up to 530% more methane after restoration, potentially offsetting short-term climate benefits. Whereas floodplain, or riparian wetlands, which comprise over half of global wetlands, are often overlooked due to their lower carbon storage. Now a new study in the Journal of Environmental ...

PPPL’s Jack Berkery receives Fulbright Specialist award to share research on spherical tokamaks

2025-07-16
In a field where collaboration is key to progress, Jack Berkery, a leader in U.S. fusion research, is heading to Japan as a Fulbright Specialist to help strengthen the ties that power the future of fusion energy. Berkery is the deputy director of the National Spherical Torus Experiment-Upgrade (NSTX-U) at the U.S. Department of Energy’s Princeton Plasma Physics Laboratory (PPPL). The Fulbright Specialist Program pairs specialists with select host institutions to build international partnerships.  Berkery’s two-week visit to Japan will include meetings with researchers at Kyushu University and participation ...

Survey shows GLP-1 weight-loss drugs are changing sex and dating for 50-60% of users

2025-07-16
GLP-1 weight-loss drugs are changing how people date and connect. In a nationally representative survey of 2,000 single U.S. adults (ages 18 to 91) led by the Kinsey Institute at Indiana University with DatingNews.com, GLP-1 users reported a wide range of physical, social, and psychological shifts they attributed to the drug. Among respondents, 8% reported having used a GLP-1 medication to assist with weight loss, with no significant difference difference in use between men and women. Among GLP-1 users, 59% reported at least one impact of the drug on their dating life including: 17% ...

LAST 30 PRESS RELEASES:

Root microbes could help oak trees adapt to drought

Emergency department–initiated buprenorphine for opioid use disorder

Call for action on understudied lung cancer in never-smokers

Different visual experiences give rise to different neural wiring

Wearable trackers can detect depression relapse weeks before it returns, study finds

Air pollution and the progression of physical function limitations and disability in aging adults

Historically Black college or university attendance and cognition in US Black adults

New “crucial” advance for quantum computers: researchers manage to read information stored in Majorana qubits

7,000 years of change: How humans reshaped Caribbean coral reef food chains

Virus-based therapy boosts anti-cancer immune responses to brain cancer

Ancient fish ear stones reveal modern Caribbean reefs have lost their dietary complexity

American College of Lifestyle Medicine announces updated dietary position statement for treatment and prevention of chronic disease

New findings highlight two decades of evidence supporting pecans in heart-healthy diets

Case report explores potential link between mRNA COVID-19 vaccines and cancer

Healthy versions of low-carb and low-fat diets linked to better cardiovascular and metabolic health

Low-carb and low-fat diets associated with lower heart disease risk if rich in high-quality, plant-based foods, low in animal products

ASH publishes clinical practice guidelines on frontline and relapsed/refractory management of all in adolescents and young adults

City of Hope research spotlight, January 2026

Keeping an eagle eye on carbon stored in the ocean

FAU study: Tiny worm offers clues to combat chemotherapy neurotoxicity

The ACMG Foundation 2026 Early Career Travel Award is presented to Bianca Seminotti, Ph.D.

Rural cancer patients do just as well when having surgery close to home

New biosensor technology could improve glucose monitoring

Successful press conference for Special Issue II of the JSE Himalayas Series

Hair extensions contain many more dangerous chemicals than previously thought

Elevated lead levels could flow from some US drinking water kiosks

Fragile X study uncovers brainwave biomarker bridging humans and mice

Robots that can see around corners using radio signals and AI

A non-invasive therapeutic strategy for improving bone healing in aged patients

Molecule found to drive skin cancer growth and evade immune detection

[Press-News.org] Faster, smarter, more open: a new way to accelerate AI models
Algorithms developed by Weizmann Institute and Intel Labs researchers enable AI developers around the world to combine the power of different AI models “thinking” as one