PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Faster, smarter, more open: a new way to accelerate AI models

Algorithms developed by Weizmann Institute and Intel Labs researchers enable AI developers around the world to combine the power of different AI models “thinking” as one

2025-07-16
(Press-News.org) Just as people from different countries speak different languages, AI models also create various internal “languages” – a unique set of tokens understood only by each model. Until recently, there was no way for models developed by different companies to communicate directly, collaborate or combine their strengths to improve performance. This week, at the International Conference on Machine Learning (ICML) in Vancouver, Canada, scientists from the Weizmann Institute of Science and Intel Labs are presenting a new set of algorithms that overcome this barrier, enabling users to benefit from combined computational power of AI models working together. The new algorithms, already available to millions of AI developers around the world, speed up the performance of large language models (LLMs) – today’s leading models of generative AI – by 1.5 times, on average.

LLMs, such as ChatGPT and Gemini, are powerful tools, but they come with significant drawbacks: They are slow and consume large amounts of computing power. In 2022, major tech companies realized that AI models, like people, could benefit from collaboration and division of labor. This led to the development of a method called speculative decoding, in which a small, fast model, possessing relatively limited knowledge, makes a first guess while answering a user’s query, and a larger, more powerful but slower model reviews and corrects the answer if needed. Speculative decoding was quickly adopted by tech giants because it maintains 100-percent accuracy – unlike most acceleration techniques, which reduce output quality. But it had one big limitation: Both models had to “speak” the exact same digital language, which meant that models developed by different companies could not be combined.

“Tech giants adopted speculative decoding, benefiting from faster performance and saving billions of dollars a year in cost of processing power, but they were the only ones to have access to small, faster models that speak the same language as larger models,” explains Nadav Timor, a PhD student in Prof. David Harel’s research team in Weizmann’s Computer Science and Applied Mathematics Department, who led the new development. “In contrast, a startup seeking to benefit from speculative decoding had to train its own small model that matched the language of the big one, and that takes a great deal of expertise and costly computational resources.”

The new algorithms developed by Weizmann and Intel researchers allow developers to pair any small model with any large model, causing them to work as a team. To overcome the language barrier, the researchers came up with two solutions.


First, they designed an algorithm that allows an LLM to translate its output from its internal token language into a shared format that all models can understand. Second, they created another algorithm that prompts such models to mainly rely in their collaborative work on tokens that have the same meaning across models, similarly to words like “banana” or “internet” that are nearly identical across human languages.

“At first, we worried that too much information would be ‘lost in translation’ and that different models wouldn’t be able to collaborate effectively,” says Timor. “But we were wrong. Our algorithms speed up the performance of LLMs by up to 2.8 times, leading to massive savings in spending on processing power.”

The significance of this research has been recognized by ICML organizers, who selected the study for public presentation – a distinction granted to only about 1 percent of the 15,000 submissions received this year. “We have solved a core inefficiency in generative AI,” says Oren Pereg, a senior researcher at Intel Labs and co-author of the study. “This isn’t just a theoretical improvement; these are practical tools that are already helping developers build faster and smarter applications.” 

In the past several months, the team released their algorithms on the open-source AI platform Hugging Face Transformers, making them freely available to developers around the world. The algorithms have since become part of standard tools for running efficient AI processes.

“This new development is especially important for edge devices, from phones and drones to autonomous cars, which must rely on limited computing power when not connected to the internet,” Timor adds. “Imagine, for example, a self-driving car that is guided by an AI model. In this case, a faster model can make the difference between a safe decision and a dangerous error.”

Also participating in the study were Dr. Jonathan Mamou, Daniel Korat, Moshe Berchansky and Moshe Wasserblat from Intel Labs and Gaurav Jain from d-Matrix.

 

Prof. David Harel is the incumbent of the William Sussman Professorial Chair of Mathematics.

END


ELSE PRESS RELEASES FROM THIS DATE:

What does it cost an animal to fight?

2025-07-16
How do animals decide when to fight and when to walk, fly, slither, or swim away? Most research on animal conflict has focused on the short-term costs of single interactions, but a pair of behavioral ecologists argue that these one-time events might paint an incomplete picture. In an opinion paper publishing July 16 in the Cell Press journal Trends in Ecology & Evolution, the researchers say that to really understand the consequences of animal conflict, we need to also consider its long-term and cumulative impact on an individual’s longevity and reproduction.  “By linking individual contests to lifetime reproductive success, we ...

Discovery could battle Alzheimer’s by boosting blood flow to brain

2025-07-16
New University of Virginia School of Medicine research suggests an unexpected way doctors may be able to improve blood flow to the brain to battle Alzheimer’s and other neurodegenerative diseases. Scientists led by Ukpong B. Eyo, PhD, of UVA’s Department of Neuroscience, found that immune cells called microglia play an essential role in determining how well tiny capillaries deliver blood and essential nourishment to our brains. The scientists believe problems with these microglia could be contributing to failing brain health, and targeting them could help us prevent or reverse memory-stealing diseases caused or worsened by lack of adequate blood flow. This could include Alzheimer’s, ...

New antibody selectively targets immune cells that suppress anti-tumor responses

2025-07-16
“Taken together, our studies suggest that 2B010 represents an anti-CD25 mAb with unique properties in that it deleted Treg from an inflammatory environment (GVHD) as well as from the TME.” BUFFALO, NY – July 16, 2025 – A new research paper was published in Volume 16 of Oncotarget on July 9, 2025, titled “A novel anti-human CD25 mAb with preferential reactivity to activated T regulatory cells depletes them from the tumor microenvironment.” In this study, researchers from the National Institute of Allergy and Infectious Diseases, led by first author Maja ...

OHSU scientists develop tool that improves tissue cancer analysis

2025-07-16
Researchers have developed a powerful new tool that makes it easier to study the mix of cell types in human tissue, which is crucial for understanding diseases such as cancer. Developed by researchers at Oregon Health & Science University’s Knight Cancer Institute, the tool, dubbed OmicsTweezer, uses advanced machine learning techniques to analyze biological data at a scale large enough to estimate the composition of cell types in a sample of tissue that may be taken from a biopsy. This process allows scientists to map the cellular makeup of tumors and surrounding tissues — an area ...

The 2025 World Cultural Council’s award winner is announced

2025-07-16
The 2025 World Cultural Council’s award winner is announced The winner of the 2025 “Albert Einstein” World Award of Science is Professor Mercouri G. Kanatzidis, Charles E. and Emma H. Morrison Professor in the Department of Chemistry and the Department of Materials Science and Engineering at Northwestern University, USA. He is also a Senior Scientist at Argonne National Laboratory. Professor Kanatzidis is recognized for his groundbreaking contributions as a pioneer in shaping the field solar photovoltaic materials through his seminal work on halide perovskite semiconductors. He has made fundamental contributions for creating materials enabling ...

Stephenson Global Scholar Grants Program awards $5.3 million to drive breakthroughs in pancreatic cancer research

2025-07-16
The significant philanthropic support comes at a time of uncertainty for federal research funding The grants will support new approaches to the deadliest cancer, from novel early detection methods, using AI to identify those with higher risk, and new immunotherapy treatments LOS ANGELES, July 16, 2025 — The Stephenson Global Pancreatic Cancer Research Institute and its partner City of Hope, one of the country’s largest and most advanced cancer research and treatment organizations, today announced the six inaugural recipients of the prestigious Stephenson Scholar Grants, awarding $5.25 million to support high-impact research aimed at transforming the understanding, ...

A statement from the Global Virus Network (GVN) on the rapidly escalating measles crisis in the U.S. and worldwide

2025-07-16
Tampa, FL, USA - The Global Virus Network (GVN), a coalition of leading human and animal virologists from 80+ Centers of Excellence and Affiliates in more than 40 countries, is sounding the alarm over a sharp resurgence of measles cases in the United States and globally. This resurgence, fueled by falling vaccination rates, threatens to erode decades of public health progress. Measles is one of the most contagious viruses known to humans and is entirely preventable through routine vaccination. The U.S. is now experiencing its highest ...

Restored wetlands reap benefits for climate, drought-resilience after just one year: study

2025-07-16
Reviving floodplain wetlands slashes carbon emissions by 39% and restores critical ecosystem functions in one year – without the methane spike typically seen in restored peatlands, a new study has found. Peatlands are known as top carbon sinks, but can produce up to 530% more methane after restoration, potentially offsetting short-term climate benefits. Whereas floodplain, or riparian wetlands, which comprise over half of global wetlands, are often overlooked due to their lower carbon storage. Now a new study in the Journal of Environmental ...

PPPL’s Jack Berkery receives Fulbright Specialist award to share research on spherical tokamaks

2025-07-16
In a field where collaboration is key to progress, Jack Berkery, a leader in U.S. fusion research, is heading to Japan as a Fulbright Specialist to help strengthen the ties that power the future of fusion energy. Berkery is the deputy director of the National Spherical Torus Experiment-Upgrade (NSTX-U) at the U.S. Department of Energy’s Princeton Plasma Physics Laboratory (PPPL). The Fulbright Specialist Program pairs specialists with select host institutions to build international partnerships.  Berkery’s two-week visit to Japan will include meetings with researchers at Kyushu University and participation ...

Survey shows GLP-1 weight-loss drugs are changing sex and dating for 50-60% of users

2025-07-16
GLP-1 weight-loss drugs are changing how people date and connect. In a nationally representative survey of 2,000 single U.S. adults (ages 18 to 91) led by the Kinsey Institute at Indiana University with DatingNews.com, GLP-1 users reported a wide range of physical, social, and psychological shifts they attributed to the drug. Among respondents, 8% reported having used a GLP-1 medication to assist with weight loss, with no significant difference difference in use between men and women. Among GLP-1 users, 59% reported at least one impact of the drug on their dating life including: 17% ...

LAST 30 PRESS RELEASES:

First-in-human trial shows promising results for DLL3-targeted antibody-drug conjugate SHR-4849 in relapsed small cell lung cancer

Ifinatamab deruxtecan demonstrates high response rate in previously treated extensive-stage small cell lung cancer: Phase 2 IDeate-Lung01 trial

Higher blood pressure in childhood linked to earlier death from heart disease in adulthood

AI helped older adults report accurate blood pressure readings at home

High blood pressure in childhood and premature cardiovascular disease mortality

Zidesamtinib shows durable responses in ROS1 TKI pre-treated NSCLC, including patients with CNS disease and ROS1 G2032R mutations

Crizotinib fails to improve disease-free survival in resected early-stage ALK+ NSCLC

Ivonescimab plus chemotherapy improves progression-free survival in patients with EGFR+ NSCLC following 3rd-generation EGFR-TKI therapy

FLAURA2 trial shows osimertinib plus chemotherapy improves overall survival in eGFR-mutated advanced NSCLC

Aumolertinib plus chemotherapy improves progression-free survival in NSCLC with EGFR and concomitant tumor suppressor genes: ACROSS 2 phase III study

New antibody-drug conjugate shows promising efficacy in EGFR-mutated NSCLC patients

Iza-Bren in combination with osimertinib shows 100% response rate in EGFR-mutated NSCLC, phase II study finds

COMPEL study shows continuing osimertinib treatment through progression with the addition of chemotherapy improves progression-free survival in EGFR-mutated NSCLC

CheckMate 77T: Nivolumab maintains quality of life and reduces symptom deterioration in resectable NSCLC

Study validates AI lung cancer risk model Sybil in predominantly Black population at urban safety-net hospital

New medication lowered hard-to-control high blood pressure in people with chronic kidney disease

Innovative oncolytic virus and immunotherapy combinations pave the way for advanced cancer treatment

New insights into energy metabolism and immune dynamics could transform head and neck cancer treatment

Pennington Biomedical’s Dr. Steven Heymsfield named LSU Boyd Professor – LSU’s highest faculty honor

Study prompts new theory of human-machine communication

New method calculates rate of gene expression to understand cell fate

Researchers quantify rate of essential evolutionary process in the ocean

Innovation Crossroads companies join forces, awarded U.S. Air Force contract

Using new blood biomarkers, USC researchers find Alzheimer’s disease trial eligibility differs among various populations

Pioneering advances in in vivo CAR T cell production

Natural medicines target tumor vascular microenvironment to inhibit cancer growth

Coral-inspired pill offers a new window into the hidden world of the gut

nTIDE September2025 Jobs Report: Employment for people with disabilities surpasses prior high

When getting a job makes you go hungry

Good vibrations could revolutionize assisted reproductive technology

[Press-News.org] Faster, smarter, more open: a new way to accelerate AI models
Algorithms developed by Weizmann Institute and Intel Labs researchers enable AI developers around the world to combine the power of different AI models “thinking” as one