Teaching large language models how to absorb new knowledge

With a new method developed at MIT, an LLM behaves more like a student, writing notes that it studies to memorize new information.

2025-11-18

(Press-News.org) CAMBRIDGE, MA -- In an MIT classroom, a professor lectures while students diligently write down notes they will reread later to study and internalize key information ahead of an exam.

Humans know how to learn new information, but large language models can’t do this in the same way. Once a fully trained LLM has been deployed, its “brain” is static and can’t permanently adapt itself to new knowledge.

This means that if a user tells an LLM something important today, it won’t remember that information the next time this person starts a new conversation with the chatbot.

Now, a new approach developed by MIT researchers enables LLMs to update themselves in a way that permanently internalizes new information. Just like a student, the LLM generates its own study sheets from a user’s input, which it uses to memorize the information by updating its inner workings.

The model generates multiple self-edits to learn from one input, then applies each one to see which improves its performance the most. This trial-and-error process teaches the model the best way to train itself.

The researchers found this approach improved the accuracy of LLMs at question-answering and pattern-recognition tasks, and it enabled a small model to outperform much larger LLMs.

While there are still limitations that must be overcome, the technique could someday help artificial intelligence agents consistently adapt to new tasks and achieve changing goals in evolving environments.

“Just like humans, complex AI systems can’t remain static for their entire lifetimes. These LLMs are not deployed in static environments. They are constantly facing new inputs from users. We want to make a model that is a bit more human-like — one that can keep improving itself,” says Jyothish Pari, an MIT graduate student and co-lead author of a paper on this technique.

He is joined on the paper by co-lead author Adam Zweiger, an MIT undergraduate; graduate students Han Guo and Ekin Akyürek; and senior authors Yoon Kim, an assistant professor in the Department of Electrical Engineering and Computer Science (EECS) and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL), and Pulkit Agrawal, an assistant professor in EECS and member of CSAIL. The research will be presented at the Conference on Neural Information Processing Systems.

Teaching the model to learn

LLMs are neural network models that have billions of parameters, called weights, that contain the model’s knowledge and process inputs to make predictions. During training, the model adapts these weights to learn new information contained in its training data.

But once it is deployed, the weights are static and can’t be permanently updated anymore.

However, LLMs are very good at a process called in-context learning, in which a trained model learns a new task by seeing a few examples. These examples guide the model’s responses, but the knowledge disappears before the next conversation.

The MIT researchers wanted to leverage a model’s powerful in-context learning capabilities to teach it how to permanently update its weights when it encounters new knowledge.

The framework they developed, called SEAL for “self-adapting LLMs,” enables an LLM to generate new synthetic data based on an input, and then determine the best way to adapt itself and learn from that synthetic data. Each piece of synthetic data is a self-edit the model can apply.

In the case of language, the LLM creates synthetic data by rewriting the information, and its implications, in an input passage. This is similar to how students make study sheets by rewriting and summarizing original lecture content.

The LLM does this multiple times, then quizzes itself on each self-edit to see which led to the biggest boost in performance on a downstream task like question answering. It uses a trial-and-error method known as reinforcement learning, where it receives a reward for the greatest performance boost.

Then the model memorizes the best study sheet by updating its weights to internalize the information in that self-edit.

“Our hope is that the model will learn to make the best kind of study sheet — one that is the right length and has the proper diversity of information — such that updating the model based on it leads to a better model,” Zweiger explains.

Choosing the best method

Their framework also allows the model to choose the way it wants to learn the information. For instance, the model can select the synthetic data it wants to use, the rate at which it learns, and how many iterations it wants to train on.

In this case, not only does the model generate its own training data, but it also configures the optimization that applies that self-edit to its weights.

“As humans, we know how we learn best. We want to grant that same ability to large language models. By providing the model with the ability to control how it digests this information, it can figure out the best way to parse all the data that are coming in,” Pari says.

SEAL outperformed several baseline methods across a range of tasks, including learning a new skill from a few examples and incorporating knowledge from a text passage. On question answering, SEAL improved model accuracy by nearly 15 percent and on some skill-learning tasks, it boosted the success rate by more than 50 percent.

But one limitation of this approach is a problem called catastrophic forgetting: As the model repeatedly adapts to new information, its performance on earlier tasks slowly declines.

The researchers plan to mitigate catastrophic forgetting in future work. They also want to apply this technique in a multi-agent setting where several LLMs train each other.

“One of the key barriers to LLMs that can do meaningful scientific research is their inability to update themselves based on their interactions with new information. Though fully deployed self-adapting models are still far off, we hope systems able to learn this way could eventually overcome this and help advance science,” Zweiger says.

###

This work is supported, in part, by the U.S. Army Research Office, the U.S. Air Force AI Accelerator, the Stevens Fund for MIT UROP, and the MIT-IBM Watson AI Lab.

END

ELSE PRESS RELEASES FROM THIS DATE:

Milestone on the road to the ‘quantum internet’

2025-11-18

Everyday life on the internet is insecure. Hackers can break into bank accounts or steal digital identities. Driven by AI, attacks are becoming increasingly sophisticated. Quantum cryptography promises more effective protection. It makes communication secure against eavesdropping by relying on the laws of quantum physics. However, the path toward a quantum internet is still fraught with technical hurdles. Researchers at the Institute of Semiconductor Optics and Functional Interfaces (IHFG) at the University of Stuttgart have now made a decisive breakthrough in one of the most technically challenging components, the ‘quantum repeater’. They report their results in Nature Communications ...

Blink to the beat

2025-11-18

Yi Du and colleagues from the Chinese Academy of Sciences published an article in the open access journal PLOS Biology on November 18th detailing their findings about a new way our bodies naturally respond to music. Given a steady beat, our eyes blink in synchrony. The neurological process that helps us move with the music is known as auditory-motor synchronization. This describes the way you tap your foot along with the radio or bob your head at a concert, or why some runners listen to songs with a specific number of beats per minute ...

Even low-intensity smoking increases risk of heart attack and death

2025-11-18

An analysis of data from almost two dozen long-term studies finds that even low-intensity smokers have a substantially higher risk of heart disease and death compared to people who never smoked, even years after they quit. Michael Blaha of the Johns Hopkins Ciccarone Center for Prevention of Cardiovascular Disease, USA, and colleagues report these findings November 18th in the open-access journal PLOS Medicine. Previous research has shown that smoking cigarettes increases a person’s risk of developing cardiovascular disease, but the exact relationship between how heavily a ...

Research on intelligent analysis method for dynamic response of onshore wind turbines

2025-11-18

Researchers have developed a high-fidelity 13-degree-of-freedom nonlinear model and an intelligent algorithm for wind turbine dynamic analysis. This framework accurately captures complex tower-blade interactions, including often-neglected torsional effects, achieving a remarkable agreement with high-fidelity benchmarks. Published in Smart Construction, this work provides a powerful and efficient tool for structural assessment and future optimization of large-scale wind energy systems. The global push for sustainable energy has cemented wind power's role in the renewable transition. However, designing safe and cost-effective ...

Type 1 diabetes cured in mice with gentle blood stem-cell and pancreatic islet transplant

2025-11-18

A combination blood stem cell and pancreatic islet cell transplant from an immunologically mismatched donor completely prevented or cured Type 1 diabetes in mice in a study by Stanford Medicine researchers. Type 1 diabetes arises when the immune system mistakenly destroys insulin-producing islet cells in the pancreas. None of the animals developed graft-versus-host disease — in which the immune system arising from the donated blood stem cells attacks healthy tissue in the recipient — and the destruction of islet cells by the native host immune system was halted. After the transplants, the animals did not require the use of the immune suppressive drugs ...

Serida sequences the first complete genome of the Faba Granja Asturiana, a key advance for its genetic improvement and conservation

2025-11-18

Researchers from the Plant Genetics team of the Regional Service for Agri-Food Research and Development of the Principality of Asturias (Serida) have just published a first version of the genome of the Faba Granja Asturiana variety. This advance is key for the genetic improvement and conservation of one of Asturias’ most emblematic legumes. The work has been published in the journal Data in Brief under the title “Chromosome-level dataset from de novo assembly of a Fabada common bean genotype using Illumina and PacBio ...

New clues reveal how gestational diabetes affects offspring

2025-11-18

Gestational diabetes can cause a multitude of complications in the offspring, but to date, the reasons are incompletely understood. A new study, exploring a foundational step in the process of building proteins from genetic material, called splicing, reveals that this process is affected, altering how the placenta reads and processes genetic instructions. Researchers found that in pregnancies affected by gestational diabetes, hundreds of genetic messages are assembled incorrectly, potentially disrupting how the placenta functions. ...

Study finds longer, more consistent addiction medication use among youth sharply lowers risk of overdose, hospitalization

2025-11-18

KEY TAKEAWAYS Among 11,600 youth in Massachusetts who started buprenorphine, only 1 in 4 maintained high adherence for a full year Those who remained adherent for 12 months had almost half the risk of overdose, and fewer emergency department visits and hospitalizations, compared with those who discontinued early Findings suggest that longer, more consistent treatment could be lifesaving for youth amid the ongoing fentanyl crisis New research from Mass General Brigham finds that adolescents and young adults who ...

Combating climate change with better semiconductor manufacturing

2025-11-18

WASHINGTON, Nov. 18, 2025 — The average global temperature has risen by 1.5 C since the pre-industrial era due to climate change, and it is poised to continue increasing. In response, the Intergovernmental Panel on Climate Change has developed the Global Warming Potential (GWP) metric, a unit of measurement that compares a specific gas’s contribution to climate change to that of carbon dioxide. Nitrogen trifluoride (NF3) is particularly bad, with a GWP about 17,000 times higher than carbon dioxide. But NF3 is critical in the semiconductor industry for etching and cleaning, and its use has increased more than twentyfold over the past 30 years. Though NF3 is often viewed as ...

Evaluation of a state-level incentive program to improve diet

2025-11-18

About The Study: In this cohort study of Supplemental Nutrition Assistance Program (SNAP) participants, the 50% incentive, automatic enrollment in the Eat Well, Be Well program, the first state-level SNAP fruit and vegetable incentive program launched in Rhode Island, was not associated with significant relative changes in fruit and vegetable intake, but was associated with benefits among participants already consuming more fruits and vegetables. Enhanced implementation, including broader retail ...

Teaching large language models how to absorb new knowledge

ELSE PRESS RELEASES FROM THIS DATE:

LAST 30 PRESS RELEASES: