(Press-News.org) The language capabilities of today’s artificial intelligence systems are astonishing. We can now engage in natural conversations with systems like ChatGPT, Gemini, and many others, with a fluency nearly comparable to that of a human being. Yet we still know very little about the internal processes in these networks that lead to such remarkable results.
A new study published in the Journal of Statistical Mechanics: Theory and Experiment (JSTAT) reveals a piece of this mystery. It shows that when small amounts of data are used for training, neural networks initially rely on the position of words in a sentence. However, as the system is exposed to enough data, it transitions to a new strategy based on the meaning of the words. The study finds that this transition occurs abruptly, once a critical data threshold is crossed — much like a phase transition in physical systems. The findings offer valuable insights for understanding the workings of these models.
Just like a child learning to read, a neural network starts by understanding sentences based on the positions of words: depending on where words are located in a sentence, the network can infer their relationships (are they subjects, verbs, objects?). However, as the training continues — the network “keeps going to school” — a shift occurs: word meaning becomes the primary source of information.
This, the new study explains, is what happens in a simplified model of the self-attention mechanism, a core building block of transformer language models like the ones we use every day (ChatGPT, Gemini, Claude, etc.). A transformer is a neural network architecture designed to process sequences of data, such as text, and it forms the backbone of many modern language models. Transformers specialize in understanding relationships within a sequence and use the self-attention mechanism to assess the importance of each word relative to the others.
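To make the idea of "assessing the importance of each word relative to the others" concrete, here is a minimal sketch of single-head dot-product self-attention in plain Python. It is an illustration only, not the solvable model analyzed in the study: the function names, the toy two-dimensional token vectors, and the identity projection matrices are all assumptions chosen for readability.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(a):
    """Swap rows and columns of a matrix given as a list of rows."""
    return [list(col) for col in zip(*a)]


def self_attention(x, w_q, w_k, w_v):
    """Single-head dot-product self-attention.

    x is a sequence of token vectors (one row per token); w_q, w_k and w_v
    are the query, key and value projection matrices.
    Returns the attended outputs and the attention weights.
    """
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    scale = math.sqrt(len(k[0]))  # usual 1/sqrt(d_k) scaling
    scores = [[s / scale for s in row] for row in matmul(q, transpose(k))]
    weights = [softmax(row) for row in scores]  # each row sums to 1
    return matmul(weights, v), weights


# Hypothetical toy input: three tokens with 2-dimensional vectors.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]  # assumed projections, for illustration
outputs, weights = self_attention(tokens, identity, identity, identity)
```

Each row of `weights` says how strongly one token attends to every token in the sequence, itself included. In a trained network the projection matrices are learned from data, and that learning is what determines whether these weights end up reflecting position or meaning.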
“To assess relationships between words,” explains Hugo Cui, a postdoctoral researcher at Harvard University and first author of the study, “the network can use two strategies, one of which is to exploit the positions of words.” In a language like English, for example, the subject typically precedes the verb, which in turn precedes the object. “Mary eats the apple” is a simple example of this sequence.
“This is the first strategy that spontaneously emerges when the network is trained,” Cui explains. “However, in our study, we observed that if training continues and the network receives enough data, at a certain point — once a threshold is crossed — the strategy abruptly shifts: the network starts relying on meaning instead.”
“When we designed this work, we simply wanted to study which strategies, or mix of strategies, the networks would adopt. But what we found was somewhat surprising: below a certain threshold, the network relied exclusively on position, while above it, only on meaning.”
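The difference between the two strategies can be shown with a toy example. The sketch below is not the model from the study. It assumes hand-picked word and position vectors (all hypothetical) and compares attention weights computed from positions alone with weights computed from word vectors alone.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(vectors):
    """Row-wise softmax of all pairwise dot products between the vectors."""
    scores = [[sum(a * b for a, b in zip(u, v)) for v in vectors] for u in vectors]
    return [softmax(row) for row in scores]


# Hypothetical embeddings, chosen by hand for illustration only.
WORD_VECTORS = {"Mary": [1.0, 0.0], "eats": [0.8, 0.6], "apple": [0.0, 1.0]}
POSITION_VECTORS = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # one per slot


def positional_attention(words):
    """Attention that looks only at where each word sits in the sentence."""
    return attention_weights([POSITION_VECTORS[i] for i in range(len(words))])


def semantic_attention(words):
    """Attention that looks only at which word occupies each slot."""
    return attention_weights([WORD_VECTORS[w] for w in words])


sentence = ["Mary", "eats", "apple"]
shuffled = ["apple", "eats", "Mary"]

# Purely positional weights ignore the words, so shuffling changes nothing.
same_positional = positional_attention(sentence) == positional_attention(shuffled)
# Purely semantic weights follow the words, so shuffling changes them.
same_semantic = semantic_attention(sentence) == semantic_attention(shuffled)
```

In this toy setting `same_positional` is true and `same_semantic` is false: a purely positional mechanism cannot tell the two sentences apart, while a purely semantic one tracks the words wherever they move.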
Cui describes this shift as a phase transition, borrowing a concept from physics. Statistical physics studies systems composed of enormous numbers of particles (like atoms or molecules) by describing their collective behavior statistically. Similarly, neural networks — the foundation of these AI systems — are composed of large numbers of “nodes,” or neurons (named by analogy to the human brain), each connected to many others and performing simple operations. The system’s intelligence emerges from the interaction of these neurons, a phenomenon that can be described with statistical methods.
This is why we can speak of an abrupt change in network behavior as a phase transition, similar to how water, under certain conditions of temperature and pressure, changes from liquid to gas.
“Understanding from a theoretical viewpoint that the strategy shift happens in this manner is important,” Cui emphasizes. “Our networks are simplified compared to the complex models people interact with daily, but they can give us hints to begin to understand the conditions that cause a model to stabilize on one strategy or another. This theoretical knowledge could hopefully be used in the future to make the use of neural networks more efficient, and safer.”
The research by Hugo Cui, Freya Behrens, Florent Krzakala, and Lenka Zdeborová, titled “A Phase Transition between Positional and Semantic Learning in a Solvable Model of Dot-Product Attention”, is published in JSTAT as part of the Machine Learning 2025 special issue and is included in the proceedings of the NeurIPS 2024 conference.
From position to meaning: how AI learns to read
A study in JSTAT describes the sharp shift in text comprehension strategies during neural network training
2025-07-07