CAMBRIDGE, MA – A home robot trained to perform household tasks in a factory may fail to effectively scrub the sink or take out the trash when deployed in a user’s kitchen, since this new environment differs from its training space.
To avoid this, engineers often try to match the simulated training environment as closely as possible with the real world where the agent will be deployed.
However, researchers from MIT and elsewhere have now found that, despite this conventional wisdom, sometimes training in a completely different environment yields a better-performing artificial intelligence agent.
Their results indicate that, in some situations, training a simulated AI agent in a world with less uncertainty, or “noise,” enabled it to perform better than a competing AI agent trained in the same, noisy world they used to test both agents.
The researchers call this unexpected phenomenon the indoor training effect.
“If we learn to play tennis in an indoor environment where there is no noise, we might be able to more easily master different shots. Then, if we move to a noisier environment, like a windy tennis court, we could have a higher probability of playing tennis well than if we started learning in the windy environment,” explains Serena Bono, a research assistant in the MIT Media Lab and lead author of a paper on the indoor training effect.
The researchers studied this phenomenon by training AI agents to play Atari games, which they modified by adding some unpredictability. They were surprised to find that the indoor training effect consistently occurred across Atari games and game variations.
They hope these results fuel additional research toward developing better training methods for AI agents.
“This is an entirely new axis to think about. Rather than trying to match the training and testing environments, we may be able to construct simulated environments where an AI agent learns even better,” adds co-author Spandan Madan, a graduate student at Harvard University.
Bono and Madan are joined on the paper by Ishaan Grover, an MIT graduate student; Mao Yasueda, a graduate student at Yale University; Cynthia Breazeal, professor of media arts and sciences and leader of the Personal Robotics Group in the MIT Media Lab; Hanspeter Pfister, the An Wang Professor of Computer Science at Harvard; and Gabriel Kreiman, a professor at Harvard Medical School. The research will be presented at the Association for the Advancement of Artificial Intelligence Conference.
Training troubles
The researchers set out to explore why reinforcement learning agents tend to perform so poorly when tested in environments that differ from their training space.
Reinforcement learning is a trial-and-error method in which the agent explores a training space and learns to take actions that maximize its reward.
The team developed a technique to explicitly add a certain amount of noise to one element of the reinforcement learning problem: the transition function. The transition function defines the probability that an agent will move from one state to another, based on the action it chooses.
If the agent is playing Pac-Man, a transition function might define the probability that ghosts on the game board will move up, down, left, or right. In standard reinforcement learning, the AI would be trained and tested using the same transition function.
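To make the idea of a noisy transition function concrete, here is a minimal sketch in Python. It treats a ghost's move as a probability distribution over four directions and blends that distribution with a uniform one. The move names, the `noisy_transition` helper, and the blending rule are all assumptions made for illustration; the release does not say how the researchers implemented their noise.

```python
import random

def noisy_transition(base_probs, noise):
    """Blend a base move distribution with a uniform one.

    noise=0.0 leaves the base distribution unchanged; noise=1.0 makes
    every move equally likely. This mixing rule is an illustrative
    assumption, not the scheme described in the paper.
    """
    if not 0.0 <= noise <= 1.0:
        raise ValueError("noise must be in [0, 1]")
    uniform = 1.0 / len(base_probs)
    return {m: (1.0 - noise) * p + noise * uniform for m, p in base_probs.items()}

def sample_move(probs, rng):
    """Draw one move according to the given probabilities."""
    moves = list(probs)
    weights = [probs[m] for m in moves]
    return rng.choices(moves, weights=weights, k=1)[0]

# Hypothetical ghost that usually moves up.
base = {"up": 0.7, "down": 0.1, "left": 0.1, "right": 0.1}
noisy = noisy_transition(base, noise=0.4)

rng = random.Random(0)
move = sample_move(noisy, rng)
```

Under this assumed rule, training and testing "with the same transition function" would mean calling `noisy_transition` with the same `noise` value in both phases.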
Following this conventional approach, the researchers added noise to the transition function during both training and testing and, as expected, it hurt the agent’s Pac-Man performance.
But when the researchers trained the agent with a noise-free Pac-Man game, then tested it in an environment where they injected noise into the transition function, it performed better than an agent trained on the noisy game.
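The comparison described above can be sketched with a toy experiment. The code below is not the researchers' setup: it swaps Atari games for a hypothetical six-cell corridor, uses plain tabular Q-learning, and models noise as a chance that the chosen action is replaced by a random one. Every name and parameter is an assumption. It shows only the shape of the protocol (train one agent without noise, train another with noise, test both in the noisy world) and makes no claim about which agent scores higher.

```python
import random

N_STATES = 6          # corridor cells 0..5; cell 5 is the goal
ACTIONS = (-1, +1)    # step left or step right
MAX_STEPS = 50        # episode length cap

def step(state, action_idx, noise, rng):
    """One transition. With probability `noise` the chosen action is
    replaced by a random one (a stand-in for transition noise)."""
    if rng.random() < noise:
        action_idx = rng.randrange(len(ACTIONS))
    nxt = min(max(state + ACTIONS[action_idx], 0), N_STATES - 1)
    done = nxt == N_STATES - 1
    reward = 1.0 if done else -0.01
    return nxt, reward, done

def greedy(q_row, rng):
    """Pick the highest-valued action, breaking ties at random."""
    best = max(q_row)
    return rng.choice([i for i, v in enumerate(q_row) if v == best])

def train(noise, episodes, seed, alpha=0.5, gamma=0.95, epsilon=0.1):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        s = 0
        for _ in range(MAX_STEPS):
            a = rng.randrange(2) if rng.random() < epsilon else greedy(q[s], rng)
            s2, r, done = step(s, a, noise, rng)
            target = r if done else r + gamma * max(q[s2])
            q[s][a] += alpha * (target - q[s][a])
            s = s2
            if done:
                break
    return q

def evaluate(q, noise, episodes, seed):
    """Average return of the greedy policy under the given noise level."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(episodes):
        s = 0
        for _ in range(MAX_STEPS):
            s, r, done = step(s, greedy(q[s], rng), noise, rng)
            total += r
            if done:
                break
    return total / episodes

TEST_NOISE = 0.3
q_clean = train(noise=0.0, episodes=300, seed=0)         # "indoor" agent
q_noisy = train(noise=TEST_NOISE, episodes=300, seed=0)  # conventional agent
score_clean = evaluate(q_clean, TEST_NOISE, episodes=200, seed=1)
score_noisy = evaluate(q_noisy, TEST_NOISE, episodes=200, seed=1)
print(f"trained clean, tested noisy: {score_clean:.3f}")
print(f"trained noisy, tested noisy: {score_noisy:.3f}")
```

A task this small may not reproduce the effect at all; the point is only that both agents are scored in the same noisy test environment, so any difference comes from the training conditions.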
“The rule of thumb is that you should try to capture the deployment condition’s transition function as well as you can during training to get the most bang for your buck. We really tested this insight to death because we couldn’t believe it ourselves,” Madan says.
Injecting varying amounts of noise into the transition function let the researchers test many environments, but it didn’t create realistic games. The more noise they injected into Pac-Man, the more likely ghosts would randomly teleport to different squares.
To see if the indoor training effect occurred in normal Pac-Man games, they adjusted underlying probabilities so ghosts moved normally but were more likely to move up and down, rather than left and right. AI agents trained in noise-free environments still performed better in these realistic games.
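The second kind of change, where ghosts still move normally but favor some directions, can be sketched as a reweighting of the move probabilities. The `bias_vertical` helper and the `factor` parameter below are hypothetical; the release does not give the probabilities the researchers used.

```python
def bias_vertical(probs, factor):
    """Scale the 'up' and 'down' probabilities by `factor`, then renormalize.

    factor > 1 makes vertical moves more likely; factor = 1 changes nothing.
    Illustrative only; not the adjustment used in the paper.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    weighted = {
        m: p * (factor if m in ("up", "down") else 1.0)
        for m, p in probs.items()
    }
    total = sum(weighted.values())
    return {m: w / total for m, w in weighted.items()}

# Hypothetical ghost with no directional preference.
even = {"up": 0.25, "down": 0.25, "left": 0.25, "right": 0.25}
biased = bias_vertical(even, factor=3.0)
```

Unlike the teleporting noise, this change keeps every move legal: the ghost still steps to an adjacent square, only with shifted odds.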
“It was not only due to the way we added noise to create ad hoc environments. This seems to be a property of the reinforcement learning problem. And that was even more surprising to see,” Bono says.
Exploration explanations
When the researchers dug deeper in search of an explanation, they saw some correlations in how the AI agents explore the training space.
When both AI agents explore mostly the same areas, the agent trained in the non-noisy environment performs better, perhaps because it is easier for the agent to learn the rules of the game without the interference of noise.
If their exploration patterns are different, then the agent trained in the noisy environment tends to perform better. This might occur because the agent needs to understand patterns it can’t learn in the noise-free environment.
“If I only learn to play tennis with my forehand in the non-noisy environment, but then in the noisy one I have to also play with my backhand, I won’t play as well in the non-noisy environment,” Bono explains.
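One simple way to quantify whether two agents "explore mostly the same areas" is to compare how often each one visits each state. The sketch below uses an overlap score (the sum, over states, of the smaller of the two visit frequencies). This metric and the sample visit logs are assumptions chosen for illustration; the release does not say how the researchers measured exploration.

```python
from collections import Counter

def visit_distribution(visited_states):
    """Turn a log of visited states into visit frequencies that sum to 1."""
    if not visited_states:
        raise ValueError("need at least one visited state")
    counts = Counter(visited_states)
    total = sum(counts.values())
    return {s: c / total for s, c in counts.items()}

def exploration_overlap(p, q):
    """Overlap between two visit distributions.

    1.0 means identical exploration patterns; 0.0 means the agents
    never visited the same state.
    """
    return sum(min(p.get(s, 0.0), q.get(s, 0.0)) for s in set(p) | set(q))

# Hypothetical visit logs: (row, column) squares on a game board.
agent_clean = [(0, 0), (0, 1), (0, 1), (1, 1)]
agent_noisy = [(0, 0), (0, 1), (1, 1), (1, 1)]

overlap = exploration_overlap(
    visit_distribution(agent_clean),
    visit_distribution(agent_noisy),
)
```

With a score like this, a high overlap would correspond to the case where the noise-free agent tends to do better, and a low overlap to the case where the noise-trained agent tends to do better.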
In the future, the researchers hope to explore how the indoor training effect might occur in more complex reinforcement learning environments, or with other techniques like computer vision and natural language processing. They also want to build training environments designed to leverage the indoor training effect, which could help AI agents perform better in uncertain environments.
New training approach could help AI agents perform better in uncertain conditions
Sometimes, it might be better to train a robot in an environment that’s different from the one where it will be deployed
2025-01-29