PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

ACM A.M. Turing Award honors two researchers who led the development of cornerstone AI technology

Andrew Barto and Richard Sutton recognized as pioneers of reinforcement learning

ACM A.M. Turing Award honors two researchers who led the development of cornerstone AI technology
2025-03-06
(Press-News.org) ACM, the Association for Computing Machinery, today named Andrew G. Barto and Richard S. Sutton as the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. In a series of papers beginning in the 1980s, Barto and Sutton introduced the main ideas, constructed the mathematical foundations, and developed important algorithms for reinforcement learning—one of the most important approaches for creating intelligent systems.

Barto is Professor Emeritus of Information and Computer Sciences at the University of Massachusetts, Amherst. Sutton is a Professor of Computer Science at the University of Alberta, a Research Scientist at Keen Technologies, and a Fellow at Amii (Alberta Machine Intelligence Institute).  

The ACM A.M. Turing Award, often referred to as the “Nobel Prize in Computing,” carries a $1 million prize with financial support provided by Google, Inc. The award is named for Alan M. Turing, the British mathematician who articulated the mathematical foundations of computing.

What is Reinforcement Learning?
The field of artificial intelligence (AI) is generally concerned with constructing agents—that is, entities that perceive and act. More intelligent agents are those that choose better courses of action. Therefore, the notion that some courses of action are better than others is central to AI. Reward—a term borrowed from psychology and neuroscience—denotes a signal provided to an agent related to the quality of its behavior. Reinforcement learning (RL) is the process of learning to behave more successfully given this signal.

The idea of learning from reward has been familiar to animal trainers for thousands of years. Later, Alan Turing’s 1950 paper “Computing Machinery and Intelligence,” addressed the question “Can machines think?” and proposed an approach to machine learning based on rewards and punishments.

While Turing reported having conducted some initial experiments with this approach and Arthur Samuel developed a checker-playing program in the late 1950s that learned from self-play, little further progress occurred in this vein of AI in the following decades. In the early 1980s, motivated by observations from psychology, Barto and his PhD student Sutton began to formulate reinforcement learning as a general problem framework.

They drew on the mathematical foundation provided by Markov decision processes (MDPs), wherein an agent makes decisions in a stochastic (randomly determined) environment, receiving a reward signal after each transition and aiming to maximize its long-term cumulative reward. Whereas standard MDP theory assumes that everything about the MDP is known to the agent, the RL framework allows for the environment and the rewards to be unknown. The minimal information requirements of RL, combined with the generality of the MDP framework, allows RL algorithms to be applied to a vast range of problems, as explained further below.

Barto and Sutton, jointly and with others, developed many of the basic algorithmic approaches for RL. These include their foremost contribution, temporal difference learning, which made an important advance in solving reward prediction problems, as well as policy-gradient methods and the use of neural networks as a tool to represent learned functions. They also proposed agent designs that combined learning and planning, demonstrating the value of acquiring knowledge of the environment as a basis for planning.

Perhaps equally influential was their textbook, Reinforcement Learning: An Introduction (1998), which is still the standard reference in the field and has been cited over 75,000 times. It allowed thousands of researchers to understand and contribute to this emerging field and continues to inspire much significant research activity in computer science today.

Although Barto and Sutton’s algorithms were developed decades ago, major advances in the practical applications of RL came about in the past fifteen years by merging RL with deep learning algorithms (pioneered by 2018 Turing Awardees Bengio, Hinton, and LeCun). This led to the technique of deep reinforcement learning.

The most prominent example of RL was the victory by the AlphaGo computer program over the best human Go players in 2016 and 2017. Another major achievement recently has been the development of the chatbot ChatGPT. ChatGPT is a large language model (LLM) trained in two phases, the second of which employs a technique called reinforcement learning from human feedback (RLHF), to capture human expectations.

RL has achieved success in many other areas as well. A high-profile research example is robot motor skill learning in the in-hand robotic manipulation and solution of a physical (Rubik’s Cube), which showed it possible to do all the reinforcement learning in simulation yet ultimately be successful in the significantly different real world.

Other areas include network congestion control, chip design, internet advertising, optimization, global supply chain optimization, improving the behavior and reasoning capabilities of chatbots, and even improving algorithms for one of the oldest problems in computer science, matrix multiplication.

Finally, a technology that was partly inspired by neuroscience has returned the favor. Recent research, including work by Barto, has shown that specific RL algorithms developed in AI provide the best explanations for a wide range of findings concerning the dopamine system in the human brain.

“Barto and Sutton’s work demonstrates the immense potential of applying a multidisciplinary approach to longstanding challenges in our field,” explains ACM President Yannis Ioannidis. “Research areas ranging from cognitive science and psychology to neuroscience inspired the development of reinforcement learning, which has laid the foundations for some of the most important advances in AI and has given us greater insight into how the brain works. Barto and Sutton’s work is not a stepping stone that we have now moved on from. Reinforcement learning continues to grow and offers great potential for further advances in computing and many other disciplines. It is fitting that we are honoring them with the most prestigious award in our field.”

“In a 1947 lecture, Alan Turing stated ‘What we want is a machine that can learn from experience,’” noted Jeff Dean, Chief Scientist of Google. “Reinforcement learning, as pioneered by Barto and Sutton, directly answers Turing’s challenge. Their work has been a lynchpin of progress in AI over the last several decades. The tools they developed remain a central pillar of the AI boom and have rendered major advances, attracted legions of young researchers, and driven billions of dollars in investments. RL’s impact will continue well into the future. Google is proud to sponsor the ACM A.M. Turing Award and honor the individuals who have shaped the technologies that improve our lives.”  


Biographical Background

Andrew G. Barto
Andrew Barto is Professor Emeritus, Department of Information and Computer Sciences, University of Massachusetts, Amherst. He began his career at UMass Amherst as a postdoctoral Research Associate in 1977, and has subsequently held various positions including Associate Professor, Professor, and Department Chair. Barto received a BS degree in Mathematics (with distinction) from the University of Michigan, where he also earned his MS and PhD degrees in Computer and Communication Sciences.

Barto’s honors include the UMass Neurosciences Lifetime Achievement Award, the IJCAI Award for Research Excellence, and the IEEE Neural Network Society Pioneer Award. He is a Fellow of the Institute of Electrical and Electronics Engineers (IEEE), and a Fellow of the American Association for the Advancement of Science (AAAS).

Richard S. Sutton
Richard Sutton is a Professor in Computing Science at the University of Alberta, a Research Scientist at Keen Technologies (an artificial general intelligence company based in Dallas, Texas) and Chief Scientific Advisor of the Alberta Machine Intelligence Institute (Amii). Sutton was a Distinguished Research Scientist at Deep Mind from 2017 to 2023. Prior to joining the University of Alberta, he served as a Principal Technical Staff Member in the Artificial Intelligence Department at the AT&T Shannon Laboratory in Florham Park, New Jersey, from 1998 to 2002. Sutton’s collaborations with Andrew Barto began in 1978 at the University of Massachusetts at Amherst, where Barto was Sutton’s PhD and postdoctoral advisor. Sutton received his BA in Psychology from Stanford University and earned his MS and PhD degrees in Computer and Information Science from the University of Massachusetts at Amherst.

Sutton’s honors include receiving the IJCAI Research Excellence Award, a Lifetime Achievement Award from the Canadian Artificial Intelligence Association, and an Outstanding Achievement in Research Award from the University of Massachusetts at Amherst. Sutton is a Fellow of the Royal Society of London, a Fellow of the Association for the Advancement of Artificial Intelligence, and a Fellow of the Royal Society of Canada.

About the ACM A.M. Turing Award
The A.M. Turing Award is named for Alan M. Turing, the British mathematician who articulated the mathematical foundations of computing, and who was a key contributor to the Allied cryptanalysis of the Enigma cipher during World War II. Since its inception in 1966, the Turing Award has honored the computer scientists and engineers who created the systems and their underlying theoretical foundations that have propelled the information technology industry.

About ACM
ACM, the Association for Computing Machinery, is the world’s largest educational and scientific computing society, uniting computing educators, researchers, and professionals to inspire dialogue, share resources, and address the field’s challenges. ACM strengthens the computing profession’s collective voice through strong leadership, promotion of the highest standards, and recognition of technical excellence. ACM supports the professional growth of its members by providing opportunities for life-long learning, career development, and professional networking.

 

###

 

 

END


[Attachments] See images for this press release:
ACM A.M. Turing Award honors two researchers who led the development of cornerstone AI technology

ELSE PRESS RELEASES FROM THIS DATE:

Incarcerated people are disproportionately impacted by climate change, CU doctors say

2025-03-06
When a wildfire approaches a prison and an evacuation warning is issued, what are the health risks that incarcerated people face when officials decide to not evacuate? What happens if the evacuation warning turns into a mandate and there are no transportation options to securely move everyone, or there are no nearby facilities to go to?  These are some of the issues raised by two University of Colorado Department of Medicine faculty members — Katherine LeMasters, PhD, and Lawrence Haber, MD — in a correspondence titled, “The Hidden Crisis of Incarcerated Individuals During Wildfires,” which was recently ...

ESA 2025 Graduate Student Policy Award Cohort Named

ESA 2025 Graduate Student Policy Award Cohort Named
2025-03-06
The Ecological Society of America is pleased to announce the recipients of the 2025 Katherine S. McCarter Graduate Student Policy Award (GSPA). Students in the 2025 cohort are engaged in advocacy with an interest in science policy. Awardees will travel to Washington, D.C., for policy, communication and career training followed by meetings with lawmakers on Capitol Hill. “Kudos to these ten outstanding graduate students and scientists in training,” said ESA President Stephanie Hampton. “Their dedication to science policy is essential for bridging research and decision-making. By engaging with policymakers, they will help ensure that ecological science ...

Insomnia, lack of sleep linked to high blood pressure in teens

2025-03-06
Research Highlights: Teenagers who slept less than 7.7 hours in a sleep lab were observed to be almost three times more likely to have elevated blood pressure than well-rested peers. Those who reported insomnia and slept less than 7.7 hours in a sleep lab were five times more likely to have stage 2 hypertension when compared with well-rested peers. The study did not find a notable link between elevated blood pressure or stage 2 hypertension risk among adolescents who reported insomnia but slept 7.7 hours or more. Note: The study featured in this news release is a research abstract. Abstracts presented at the American Heart Association’s scientific ...

Heart & stroke risks vary among Asian American, Native Hawaiian & Pacific Islander adults

2025-03-06
Research Highlights: The prevalence of cardiovascular disease risk factors varies greatly among Asian American, Native Hawaiian and other Pacific Islander (AANHPI) populations, according to an analysis of electronic health records for more than 700,000 adults in California and Hawaii. The 10-year predicted risk of a major cardiovascular event, such as a heart attack, stroke or heart failure, also varied among the different groups. These results highlight differential risks and raise awareness for the importance of identifying and managing cardiovascular disease risk factors in high-risk populations, the researchers noted. Note: The study ...

Levels of select vitamins & minerals in pregnancy may be linked to lower midlife BP risk

2025-03-06
Research Highlights: Higher levels of the minerals copper and manganese in pregnant women were associated with lower blood pressure and a reduced risk of developing high blood pressure decades later, according to a long-term study of women in Massachusetts. Higher levels of vitamin B12 were also associated with lower blood pressure in midlife. Note: The study featured in this news release is a research abstract presenting at the American Heart Association’s Epidemiology, Prevention, Lifestyle and Metabolic Health Scientific Sessions 2025, and the full manuscript is simultaneously published in the American Heart Association’s peer-reviewed journal Hypertension. Embargoed ...

Large study of dietary habits suggests more plant oils, less butter could lead to better health

2025-03-06
People who consume plant-based oil instead of butter may experience beneficial health effects and even have a lower risk of premature death, according to a new study by investigators from Mass General Brigham, Harvard T.H. Chan School of Public Health, and the Broad Institute of MIT and Harvard. The researchers examined diet and health data from 200,000 people followed for more than 30 years and found that higher intake of plant-based oils, especially soybean, canola, and olive oil, was associated with lower ...

Butter and plant-based oils intake and mortality

2025-03-06
About The Study: In this cohort study, higher intake of butter was associated with increased mortality, while higher plant-based oils intake was associated with lower mortality. Substituting butter with plant-based oils may confer substantial benefits for preventing premature deaths.  Corresponding Author: To contact the corresponding author, Dong D. Wang, MD, ScD, email dow471@mail.harvard.edu. To access the embargoed study: Visit our For The Media website at this link https://media.jamanetwork.com/ (doi:10.1001/jamainternmed.2025.0205) Editor’s ...

20% of butterflies in the U.S. have disappeared since 2000

20% of butterflies in the U.S. have disappeared since 2000
2025-03-06
BINGHAMTON, N.Y. -- Butterflies are beloved creatures that inspire art and play an important ecological role, but you might have noticed less of them brightening your day in recent years. According to new research featuring faculty at Binghamton University, State University of New York, these cherished insects are disappearing at an alarming rate. A new study published in Science examines butterfly data in the United States, and the results are troubling. Looking across 76,000 surveys, the study revealed that butterfly abundance fell by 22% between 2000 and 2020. To put it starkly: for every five butterflies in the U.S. ...

Bacterial ‘jumping genes’ can target and control chromosome ends

2025-03-06
ITHACA, N.Y. -- Transposons, or “jumping genes” – DNA segments that can move from one part of the genome to another – are key to bacterial evolution and the development of antibiotic resistance. Cornell University researchers have discovered a new mechanism these genes use to survive and propagate in bacteria with linear DNA, with applications in biotechnology and drug development. In a paper under embargo in Science until 2pm ET on March 6, 2025, researchers show that transposons can target and insert themselves at the ends ...

Scientists identify genes that make humans and Labradors more likely to become obese

Scientists identify genes that make humans and Labradors more likely to become obese
2025-03-06
Researchers studying British Labrador retrievers have identified multiple genes associated with canine obesity and shown that these genes are also associated with obesity in humans. The dog gene found to be most strongly associated with obesity in Labradors is called DENND1B. Humans also carry the DENND1B gene, and the researchers found that this gene is also linked with obesity in people.   DENND1B was found to directly affect a brain pathway responsible for regulating the energy balance in the body, called the leptin melanocortin pathway.   An additional four genes associated with canine obesity, ...

LAST 30 PRESS RELEASES:

No evidence that substituting NHS doctors with physician associates is necessarily safe

At-home brain speed tests bridge cognitive data gaps

CRF appoints Josep Rodés-Cabau, M.D., Ph.D., as editor-in-chief of structural heart: the journal of the heart team

Violent crime is indeed a root cause of migration, according to new study

Customized smartphone app shows promise in preventing further cognitive decline among older adults diagnosed with mild impairment

Impact of COVID-19 on education not going away, UM study finds

School of Public Health researchers receive National Academies grant to assess environmental conditions in two Houston neighborhoods

Three Speculum articles recognized with prizes

ACM A.M. Turing Award honors two researchers who led the development of cornerstone AI technology

Incarcerated people are disproportionately impacted by climate change, CU doctors say

ESA 2025 Graduate Student Policy Award Cohort Named

Insomnia, lack of sleep linked to high blood pressure in teens

Heart & stroke risks vary among Asian American, Native Hawaiian & Pacific Islander adults

Levels of select vitamins & minerals in pregnancy may be linked to lower midlife BP risk

Large study of dietary habits suggests more plant oils, less butter could lead to better health

Butter and plant-based oils intake and mortality

20% of butterflies in the U.S. have disappeared since 2000

Bacterial ‘jumping genes’ can target and control chromosome ends

Scientists identify genes that make humans and Labradors more likely to become obese

Early-life gut microbes may protect against diabetes, research in mice suggests

Study raises the possibility of a country without butterflies

Study reveals obesity gene in dogs that is relevant to human obesity studies

A rapid decline in US butterfly populations

Indigenous farming practices have shaped manioc’s genetic diversity for millennia

Controlling electrons in molecules at ultrafast timescales

Tropical forests in the Americas are struggling to keep pace with climate change

Brain mapping unlocks key Alzheimer’s insights

Clinical trial tests novel stem-cell treatment for Parkinson’s disease

Awareness of rocky mountain spotted fever saves lives

Breakthrough in noninvasive monitoring of molecular processes in deep tissue

[Press-News.org] ACM A.M. Turing Award honors two researchers who led the development of cornerstone AI technology
Andrew Barto and Richard Sutton recognized as pioneers of reinforcement learning