Evolutionary reinforcement learning promises further advances in machine learning

2023-05-19

(Press-News.org)

Evolutionary reinforcement learning is an exciting frontier in machine learning, combining the strengths of two distinct approaches: reinforcement learning and evolutionary computation. In evolutionary reinforcement learning, an intelligent agent learns optimal strategies by actively exploring different approaches and receiving rewards for successful performance. This innovative paradigm combines reinforcement learning's trial-and-error learning with evolutionary algorithms' ability to mimic natural selection, resulting in a powerful methodology for artificial intelligence development that promises breakthroughs in various domains.

A groundbreaking review article on evolutionary reinforcement learning was published Apr. 21 in Intelligent Computing, a Science Partner Journal. It sheds light on the latest advancements in the integration of evolutionary computation with reinforcement learning and presents a comprehensive survey of state-of-the-art methods.

Reinforcement learning, a subfield of machine learning, focuses on developing algorithms that learn to make decisions based on feedback from the environment. Remarkable examples of successful reinforcement learning include AlphaGo and, more recently, Google DeepMind robots that play soccer. However, reinforcement learning still faces several challenges, including the exploration and exploitation trade-off, reward design, generalization and credit assignment.
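
The exploration and exploitation trade-off named above can be illustrated with a small, self-contained sketch. It is not taken from the review article: the two-armed bandit, the epsilon-greedy rule and every name in it (`epsilon_greedy`, `run_bandit`, the reward means) are assumptions chosen only for illustration.

```python
import random

def epsilon_greedy(estimates, epsilon, rng):
    """Pick a random action with probability epsilon, else the best-looking one."""
    if rng.random() < epsilon:
        return rng.randrange(len(estimates))  # explore
    return max(range(len(estimates)), key=estimates.__getitem__)  # exploit

def run_bandit(true_means, epsilon, steps=2000, seed=0):
    """Play a toy multi-armed bandit (hypothetical environment) and track reward."""
    rng = random.Random(seed)
    estimates = [0.0] * len(true_means)  # running estimate of each action's reward
    counts = [0] * len(true_means)       # how often each action was tried
    total = 0.0
    for _ in range(steps):
        action = epsilon_greedy(estimates, epsilon, rng)
        reward = rng.gauss(true_means[action], 1.0)  # noisy feedback from the environment
        counts[action] += 1
        # Incremental mean update of the estimate for the chosen action.
        estimates[action] += (reward - estimates[action]) / counts[action]
        total += reward
    return total / steps, counts

average_reward, counts = run_bandit(true_means=[0.0, 1.0], epsilon=0.1)
print(average_reward, counts)
```

Setting `epsilon` to 0 makes the agent purely exploitative, while 1 makes it purely exploratory; values in between trade one against the other.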

Evolutionary computation, which emulates the process of natural evolution to solve problems, offers a potential solution to the problems of reinforcement learning. By combining these two approaches, researchers created the field of evolutionary reinforcement learning.
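
As a rough illustration of what emulating natural evolution means in practice, the sketch below runs a very small evolutionary loop: rank candidates by fitness, keep the best and create mutated copies of them. It is a simplified toy and not a method from the review article; the function names, the parameter values and the `toy_return` objective (a stand-in for the reward an agent would collect) are all assumptions.

```python
import random

def evolve(fitness, dim=3, pop_size=20, n_parents=5, sigma=0.3, generations=60, seed=0):
    """Toy evolutionary loop: keep the fittest, mutate them to make children."""
    rng = random.Random(seed)
    # Start from a random population of real-valued candidate solutions.
    population = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: rank by fitness (higher is better) and keep the top few.
        parents = sorted(population, key=fitness, reverse=True)[:n_parents]
        # Variation: each child is a randomly chosen parent plus Gaussian noise.
        children = [
            [gene + rng.gauss(0, sigma) for gene in rng.choice(parents)]
            for _ in range(pop_size - n_parents)
        ]
        # Elitism: parents survive unchanged, so the best fitness never decreases.
        population = parents + children
    return max(population, key=fitness)

# Hypothetical stand-in for an episode return: it peaks at 0 when x equals TARGET.
TARGET = [0.5, -0.2, 0.8]

def toy_return(x):
    return -sum((a - b) ** 2 for a, b in zip(x, TARGET))

best = evolve(toy_return)
print(best, toy_return(best))
```

Because the loop only needs a fitness score for each candidate, it never computes gradients, and the candidates in a generation could be evaluated in parallel.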

Evolutionary reinforcement learning encompasses six key research areas:

Hyperparameter optimization: Evolutionary computing methods can be used for hyperparameter optimization. That is, they can automatically determine the best settings for reinforcement learning systems. Discovering the best settings manually can be challenging due to the multitude of factors involved, such as the learning speed of the algorithm and its inclination towards future rewards. Furthermore, the performance of reinforcement learning relies heavily on the architecture of the neural network employed, including factors like the number and size of its layers.

Policy search: Policy search entails finding the best approach to a task by experimenting with different strategies, aided by neural networks. These networks, akin to powerful calculators, approximate task execution and make use of advancements in deep learning. Since there are numerous task execution possibilities, the search process resembles navigating a vast maze. Stochastic gradient descent is a common method for training neural networks and navigating this maze. Evolutionary computing offers alternative “neuroevolution” methods based on evolution strategies, genetic algorithms and genetic programming. These methods can determine the best weights and other properties of neural networks for reinforcement learning.

Exploration: Reinforcement learning agents improve by interacting with their environment. Too little exploration can lead to poor decisions, while too much exploration is costly. Thus there is a trade-off between an agent’s exploration to discover good behaviors and an agent’s exploitation of the discovered good behaviors. Agents explore by adding randomness to their actions. Efficient exploration faces challenges: a large number of possible actions, rare and delayed rewards, unpredictable environments and complex multi-agent scenarios. Evolutionary computation methods address these challenges by promoting competition, cooperation and parallelization. They encourage exploration through diversity and guided evolution.

Reward shaping: Rewards are important in reinforcement learning, but they are often rare and hard for agents to learn from. Reward shaping adds extra fine-grained rewards to help agents learn better. However, these rewards can alter agents’ behavior in undesired ways, and figuring out exactly what these extra rewards should be, how to balance them and how to assign credit among multiple agents typically requires specific knowledge of the task at hand. To tackle the challenge of reward design, researchers have used evolutionary computation to adjust the extra rewards and their settings in both single-agent and multi-agent reinforcement learning.

Meta-reinforcement learning: Meta-reinforcement learning aims to develop a general learning algorithm that adapts to different tasks using knowledge from previous ones. This approach addresses the issue of requiring a large number of samples to learn each task from scratch in traditional reinforcement learning. However, the number and complexity of tasks that can be solved using meta-reinforcement learning are still limited, and the computational cost associated with it is high. Therefore, exploiting the model-agnostic and highly parallel properties of evolutionary computation is a promising direction to unlock the full potential of meta-reinforcement learning, enabling it to learn, generalize and be more computationally efficient in real-world scenarios.

Multi-objective reinforcement learning: In some real-world problems, there are multiple goals that conflict with each other. A multi-objective evolutionary algorithm can balance these goals and suggest a compromise when no solution seems better than the others. Multi-objective reinforcement learning methods can be grouped into two types: those that combine multiple goals into one to find a single best solution and those that find a range of good solutions. Conversely, some single-goal problems can be usefully broken down into multiple goals to make problem-solving easier.
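
The two types of multi-objective methods described above can be contrasted in a short sketch. It is illustrative only and not drawn from the review article: the candidate scores, the objective names (speed and safety) and the helper functions are hypothetical. `pareto_front` returns the range of good solutions, meaning those that no other candidate beats on every objective, while `best_weighted` combines the objectives into one weighted sum and returns a single winner.

```python
def dominates(a, b):
    """True if a is at least as good as b on every objective and strictly better on one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def pareto_front(points):
    """Return the points that no other point dominates (all objectives are maximized)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def best_weighted(points, weights):
    """Collapse the objectives into one weighted sum and return the single best point."""
    return max(points, key=lambda p: sum(w * x for w, x in zip(weights, p)))

# Hypothetical (speed, safety) scores for five candidate policies.
candidates = [(0.9, 0.2), (0.7, 0.6), (0.4, 0.9), (0.5, 0.5), (0.3, 0.3)]

print(pareto_front(candidates))               # the set of trade-off solutions
print(best_weighted(candidates, (0.8, 0.2)))  # favors speed
print(best_weighted(candidates, (0.2, 0.8)))  # favors safety
```

The weighted sum needs the weights to be fixed in advance, whereas the Pareto set leaves the final choice among compromises to whoever uses the result.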

Evolutionary reinforcement learning can solve complex reinforcement learning tasks, even in scenarios with rare or misleading rewards. However, it is computationally expensive. There is a growing need for more efficient methods, including improvements in encoding, sampling, search operators, algorithmic frameworks and evaluation.

While evolutionary reinforcement learning has shown promising results in addressing challenging reinforcement learning problems, further advancements are still possible. By enhancing its computational efficiency and exploring new benchmarks, platforms and applications, researchers in the field of evolutionary reinforcement learning can make evolutionary methods even more effective and useful for solving complex reinforcement learning tasks.

END
