PRESS-NEWS.org - Press Release Distribution

Evolutionary reinforcement learning promises further advances in machine learning
2023-05-19
(Press-News.org)

Evolutionary reinforcement learning is an exciting frontier in machine learning, combining the strengths of two distinct approaches: reinforcement learning and evolutionary computation. In evolutionary reinforcement learning, an intelligent agent learns optimal strategies by actively exploring different approaches and receiving rewards for successful performance. This innovative paradigm combines reinforcement learning's trial-and-error learning with evolutionary algorithms' ability to mimic natural selection, resulting in a powerful methodology for artificial intelligence development that promises breakthroughs in various domains.

A groundbreaking review article on evolutionary reinforcement learning was published Apr. 21 in Intelligent Computing, a Science Partner Journal. It sheds light on the latest advancements in the integration of evolutionary computation with reinforcement learning and presents a comprehensive survey of state-of-the-art methods.

Reinforcement learning, a subfield of machine learning, focuses on developing algorithms that learn to make decisions based on feedback from the environment. Remarkable examples of successful reinforcement learning include AlphaGo and, more recently, Google DeepMind robots that play soccer. However, reinforcement learning still faces several challenges, including the exploration and exploitation trade-off, reward design, generalization and credit assignment.

Evolutionary computation, which emulates the process of natural evolution to solve problems, offers a potential solution to the problems of reinforcement learning. By combining these two approaches, researchers created the field of evolutionary reinforcement learning.
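
As a rough illustration of how an evolutionary method can be applied to a reinforcement learning problem, the sketch below evolves the two parameters of a linear policy so that it collects more reward in a toy one-dimensional task. Everything in it is an assumption made for illustration and does not come from the review: the environment, the reward (negative distance to a target), the function names episode_return and evolve, and the settings for population size and mutation strength.

```python
import random

def episode_return(params, steps=20, target=5.0):
    """Total reward of a linear policy in a toy 1-D task (hypothetical environment)."""
    w, b = params
    x = 0.0
    total = 0.0
    for _ in range(steps):
        # Policy: choose a step toward or away from the target, clipped to [-1, 1].
        action = max(-1.0, min(1.0, w * (target - x) + b))
        x += action
        total += -abs(target - x)  # reward: negative distance to the target
    return total

def evolve(generations=30, pop_size=20, n_elite=5, sigma=0.3, seed=0):
    """Elitist evolutionary search over policy parameters [w, b]."""
    rng = random.Random(seed)
    population = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(pop_size)]
    history = []  # best return seen at the start of each generation
    for _ in range(generations):
        # Selection: rank individuals by episodic return and keep the best.
        population.sort(key=episode_return, reverse=True)
        elite = population[:n_elite]
        history.append(episode_return(elite[0]))
        # Variation: refill the population with mutated copies of the elite.
        children = []
        while len(children) < pop_size - n_elite:
            parent = rng.choice(elite)
            children.append([g + rng.gauss(0, sigma) for g in parent])
        population = elite + children
    best = max(population, key=episode_return)
    return best, history

if __name__ == "__main__":
    best, history = evolve()
    print("first-generation best return:", history[0])
    print("final best return:", episode_return(best))
```

Because the elite individuals are carried over unchanged, the best return never decreases from one generation to the next. Note that no gradient of the reward is needed: the search uses only the total return of each episode, which is one reason such methods are considered for tasks with rare or delayed rewards.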

Evolutionary reinforcement learning encompasses six key research areas:

Hyperparameter optimization: Evolutionary computing methods can be used for hyperparameter optimization. That is, they can automatically determine the best settings for reinforcement learning systems. Discovering the best settings manually can be challenging due to the multitude of factors involved, such as the learning speed of the algorithm and its inclination towards future rewards. Furthermore, the performance of reinforcement learning relies heavily on the architecture of the neural network employed, including factors like the number and size of its layers.

Policy search: Policy search entails finding the best approach to a task by experimenting with different strategies, aided by neural networks. These networks, akin to powerful calculators, approximate task execution and make use of advancements in deep learning. Since there are numerous task execution possibilities, the search process resembles navigating a vast maze. Stochastic gradient descent is a common method for training neural networks and navigating this maze. Evolutionary computing offers alternative “neuroevolution” methods based on evolution strategies, genetic algorithms and genetic programming. These methods can determine the best weights and other properties of neural networks for reinforcement learning.

Exploration: Reinforcement learning agents improve by interacting with their environment. Too little exploration can lead to poor decisions, while too much exploration is costly. Thus there is a trade-off between an agent’s exploration to discover good behaviors and an agent’s exploitation of the discovered good behaviors. Agents explore by adding randomness to their actions. Efficient exploration faces challenges: a large number of possible actions, rare and delayed rewards, unpredictable environments and complex multi-agent scenarios. Evolutionary computation methods address these challenges by promoting competition, cooperation and parallelization. They encourage exploration through diversity and guided evolution.

Reward shaping: Rewards are important in reinforcement learning, but they are often rare and hard for agents to learn from. Reward shaping adds extra fine-grained rewards to help agents learn better. However, these rewards can alter agents’ behavior in undesired ways, and figuring out exactly what these extra rewards should be, how to balance them and how to assign credit among multiple agents typically requires specific knowledge of the task at hand. To tackle the challenge of reward design, researchers have used evolutionary computation to adjust the extra rewards and their settings in both single-agent and multi-agent reinforcement learning.

Meta-reinforcement learning: Meta-reinforcement learning aims to develop a general learning algorithm that adapts to different tasks using knowledge from previous ones. This approach addresses the issue of requiring a large number of samples to learn each task from scratch in traditional reinforcement learning. However, the number and complexity of tasks that can be solved using meta-reinforcement learning are still limited, and the computational cost associated with it is high. Therefore, exploiting the model-agnostic and highly parallel properties of evolutionary computation is a promising direction to unlock the full potential of meta-reinforcement learning, enabling it to learn, generalize and be more computationally efficient in real-world scenarios.

Multi-objective reinforcement learning: In some real-world problems, there are multiple goals that conflict with each other. A multi-objective evolutionary algorithm can balance these goals and suggest a compromise when no solution seems better than the others. Multi-objective reinforcement learning methods can be grouped into two types: those that combine multiple goals into one to find a single best solution and those that find a range of good solutions. Conversely, some single-goal problems can be usefully broken down into multiple goals to make problem-solving easier.
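
The two types of multi-objective method mentioned above can be contrasted with a small sketch. It assumes each candidate policy has already been scored on two goals that are both to be maximized. The scores, the weights and the function names dominates, pareto_front and best_weighted are hypothetical and chosen only for illustration.

```python
def dominates(a, b):
    """True if a is at least as good as b on every goal and strictly better on one."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def pareto_front(points):
    """Second type: keep every candidate that no other candidate dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]

def best_weighted(points, weights):
    """First type: merge the goals into one weighted sum and pick a single winner."""
    return max(points, key=lambda p: sum(w * x for w, x in zip(weights, p)))

# Hypothetical scores of six candidate policies on two conflicting goals.
candidates = [(1.0, 9.0), (4.0, 7.0), (3.0, 6.0), (8.0, 2.0), (5.0, 5.0), (2.0, 2.0)]

if __name__ == "__main__":
    print("range of good solutions:", pareto_front(candidates))
    print("equal weights:", best_weighted(candidates, (0.5, 0.5)))
    print("first goal favored:", best_weighted(candidates, (0.9, 0.1)))
```

The weighted sum returns one answer that changes with the chosen weights, while the non-dominated set keeps all defensible compromises and leaves the final choice to the user.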

Evolutionary reinforcement learning can solve complex reinforcement learning tasks, even in scenarios with rare or misleading rewards. However, it is computationally expensive. There is a growing need for more efficient methods, including improvements in encoding, sampling, search operators, algorithmic frameworks and evaluation.

While evolutionary reinforcement learning has shown promising results in addressing challenging reinforcement learning problems, further advancements are still possible. By enhancing its computational efficiency and exploring new benchmarks, platforms and applications, researchers in the field of evolutionary reinforcement learning can make evolutionary methods even more effective and useful for solving complex reinforcement learning tasks.

END


