Cognitive neuroscience could pave the way for emotionally intelligent robots
Researchers propose a novel auditory perception based feature for extracting emotions from human speech using neural networks
2021-04-28
(Press-News.org) Ishikawa, Japan - Human beings have the ability to recognize emotions in others, but the same cannot be said for robots. Although perfectly capable of communicating with humans through speech, robots and virtual agents are only good at processing logical instructions, which greatly restricts human-robot interaction (HRI). Consequently, a great deal of research in HRI is about emotion recognition from speech. But first, how do we describe emotions?
Categorical emotions such as happiness, sadness, and anger are well understood by us but can be hard for robots to register. Researchers have therefore focused on "dimensional emotions," which represent gradual emotional transitions in natural speech. "Continuous dimensional emotion can help a robot capture the time dynamics of a speaker's emotional state and accordingly adjust its manner of interaction and content in real time," explains Prof. Masashi Unoki from Japan Advanced Institute of Science and Technology (JAIST), who works on speech recognition and processing.
Studies have shown that an auditory perception model simulating the working of a human ear can generate what are called "temporal modulation cues," which faithfully capture the time dynamics of dimensional emotions. Neural networks can then be employed to extract features from these cues that reflect these time dynamics. However, due to the complexity and variety of auditory perception models, the feature extraction step turns out to be quite challenging.
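To make the idea of a temporal modulation cue concrete, here is a minimal sketch in plain Python. It is not the auditory perception model used in the study: it assumes a toy signal (a 500 Hz tone whose loudness rises and falls 4 times per second), approximates the amplitude envelope by rectifying and smoothing, and then checks which of a few candidate modulation rates carries the most energy. All function names, the smoothing window, and the candidate rates are illustrative assumptions.

```python
import math


def temporal_envelope(signal, sample_rate, window_s=0.02):
    """Rough amplitude envelope: full-wave rectify, then moving-average smooth."""
    win = max(1, int(window_s * sample_rate))
    rectified = [abs(x) for x in signal]
    envelope = []
    running = 0.0
    for i, value in enumerate(rectified):
        running += value
        if i >= win:
            running -= rectified[i - win]  # drop the sample that left the window
        envelope.append(running / min(i + 1, win))
    return envelope


def modulation_energy(envelope, sample_rate, mod_freq_hz):
    """Magnitude of the mean-removed envelope at one modulation frequency."""
    mean = sum(envelope) / len(envelope)
    re = im = 0.0
    for n, value in enumerate(envelope):
        phase = 2.0 * math.pi * mod_freq_hz * n / sample_rate
        re += (value - mean) * math.cos(phase)
        im -= (value - mean) * math.sin(phase)
    return math.hypot(re, im) / len(envelope)


def dominant_modulation(signal, sample_rate, candidates_hz):
    """Return the candidate modulation rate with the largest envelope energy."""
    env = temporal_envelope(signal, sample_rate)
    return max(candidates_hz, key=lambda f: modulation_energy(env, sample_rate, f))


# Toy input (assumption): 2 seconds of a 500 Hz carrier, amplitude-modulated at 4 Hz.
SAMPLE_RATE = 4000
toy_signal = [
    (1.0 + 0.8 * math.sin(2.0 * math.pi * 4.0 * n / SAMPLE_RATE))
    * math.sin(2.0 * math.pi * 500.0 * n / SAMPLE_RATE)
    for n in range(2 * SAMPLE_RATE)
]

if __name__ == "__main__":
    print(dominant_modulation(toy_signal, SAMPLE_RATE, [2, 4, 8, 16]))
```

In a real auditory model the signal would first be split into many frequency channels, and an envelope like this would be tracked in each channel; the sketch keeps a single channel to show only the envelope-then-modulation step.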
In a new study published in Neural Networks, Prof. Unoki and his colleagues, including Zhichao Peng from Tianjin University, China (who led the study), Jianwu Dang from Pengcheng Laboratory, China, and Prof. Masato Akagi from JAIST, have now taken inspiration from a recent finding in cognitive neuroscience suggesting that our brain forms multiple representations of natural sounds with different degrees of spectral (i.e., frequency) and temporal resolutions through a combined analysis of spectral-temporal modulations. Accordingly, they have proposed a novel feature called multi-resolution modulation-filtered cochleagram (MMCG), which combines four modulation-filtered cochleagrams (time-frequency representations of the input sound) at different resolutions to obtain the temporal and contextual modulation cues. To account for the diversity of the cochleagrams, the researchers designed a parallel neural network architecture based on long short-term memory (LSTM) networks, which modeled the time variations of the multi-resolution signals from the cochleagrams. They then carried out extensive experiments on two datasets of spontaneous speech.
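The notion of combining one time-frequency representation at several resolutions can be illustrated with a small sketch. This is not the MMCG feature itself, which relies on modulation filtering of cochleagrams as described in the paper. The sketch assumes only a generic matrix of frames by frequency channels, coarsens it along time by average pooling with four hypothetical factors (1, 2, 4, 8), and concatenates the four views frame by frame. The function names and pooling factors are assumptions made for illustration.

```python
def pool_time(frames, factor):
    """Average consecutive groups of `factor` frames (coarser time resolution)."""
    pooled = []
    for start in range(0, len(frames), factor):
        group = frames[start:start + factor]
        pooled.append([sum(column) / len(group) for column in zip(*group)])
    return pooled


def upsample_time(frames, factor, length):
    """Repeat each pooled frame `factor` times, then trim to `length` frames."""
    repeated = [frame for frame in frames for _ in range(factor)]
    return repeated[:length]


def multi_resolution_stack(frames, factors=(1, 2, 4, 8)):
    """Concatenate, frame by frame, the same matrix seen at several time resolutions."""
    views = [
        upsample_time(pool_time(frames, factor), factor, len(frames))
        for factor in factors
    ]
    return [
        [value for view in views for value in view[t]]
        for t in range(len(frames))
    ]


# Toy input (assumption): 8 frames, 2 frequency channels.
toy_frames = [[float(t), float(2 * t)] for t in range(8)]
stacked = multi_resolution_stack(toy_frames)

if __name__ == "__main__":
    for row in stacked:
        print(row)
```

Each output frame carries both fine detail (the factor-1 view) and slower context (the coarser views). A parallel network could then, in principle, read each view in its own branch, although the branch design used in the study is not reproduced here.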
The results were encouraging. The researchers found that MMCG showed significantly better emotion recognition performance than traditional acoustic-based features and other auditory-based features on both datasets. Furthermore, the parallel LSTM network predicted dimensional emotions better than a plain LSTM-based approach.
Prof. Unoki is thrilled and contemplates improving upon the MMCG feature in future research. "Our next goal is to analyze the robustness of environmental noise sources and investigate our feature for other tasks, such as categorical emotion recognition, speech separation, and voice activity detection," he concludes.
Looks like it may not be too long before emotionally intelligent robots become a reality!
INFORMATION:
Reference
Title of original paper: "Multi-resolution modulation-filtered cochleagram feature for LSTM-based dimensional emotion recognition from speech"
Journal: Neural Networks
DOI: 10.1016/j.neunet.2021.03.027
About Japan Advanced Institute of Science and Technology, Japan
Founded in 1990 in Ishikawa prefecture, the Japan Advanced Institute of Science and Technology (JAIST) was the first independent national graduate school in Japan. Now, after 30 years of steady progress, JAIST has become one of Japan's top-ranking universities. JAIST has multiple satellite campuses and strives to foster capable leaders with a state-of-the-art education system where diversity is key; about 40% of its alumni are international students. The university has a unique style of graduate education based on a carefully designed coursework-oriented curriculum to ensure that its students have a solid foundation on which to carry out cutting-edge research. JAIST also works closely with both local and overseas communities by promoting industry-academia collaborative research.
About Professor Masashi Unoki from Japan Advanced Institute of Science and Technology, Japan
Masashi Unoki is a Professor at the School of Information Science at Japan Advanced Institute of Science and Technology (JAIST), where he received his M.S. and Ph.D. degrees in 1996 and 1999, respectively. His main research interests lie in auditory-motivated signal processing and the modeling of auditory systems. Prof. Unoki received the Sato Prize from the Acoustical Society of Japan (ASJ) in 1999, 2010, and 2013 for an Outstanding Paper and the Yamashita Taro "Young Researcher" Prize from the Yamashita Taro Research Foundation in 2005. As a senior researcher, he has 420 publications to his credit, with over 2000 citations.
Funding information
The study was funded by a Grant in Aid for Innovative Areas (no. 18H05004) from MEXT, Japan, and was partially supported by the Research Foundation of Education Bureau of Hunan Province, China (grant no. 18A414).