PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Verbal nonsense reveals limitations of AI chatbots

In a new study, researchers tracked how current language models, such as ChatGPT, mistake nonsense sentences as meaningful. Can these AI flaws open new windows on the brain?

Verbal nonsense reveals limitations of AI chatbots
2023-09-14
(Press-News.org) NEW YORK – The era of artificial-intelligence chatbots that seem to understand and use language the way we humans do has begun. Under the hood, these chatbots use large language models, a particular kind of neural network. But a new study shows that large language models remain vulnerable to mistaking nonsense for natural language. To a team of researchers at Columbia University, it’s a flaw that might point toward ways to improve chatbot performance and help reveal how humans process language. 

 

In a paper published online today in Nature Machine Intelligence, the scientists describe how they challenged nine different language models with hundreds of pairs of sentences. For each pair, people who participated in the study picked which of the two sentences they thought was more natural, meaning that it was more likely to be read or heard in everyday life. The researchers then tested the models to see if they would rate each sentence pair the same way the humans had. 

 

In head-to-head tests, more sophisticated AIs based on what researchers refer to as transformer neural networks tended to perform better than simpler recurrent neural network models and statistical models that just tally the frequency of word pairs found on the internet or in online databases. But all the models made mistakes, sometimes choosing sentences that sound like nonsense to a human ear. 

 

“That some of the large language models perform as well as they do suggests that they capture something important that the simpler models are missing,” said Dr. Nikolaus Kriegeskorte, PhD, a principal investigator at Columbia’s Zuckerman Institute and a coauthor on the paper. “That even the best models we studied still can be fooled by nonsense sentences shows that their computations are missing something about the way humans process language.”

 

Consider the following sentence pair that both human participants and the AI’s assessed in the study:

 

That is the narrative we have been sold. 

This is the week you have been dying. 

 

People given these sentences in the study judged the first sentence as more likely to be encountered than the second. But according to BERT, one of the better models, the second sentence is more natural. GPT-2, perhaps the most widely known model, correctly identified the first sentence as more natural, matching the human judgments.

 

“Every model exhibited blind spots, labeling some sentences as meaningful that human participants thought were gibberish,” said senior author Christopher Baldassano, PhD, an assistant professor of psychology at Columbia. “That should give us pause about the extent to which we want AI systems making important decisions, at least for now.” 

 

The good but imperfect performance of many models is one of the study results that most intrigues Dr. Kriegeskorte. “Understanding why that gap exists and why some models outperform others can drive progress with language models,” he said. 

 

Another key question for the research team is whether the computations in AI chatbots can inspire new scientific questions and hypotheses that could guide neuroscientists toward a better understanding of human brains. Might the ways these chatbots work point to something about the circuitry of our brains?

 

Further analysis of the strengths and flaws of various chatbots and their underlying algorithms could help answer that question.

 

“Ultimately, we are interested in understanding how people think,” said Tal Golan, PhD, the paper’s corresponding author who this year segued from a postdoctoral position at Columbia’s Zuckerman Institute to set up his own lab at Ben-Gurion University of the Negev in Israel. “These AI tools are increasingly powerful but they process language differently from the way we do. Comparing their language understanding to ours gives us a new approach to thinking about how we think.”


 

###

 

The paper, “Testing the limits of natural language models for predicting human language judgements,” was published online in Nature Machine Intelligence on September 14, 2023. Its full list of authors includes Tal Golan, Matthew Siegelman,  Nikolaus Kriegeskorte and Christopher Baldassano.

 

END


[Attachments] See images for this press release:
Verbal nonsense reveals limitations of AI chatbots Verbal nonsense reveals limitations of AI chatbots 2

ELSE PRESS RELEASES FROM THIS DATE:

Revolutionizing brain monitoring and stimulation with thin-film neural electrodes

Revolutionizing brain monitoring and stimulation with thin-film neural electrodes
2023-09-14
 Flexible thin-film electrodes placed directly on brain tissue show promise for the diagnosis and treatment of epilepsy, as demonstrated recently by scientists at Tokyo Tech. Thanks to an innovative yet straightforward design, these durable electrodes accurately match the mechanical properties of brain tissue, leading to better performance during electrocorticography recordings and targeted neural stimulation. Measuring brain activity is a useful technique for diagnosing epilepsy and other neuropsychiatric disorders. Among the several approaches adopted, electroencephalography (EEG) is the least invasive. During EEG recordings, electrodes ...

Researchers present novel principle for nitric oxide-mediated signalling in blood vessels

2023-09-14
Although a simple molecule, nitric oxide is an important signal substance that helps to reduce blood pressure by relaxing the blood vessels. But how it goes about doing this has long been unclear. Researchers at Karolinska Institutet in Sweden now present an entirely novel principle that challenges the Nobel Prize-winning hypothesis that the substance signals in its gaseous form. Their findings are presented in the journal Nature Chemical Biology. That the simple molecule nitric oxide or nitrogen monoxide (NO) serves as a signal substance in many important physiological processes has been known for some time. For example, the discovery of the compound’s ...

Electrons from Earth may be forming water on the Moon

Electrons from Earth may be forming water on the Moon
2023-09-14
A team of researchers, led by a University of Hawai‘i (UH) at Mānoa planetary scientist, discovered that high energy electrons in Earth’s plasma sheet are contributing to weathering processes on the Moon's surface and, importantly, the electrons may have aided the formation of water on the lunar surface. The study was published today in Nature Astronomy.  Understanding the concentrations and distributions of water on the Moon is critical to understanding its formation and evolution, and to providing water resources for future human exploration. The new ...

New tool can reveal inequitable distribution of ‘healing’ green spaces

New tool can reveal inequitable distribution of ‘healing’ green spaces
2023-09-14
Areas in Vancouver with the greatest need for restorative nature often have the least exposure to it, according to a new UBC study published recently in Ambio. These neighbourhoods include Strathcona, downtown Vancouver, the West End, southern Sunset and Marpole. The researchers developed a new tool, the local restorative nature (LRN) index to assess spaces for the presence of qualities that promote mental well-being. While initially applied in Vancouver, the index can also be used in any urban landscape, according to lead author Dr. Tahia Devisscher, an assistant professor in the faculty of forestry. We sat down with Dr. Devisscher to discuss the study findings and ...

Many don’t know key facts about US Constitution, Annenberg study finds

Many don’t know key facts about US Constitution, Annenberg study finds
2023-09-14
PHILADELPHIA – Many Americans do not know what rights are protected under the First Amendment and a substantial number cannot name all three branches of government, according to the 2023 Annenberg Constitution Day Civics Survey. The Annenberg Public Policy Center’s annual, nationally representative survey finds that when U.S. adults are asked to name the specific rights guaranteed by the First Amendment to the Constitution, only one right is recalled by most of the respondents: Freedom of speech, ...

All work and no play will really make a dull life - new research reveals

2023-09-14
The study across three countries led by the Department of Psychology’s Dr Paul Hanel discovered people who prioritised achievement over enjoyment were less happy on the next day. Whereas those who aimed for freedom said they had a 13% increase in well-being, recording better sleep quality and life satisfaction. And participants who tried to relax and follow their hobbies recorded an average well-being boost of 8% and a 10% drop in stress and anxiety. Dr Hanel worked with colleagues at the University of Bath on the Journal of Personality-published study. For the first ...

New poll shows 77% of Massachusetts residents support $600 child & family tax credit

2023-09-14
Boston, MA – New polling data released late last week shows 77% of surveyed Massachusetts residents support a $600 state Child and Family Tax Credit. This polling confirms the popularity of the more generous Child and Family Tax Credit included in the House tax package, which is under consideration alongside the Senate tax bill by a bicameral conference committee. “The overwhelming support for a $600 tax credit per child matches up with the stories I have heard from families across my district, and the experiences of working Massachusetts families that they need more financial ...

New camera offers ultrafast imaging at a fraction of the normal cost

New camera offers ultrafast imaging at a fraction of the normal cost
2023-09-14
WASHINGTON — Capturing blur-free images of fast movements like falling water droplets or molecular interactions requires expensive ultrafast cameras that acquire millions of images per second. In a new paper, researchers report a camera that could offer a much less expensive way to achieve ultrafast imaging for a wide range of applications such as real-time monitoring of drug delivery or high-speed lidar systems for autonomous driving. “Our camera uses a completely new method to achieve high-speed imaging,” said Jinyang Liang from the Institut national de la recherche scientifique (INRS) ...

Peer-led patient navigation helps minoritized patients engage in their own mental healthcare

2023-09-14
INDIANAPOLIS – Research scientists led by Johanne Eliacin, PhD, of the U.S. Department of Veterans’ Affairs (VA) and Regenstrief Institute, have developed PARTNER-MH, an innovative, peer-led patient navigation program to support racially and ethnically minoritized veterans seeking mental healthcare, regardless of the types of mental health services needed or their mental health diagnoses. In two peer-reviewed published papers they report significant improvements in mental health outcomes and high participant satisfaction with the program. PARTNER-MH, developed for VA mental ...

Enhancing neonatal health: Genomic sequencing as a primary screening tool

Enhancing neonatal health: Genomic sequencing as a primary screening tool
2023-09-14
Newborn screening (NBS) is routinely performed across the world using biochemical testing methods. Recent advancements in genetic sequencing are a potential game-changer for newborn screening, swiftly assessing a comprehensive range of monogenic disorders. Yet, the effectiveness of genetic sequencing as an alternative method for NBS has not previously been studied. To evaluate the outcomes of applying gene panel sequencing as a first-tier newborn screening test, a recent study conducted by eight NBS centers and BGI Genomics was ...

LAST 30 PRESS RELEASES:

Insulin resistance is linked to over 30 diseases – and to early death in women, study of people in the UK finds

Innovative semaglutide hydrogel could reduce diabetes shots to once a month

Weight loss could reduce the risk of severe infections in people with diabetes, UK research suggests

Long-term exposure to air pollution and a lack of green space increases the risk of hospitalization for respiratory conditions

Better cardiovascular health in early pregnancy may offset high genetic risk

Artificial intelligence method transforms gene mutation prediction in lung cancer: DeepGEM data releases at IASLC 2024 World Conference on Lung Cancer

Antibody–drug conjugate I-DXd shows clinically meaningful response in patients with extensive-stage small cell lung cancer

IASLC Global Survey on biomarker testing reveals progress and persistent barriers in lung cancer biomarker testing

Research shows pathway to developing predictive biomarkers for immune checkpoint inhibitors

Just how dangerous is Great Salt Lake dust? New research looks for clues

Maroulas appointed Associate Vice Chancellor, Director of AI Tennessee

New chickadee research finds cognitive skills impact lifespan

Cognitive behavioral therapy enhances brain circuits to relieve depression

Terasaki Institute awarded $2.3 Million grant from NIH for organ transplantation research using organs-on-a-chip technology

Atoms on the edge

Postdoc takes multipronged approach to muon detection

Mathematical proof: Five satellites needed for precise navigation

Scalable, multi-functional device lays groundwork for advanced quantum applications

Falling for financial scams? It may signal early Alzheimer’s disease

Integrating MRI and OCT for new insights into brain microstructure

Designing a normative neuroimaging library to support diagnosis of traumatic brain injury

Department of Energy announces $68 million in funding for artificial intelligence for scientific research

DOE, ORNL announce opportunity to define future of high-performance computing

Molecular simulations, supercomputing lead to energy-saving biomaterials breakthrough

Low-impact yoga and exercise found to help older women manage urinary incontinence

Genetic studies reveal new insights into cognitive impairment in schizophrenia

Researcher develops technology to provide cleaner energy and cleaner water

Expect the unexpected: nanoscale silver unveils intrinsic self-healing abilities

nTIDE September 2024 Jobs Report: Gains in employment for people with disabilities appear to level off after reducing gaps with non-disabled workers

Wiley enhances NMR Spectral Library Collection with extensive new databases

[Press-News.org] Verbal nonsense reveals limitations of AI chatbots
In a new study, researchers tracked how current language models, such as ChatGPT, mistake nonsense sentences as meaningful. Can these AI flaws open new windows on the brain?