Verbal nonsense reveals limitations of AI chatbots

In a new study, researchers tracked how current language models, such as ChatGPT, mistake nonsense sentences for meaningful ones. Can these AI flaws open new windows on the brain?

2023-09-14
(Press-News.org) NEW YORK – The era of artificial-intelligence chatbots that seem to understand and use language the way we humans do has begun. Under the hood, these chatbots use large language models, a particular kind of neural network. But a new study shows that large language models remain vulnerable to mistaking nonsense for natural language. To a team of researchers at Columbia University, it’s a flaw that might point toward ways to improve chatbot performance and help reveal how humans process language. 

 

In a paper published online today in Nature Machine Intelligence, the scientists describe how they challenged nine different language models with hundreds of pairs of sentences. For each pair, people who participated in the study picked which of the two sentences they thought was more natural, meaning that it was more likely to be read or heard in everyday life. The researchers then tested the models to see if they would rate each sentence pair the same way the humans had. 
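The comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the study's actual analysis code: the names `model_choice`, `agreement_rate`, and `toy_score` are hypothetical, the stand-in scorer is not a language model, and the two trials are invented rather than taken from the paper.

```python
from typing import Callable, Sequence, Tuple

# Each trial: (sentence_a, sentence_b, index of the sentence humans preferred).
Trial = Tuple[str, str, int]


def model_choice(score: Callable[[str], float], a: str, b: str) -> int:
    """Return 0 if the model scores sentence a at least as high as b, else 1."""
    return 0 if score(a) >= score(b) else 1


def agreement_rate(score: Callable[[str], float], trials: Sequence[Trial]) -> float:
    """Fraction of trials where the model picks the sentence humans picked."""
    if not trials:
        raise ValueError("need at least one trial")
    hits = sum(model_choice(score, a, b) == human for a, b, human in trials)
    return hits / len(trials)


# Hypothetical stand-in scorer: prefers shorter sentences. NOT a language model.
def toy_score(sentence: str) -> float:
    return -float(len(sentence.split()))


# Invented trials for illustration only; not data from the study.
toy_trials = [
    ("the cat sat on the mat", "mat the on sat cat the quickly", 0),
    ("colorless ideas sleep", "we went home early today", 1),
]

rate = agreement_rate(toy_score, toy_trials)
```

In practice, `score` would be replaced by whatever a given language model assigns to a sentence, for example its log-probability.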

 

In head-to-head tests, more sophisticated AIs based on what researchers refer to as transformer neural networks tended to perform better than simpler recurrent neural network models and statistical models that just tally the frequency of word pairs found on the internet or in online databases. But all the models made mistakes, sometimes choosing sentences that sound like nonsense to a human ear. 
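For readers curious about the simplest kind of model mentioned above, one that tallies word pairs, here is a minimal sketch of a bigram scorer. It is an assumption-laden toy, not the models used in the study: the class name `BigramModel`, the add-one smoothing, and the two-sentence corpus are all choices made for illustration.

```python
import math
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    """Lowercase, split on whitespace, and add sentence boundary markers."""
    return ["<s>"] + text.lower().split() + ["</s>"]


class BigramModel:
    """Scores sentences from counts of adjacent word pairs (bigrams)."""

    def __init__(self, corpus: List[str]) -> None:
        self.unigrams: Counter = Counter()  # how often each word starts a pair
        self.bigrams: Counter = Counter()   # how often each word pair occurs
        for sentence in corpus:
            tokens = tokenize(sentence)
            self.unigrams.update(tokens[:-1])
            self.bigrams.update(zip(tokens, tokens[1:]))
        self.vocab_size = len({w for s in corpus for w in tokenize(s)})

    def log_prob(self, sentence: str) -> float:
        """Sum of log P(word | previous word), with add-one smoothing (an assumption)."""
        tokens = tokenize(sentence)
        total = 0.0
        for prev, word in zip(tokens, tokens[1:]):
            num = self.bigrams[(prev, word)] + 1
            den = self.unigrams[prev] + self.vocab_size
            total += math.log(num / den)
        return total


# Invented toy corpus; a real model would be trained on far more text.
model = BigramModel(["the dog chased the cat", "the cat chased the mouse"])
natural = model.log_prob("the dog chased the cat")
scrambled = model.log_prob("cat the chased dog the")
```

With this toy corpus, the familiar word order receives a higher log-probability than the scrambled one, which is the kind of preference such a model can express when shown a sentence pair.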

 

“That some of the large language models perform as well as they do suggests that they capture something important that the simpler models are missing,” said Nikolaus Kriegeskorte, PhD, a principal investigator at Columbia’s Zuckerman Institute and a coauthor on the paper. “That even the best models we studied still can be fooled by nonsense sentences shows that their computations are missing something about the way humans process language.”

 

Consider the following sentence pair that both human participants and the AI models assessed in the study:

 

That is the narrative we have been sold. 

This is the week you have been dying. 

 

People given these sentences in the study judged the first sentence as more likely to be encountered than the second. But according to BERT, one of the better models, the second sentence is more natural. GPT-2, perhaps the most widely known model, correctly identified the first sentence as more natural, matching the human judgments.

 

“Every model exhibited blind spots, labeling some sentences as meaningful that human participants thought were gibberish,” said senior author Christopher Baldassano, PhD, an assistant professor of psychology at Columbia. “That should give us pause about the extent to which we want AI systems making important decisions, at least for now.” 

 

The good but imperfect performance of many models is one of the study results that most intrigues Dr. Kriegeskorte. “Understanding why that gap exists and why some models outperform others can drive progress with language models,” he said. 

 

Another key question for the research team is whether the computations in AI chatbots can inspire new scientific questions and hypotheses that could guide neuroscientists toward a better understanding of human brains. Might the ways these chatbots work point to something about the circuitry of our brains?

 

Further analysis of the strengths and flaws of various chatbots and their underlying algorithms could help answer that question.

 

“Ultimately, we are interested in understanding how people think,” said Tal Golan, PhD, the paper’s corresponding author, who this year moved from a postdoctoral position at Columbia’s Zuckerman Institute to set up his own lab at Ben-Gurion University of the Negev in Israel. “These AI tools are increasingly powerful but they process language differently from the way we do. Comparing their language understanding to ours gives us a new approach to thinking about how we think.”


 

###

 

The paper, “Testing the limits of natural language models for predicting human language judgements,” was published online in Nature Machine Intelligence on September 14, 2023. Its full list of authors includes Tal Golan, Matthew Siegelman, Nikolaus Kriegeskorte and Christopher Baldassano.

 
