(Press-News.org) Almost all leading large language models or “chatbots” show signs of mild cognitive impairment in tests widely used to spot early signs of dementia, finds a study in the Christmas issue of The BMJ.
The results also show that “older” versions of chatbots, like older patients, tend to perform worse on the tests. The authors say these findings “challenge the assumption that artificial intelligence will soon replace human doctors.”
Huge advances in the field of artificial intelligence have led to a flurry of excited and fearful speculation as to whether chatbots can surpass human physicians.
Several studies have shown large language models (LLMs) to be remarkably adept at a range of medical diagnostic tasks, but their susceptibility to human impairments such as cognitive decline have not yet been examined.
To fill this knowledge gap, researchers assessed the cognitive abilities of the leading, publicly available LLMs - ChatGPT versions 4 and 4o (developed by OpenAI), Claude 3.5 “Sonnet” (developed by Anthropic), and Gemini versions 1 and 1.5 (developed by Alphabet) - using the Montreal Cognitive Assessment (MoCA) test.
The MoCA test is widely used to detect cognitive impairment and early signs of dementia, usually in older adults. Through a number of short tasks and questions, it assesses abilities including attention, memory, language, visuospatial skills, and executive functions. The maximum score is 30 points, with a score of 26 or above generally considered normal.
The instructions given to the LLMs for each task were the same as those given to human patients. Scoring followed official guidelines and was evaluated by a practising neurologist.
ChatGPT 4o achieved the highest score on the MoCA test (26 out of 30), followed by ChatGPT 4 and Claude (25 out of 30), with Gemini 1.0 scoring lowest (16 out of 30).
All chatbots showed poor performance in visuospatial skills and executive tasks, such as the trail making task (connecting encircled numbers and letters in ascending order) and the clock drawing test (drawing a clock face showing a specific time). Gemini models failed at the delayed recall task (remembering a five word sequence).
Most other tasks, including naming, attention, language, and abstraction were performed well by all chatbots.
But in further visuospatial tests, chatbots were unable to show empathy or accurately interpret complex visual scenes. Only ChatGPT 4o succeeded in the incongruent stage of the Stroop test, which uses combinations of colour names and font colours to measure how interference affects reaction time.
These are observational findings and the authors acknowledge the essential differences between the human brain and large language models.
However, they point out that the uniform failure of all large language models in tasks requiring visual abstraction and executive function highlights a significant area of weakness that could impede their use in clinical settings.
As such, they conclude: “Not only are neurologists unlikely to be replaced by large language models any time soon, but our findings suggest that they may soon find themselves treating new, virtual patients - artificial intelligence models presenting with cognitive impairment.”
[Ends]
END
Almost all leading AI chatbots show signs of cognitive decline
Findings challenge assumption that AI will soon replace human doctors
2024-12-19
ELSE PRESS RELEASES FROM THIS DATE:
Surgeons show greater dexterity in children’s buzz wire game than other hospital staff
2024-12-19
Surgeons are quicker and more successful at completing a buzz wire game compared with other hospital staff, finds a study in the Christmas issue of The BMJ.
However, surgeons are also more likely to swear during the task, while nurses and non-clinical staff show the highest rates of audible noises of frustration.
The researchers say their study highlights the diverse skill sets across hospital staff roles, and they suggest surgical swear jars should be considered for future fundraising events.
Within a hospital, ...
Fairy tales can help teach children about healthy sleep
2024-12-19
Some traditional fairy tales and classic children’s fiction that have soothed many a child to sleep may also provide accessible and engaging ways to discuss healthy sleep with children, suggest researchers in the Christmas issue of The BMJ.
Megan Thomas and colleagues analysed four popular fairy tales that include information about the benefits of sleep and the characteristics of sleep disorder.
For example, Snow White illustrates some of the daytime consequences of poor sleep due to obstructive sleep apnoea which is common in some conditions associated with short stature. These can ...
Diarrheal diseases remain a leading killer for children under 5, adults 70+
2024-12-19
SEATTLE, Wash., Dec. 18, 2024 – New global study reports a 60% drop in global mortality from diarrheal diseases, but children and the elderly still have the highest death rates, particularly in sub-Saharan Africa and South Asia. That’s according to the latest and most comprehensive study from the Global Burden of Disease (GBD) conducted by the Institute for Health Metrics and Evaluation (IHME) and published today in The Lancet Infectious Diseases journal.
In 2021, diarrheal diseases caused 1.2 million deaths worldwide, which is a substantial drop from 2.9 million deaths recorded in 1990. The largest decrease was among children under 5 years with a 79% decline, but that age group ...
Unlocking new insights into in-plane magnetic field-induced hall effects
2024-12-19
In-plane magnetic fields are responsible for inducing anomalous Hall effect in EuCd₂Sb₂ films, report researchers from the Institute of Science Tokyo. By studying how these fields change electronic structures, the team discovered a large in-plane anomalous Hall effect. These findings pave the way for new strategies for controlling electronic transport under magnetic fields, potentially advancing applications in magnetic sensors.
The Hall effect is a fundamental phenomenon in material science. It occurs when a material carrying an electric current is exposed to a magnetic field, producing a voltage perpendicular to both the current and the magnetic field. This effect has been ...
MouseGoggles offer immersive look into neural activity
2024-12-18
ITHACA, N.Y. – In recent years, mice have entered a new arena – virtual reality – and now Cornell University researchers have built mini VR headsets to more fully immerse them.
The team’s MouseGoggles were created using low-cost, off-the-shelf components, such as smartwatch displays and tiny lenses, and track the mouse’s eye movements and changes in pupil size.
The technology has the potential to help reveal the neural activity that informs spatial navigation and memory function, giving researchers new insights into disorders such as Alzheimer’s disease and its potential treatments.
The research was led by Chris Schaffer, professor of biomedical ...
For optimal marathon performance, check training plan, gear, nutrition, weather — and air quality?
2024-12-18
PROVIDENCE, R.I. [Brown University] — When preparing for a marathon, runners don’t usually think much about air quality. But maybe they should, according to findings from a new study by researchers at the Brown University School of Public Health.
When the research team assessed the association between fine particulate matter in the air and marathon finish times, they found that greater race-day pollution is associated with slower average marathon finish times. Their findings were published in the journal Sports Medicine.
The difference seems small, said study author Elvira Fleury, who led the research while enrolled as a graduate student at Brown, but ...
Researchers find new way to 'starve' prostate cancer tumors at the cellular level
2024-12-18
INDIANAPOLIS — New research by a team of Indiana University School of Medicine scientists and their collaborators has uncovered a novel vulnerability in prostate cancer animal models that starves prostate tumors of critical nutrients and stunts their growth, which could lead to the development of new treatments for the deadly disease.
Led by IU School of Medicine's Kirk Staschke, PhD, assistant research professor of biochemistry and molecular biology, and Ronald C. Wek, PhD, Showalter Professor of Biochemistry, the study was recently published in Science Signaling.
Prostate cancer is a ...
Are AI chatbots helping the planet—or repeating old biases?
2024-12-18
AI chatbots may seem like neutral tools, but a new study from UBC researchers suggests they often contain biases that could shape environmental discourse in unhelpful ways.
The research team examined how four leading AI chatbots respond to questions about environmental issues—and the findings are surprising.
“It was striking how narrow-minded AI models were in discussing environmental challenges,” said lead researcher Hamish van der Ven, an assistant professor in the faculty ...
Q&A: New AI training method lets systems better adjust to users’ values
2024-12-18
Ask most major artificial intelligence chatbots, such as OpenAI’s ChatGPT, to say something cruel or inappropriate and the system will say it wants to keep things “respectful.” These systems, trained on the content of a profusely disrespectful internet, learned what constitutes respect through human training. The standard method, called reinforcement learning from human feedback, or RLHF, has people compare two outputs from the systems and select whichever is better. It’s used to improve the quality of responses — ...
New study unlocks parental identity with new lens on education spending
2024-12-18
How much parents spend on their children’s education has a big impact on family well-being and a country’s overall development. While past studies suggested that ethnic and racial backgrounds affect this spending, they lacked solid experimental proof – making their findings less reliable.
A new study led by Lingjiang Lora Tu, Ph.D., from Baylor University’s Hankamer School of Business examines the psychological factors driving parental investment in education, highlighting how a parent’s self-view – whether they see themselves as independent or connected to others – shapes their spending patterns. ...
LAST 30 PRESS RELEASES:
AI-generated voices which sound like you are perceived as more trustworthy and likeable, with implications for deep-fakes and manipulation
The cacao tree species (Theobroma cacao L.), from which we get chocolate, is likely about 7.5 million years old, with chloroplast genomes indicating that the current known diversity diversified during
After sexual misconduct accusations, scholars’ work is cited less
Menopause symptoms associated with future memory and neuropsychiatric problems
Findings may advance understanding of infertility in mothers
Engineered cartilage from nasal septum cells helps treat complex knee injuries
Damaged but not defeated: Bacteria use nano-spearguns to retaliate against attacks
Among older women, hormone therapy linked to tau accumulation, a hallmark of Alzheimer’s disease
Scientists catch water molecules flipping before splitting
New antibodies show potential to defeat all SARS-CoV-2 variants
Mental health may be linked to how confident we are of our decisions
Research identifies key antibodies for development of broadly protective norovirus vaccine
NHS urged to offer single pill to all over-50s to prevent heart attacks and strokes
Australian researchers call for greater diversity in genomics
The pot is already boiling for 2% of the world’s amphibians: new study
A new way to predict cancer's spread? Scientists look at 'stickiness' of tumor cells
Prehistoric bone tool ‘factory’ hints at early development of abstract reasoning in human ancestors
Study: Vaping does not help US tobacco smokers quit
Insect populations are declining — and that is not a good thing
Scientists discover genes to grow bigger tomatoes and eggplants
Effects of combining coronary calcium score with treatment on plaque progression in familial coronary artery disease
Cancer screening 3 years after the onset of the COVID-19 pandemic
Trajectories of sleep duration, sleep onset timing, and continuous glucose monitoring in adults
Sports gambling and drinking behaviors over time
For better quantum sensing, go with the flow
Toxic environmental pollutants linked to faster aging and health risks in US adults
Jerome Morris voted AERA President-Elect; key members elected to AERA Council
Study reveals how agave plants survive extreme droughts
Aligning Science Across Parkinson’s (ASAP) launches a second funding opportunity to accelerate novel tool development to advance Parkinson's disease research
New study: Eating mangos daily shown to improve insulin sensitivity and blood glucose control
[Press-News.org] Almost all leading AI chatbots show signs of cognitive declineFindings challenge assumption that AI will soon replace human doctors