PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Almost all leading AI chatbots show signs of cognitive decline

Findings challenge assumption that AI will soon replace human doctors

2024-12-19
(Press-News.org) Almost all leading large language models or “chatbots” show signs of mild cognitive impairment in tests widely used to spot early signs of dementia, finds a study in the Christmas issue of The BMJ.

The results also show that “older” versions of chatbots, like older patients, tend to perform worse on the tests. The authors say these findings “challenge the assumption that artificial intelligence will soon replace human doctors.”

Huge advances in the field of artificial intelligence have led to a flurry of excited and fearful speculation as to whether chatbots can surpass human physicians.

Several studies have shown large language models (LLMs) to be remarkably adept at a range of medical diagnostic tasks, but their susceptibility to human impairments such as cognitive decline have not yet been examined.

To fill this knowledge gap, researchers assessed the cognitive abilities of the leading, publicly available LLMs - ChatGPT versions 4 and 4o (developed by OpenAI), Claude 3.5 “Sonnet” (developed by Anthropic), and Gemini versions 1 and 1.5 (developed by Alphabet) - using the Montreal Cognitive Assessment (MoCA) test.

The MoCA test is widely used to detect cognitive impairment and early signs of dementia, usually in older adults. Through a number of short tasks and questions, it assesses abilities including attention, memory, language, visuospatial skills, and executive functions. The maximum score is 30 points, with a score of 26 or above generally considered normal.

The instructions given to the LLMs for each task were the same as those given to human patients. Scoring followed official guidelines and was evaluated by a practising neurologist. 

ChatGPT 4o achieved the highest score on the MoCA test (26 out of 30), followed by ChatGPT 4 and Claude (25 out of 30), with Gemini 1.0 scoring lowest (16 out of 30). 

All chatbots showed poor performance in visuospatial skills and executive tasks, such as the trail making task (connecting encircled numbers and letters in ascending order) and the clock drawing test (drawing a clock face showing a specific time). Gemini models failed at the delayed recall task (remembering a five word sequence).

Most other tasks, including naming, attention, language, and abstraction were performed well by all chatbots.

But in further visuospatial tests, chatbots were unable to show empathy or accurately interpret complex visual scenes. Only ChatGPT 4o succeeded in the incongruent stage of the Stroop test, which uses combinations of colour names and font colours to measure how interference affects reaction time.

These are observational findings and the authors acknowledge the essential differences between the human brain and large language models.

However, they point out that the uniform failure of all large language models in tasks requiring visual abstraction and executive function highlights a significant area of weakness that could impede their use in clinical settings. 

As such, they conclude: “Not only are neurologists unlikely to be replaced by large language models any time soon, but our findings suggest that they may soon find themselves treating new, virtual patients - artificial intelligence models presenting with cognitive impairment.”

[Ends]

END


ELSE PRESS RELEASES FROM THIS DATE:

Surgeons show greater dexterity in children’s buzz wire game than other hospital staff

2024-12-19
Surgeons are quicker and more successful at completing a buzz wire game compared with other hospital staff, finds a study in the Christmas issue of The BMJ. However, surgeons are also more likely to swear during the task, while nurses and non-clinical staff show the highest rates of audible noises of frustration. The researchers say their study highlights the diverse skill sets across hospital staff roles, and they suggest surgical swear jars should be considered for future fundraising events. Within a hospital, ...

Fairy tales can help teach children about healthy sleep

2024-12-19
Some traditional fairy tales and classic children’s fiction that have soothed many a child to sleep may also provide accessible and engaging ways to discuss healthy sleep with children, suggest researchers in the Christmas issue of The BMJ. Megan Thomas and colleagues analysed four popular fairy tales that include information about the benefits of sleep and the characteristics of sleep disorder. For example, Snow White illustrates some of the daytime consequences of poor sleep due to obstructive sleep apnoea which is common in some conditions associated with short stature. These can ...

Diarrheal diseases remain a leading killer for children under 5, adults 70+

2024-12-19
SEATTLE, Wash., Dec. 18, 2024 – New global study reports a 60% drop in global mortality from diarrheal diseases, but children and the elderly still have the highest death rates, particularly in sub-Saharan Africa and South Asia. That’s according to the latest and most comprehensive study from the Global Burden of Disease (GBD) conducted by the Institute for Health Metrics and Evaluation (IHME) and published today in The Lancet Infectious Diseases journal. In 2021, diarrheal diseases caused 1.2 million deaths worldwide, which is a substantial drop from 2.9 million deaths recorded in 1990. The largest decrease was among children under 5 years with a 79% decline, but that age group ...

Unlocking new insights into in-plane magnetic field-induced hall effects

Unlocking new insights into in-plane magnetic field-induced hall effects
2024-12-19
In-plane magnetic fields are responsible for inducing anomalous Hall effect in EuCd₂Sb₂ films, report researchers from the Institute of Science Tokyo. By studying how these fields change electronic structures, the team discovered a large in-plane anomalous Hall effect. These findings pave the way for new strategies for controlling electronic transport under magnetic fields, potentially advancing applications in magnetic sensors. The Hall effect is a fundamental phenomenon in material science. It occurs when a material carrying an electric current is exposed to a magnetic field, producing a voltage perpendicular to both the current and the magnetic field. This effect has been ...

MouseGoggles offer immersive look into neural activity

2024-12-18
ITHACA, N.Y. – In recent years, mice have entered a new arena – virtual reality – and now Cornell University researchers have built mini VR headsets to more fully immerse them. The team’s MouseGoggles were created using low-cost, off-the-shelf components, such as smartwatch displays and tiny lenses, and track the mouse’s eye movements and changes in pupil size. The technology has the potential to help reveal the neural activity that informs spatial navigation and memory function, giving researchers new insights into disorders such as Alzheimer’s disease and its potential treatments. The research was led by Chris Schaffer, professor of biomedical ...

For optimal marathon performance, check training plan, gear, nutrition, weather — and air quality?

2024-12-18
PROVIDENCE, R.I. [Brown University] — When preparing for a marathon, runners don’t usually think much about air quality. But maybe they should, according to findings from a new study by researchers at the Brown University School of Public Health.  When the research team assessed the association between fine particulate matter in the air and marathon finish times, they found that greater race-day pollution is associated with slower average marathon finish times. Their findings were published in the journal Sports Medicine. The difference seems small, said study author Elvira Fleury, who led the research while enrolled as a graduate student at Brown, but ...

Researchers find new way to 'starve' prostate cancer tumors at the cellular level

Researchers find new way to starve prostate cancer tumors at the cellular level
2024-12-18
INDIANAPOLIS — New research by a team of Indiana University School of Medicine scientists and their collaborators has uncovered a novel vulnerability in prostate cancer animal models that starves prostate tumors of critical nutrients and stunts their growth, which could lead to the development of new treatments for the deadly disease. Led by IU School of Medicine's Kirk Staschke, PhD, assistant research professor of biochemistry and molecular biology, and Ronald C. Wek, PhD, Showalter Professor of Biochemistry, the study was recently published in Science Signaling. Prostate cancer is a ...

Are AI chatbots helping the planet—or repeating old biases?

2024-12-18
AI chatbots may seem like neutral tools, but a new study from UBC researchers suggests they often contain biases that could shape environmental discourse in unhelpful ways. The research team examined how four leading AI chatbots respond to questions about environmental issues—and the findings are surprising. “It was striking how narrow-minded AI models were in discussing environmental challenges,” said lead researcher Hamish van der Ven, an assistant professor in the faculty ...

Q&A: New AI training method lets systems better adjust to users’ values

2024-12-18
Ask most major artificial intelligence chatbots, such as OpenAI’s ChatGPT, to say something cruel or inappropriate and the system will say it wants to keep things “respectful.” These systems, trained on the content of a profusely disrespectful internet, learned what constitutes respect through human training. The standard method, called reinforcement learning from human feedback, or RLHF, has people compare two outputs from the systems and select whichever is better. It’s used to improve the quality of responses — ...

New study unlocks parental identity with new lens on education spending

2024-12-18
How much parents spend on their children’s education has a big impact on family well-being and a country’s overall development. While past studies suggested that ethnic and racial backgrounds affect this spending, they lacked solid experimental proof – making their findings less reliable. A new study led by Lingjiang Lora Tu, Ph.D., from Baylor University’s Hankamer School of Business examines the psychological factors driving parental investment in education, highlighting how a parent’s self-view – whether they see themselves as independent or connected to others – shapes their spending patterns. ...

LAST 30 PRESS RELEASES:

No evidence that maternal sickness during pregnancy causes autism

Healthy gut bacteria that feed on sugar analyzed for the first time

240-year-old drug could save UK National Health Service £100 million a year treating common heart rhythm disorder

Detections of poliovirus in sewage samples require enhanced routine and catch-up vaccination and increased surveillance, according to ECDC report

Scientists unlock ice-repelling secrets of polar bear fur for sustainable anti-freezing solutions 

Ear muscle we thought humans didn’t use — except for wiggling our ears — actually activates when people listen hard

COVID-19 pandemic drove significant rise in patients choosing to leave ERs before medically recommended

Burn grasslands to maintain them: What is good for biodiversity?

Ventilation in hospitals could cause viruses to spread further

New study finds high concentrations of plastics in the placentae of infants born prematurely

New robotic surgical systems revolutionizing patient care

New MSK research a step toward off-the-shelf CAR T cell therapy for cancer

UTEP professor wins prestigious research award from American Psychological Association

New national study finds homicide and suicide is the #1 cause of maternal death in the U.S.

Women’s pelvic tissue tears during childbirth unstudied, until now

Earth scientists study Sikkim flood in India to help others prepare for similar disasters

Leveraging data to improve health equity and care

Why you shouldn’t scratch an itchy rash: New study explains

Linking citation and retraction data aids in responsible research evaluation

Antibody treatment prevents severe bird flu in monkeys

Polar bear energetic model reveals drivers of polar bear population decline

Socioeconomic and political stability bolstered wild tiger recovery in India

Scratching an itch promotes antibacterial inflammation

Drivers, causes and impacts of the 2023 Sikkim flood in India

Most engineered human cells created for studying disease

Polar bear population decline the direct result of extended ‘energy deficit’ due to lack of food

Lifecycle Journal launches: A new vision for scholarly publishing

Ancient DNA analyses bring to life the 11,000-year intertwined genomic history of sheep and humans

Climate change increases risk of successive natural hazards in the Himalayas

From bowling balls to hip joints: Chemists create recyclable alternative to durable plastics

[Press-News.org] Almost all leading AI chatbots show signs of cognitive decline
Findings challenge assumption that AI will soon replace human doctors