In the ‘Wild West’ of AI chatbots, subtle biases related to race and caste often go unchecked

2024-11-20

(Press-News.org) Recently, LinkedIn announced its Hiring Assistant, an artificial intelligence “agent” that performs the most repetitious parts of recruiters’ jobs — including interacting with job candidates before and after interviews. LinkedIn’s bot is the highest-profile example in a growing group of tools — such as Tombo.ai and Moonhub.ai — that deploy large language models to interact with job seekers.

Because hiring decisions carry far higher stakes than, say, sock recommendations, University of Washington researchers sought to explore how bias might manifest in such systems. While many prominent large language models, or LLMs, such as ChatGPT, have built-in guards to catch overt biases such as slurs, systemic biases can still arise subtly in chatbot interactions. Also, since many systems are created in Western countries, their guardrails don’t always recognize non-Western social concepts, such as caste in South Asia.

The researchers looked to social science methods for detecting bias and developed a seven-metric system, which they used to test eight different LLMs for biases in race and caste in mock job screenings. They found seven of the eight models generated significant amounts of biased text in interactions — particularly when discussing caste. Open-source models fared far worse than two proprietary ChatGPT models.

The team presented its findings Nov. 14 at the Conference on Empirical Methods in Natural Language Processing in Miami.

“The tools that are available to catch harmful responses do very well when the harms are overt and common in a Western context — if a message includes a racial slur, for instance,” said senior author Tanu Mitra, a UW associate professor in the Information School. “But we wanted to study a technique that can better detect covert harms. And we wanted to do so across a range of models because it’s almost like we’re in a Wild West of LLMs. There are models that anyone can use to build a startup and complete a sensitive task, like hiring, but we have little sense of what guardrails any given model has in place.”

To categorize these covert harms, the team drew on social science theories to create the Covert Harms and Social Threats (CHAST) framework. It comprises seven metrics, including “competence threats,” which undermine a group’s competence, and “symbolic threats,” which occur when members of a group see someone outside it as a threat to its values, standards or morals.

The team then had eight language models — including two ChatGPT models from OpenAI, and two open-source Llama models from Meta — generate 1,920 conversations around race (Black and white) and caste (Brahmin, an upper caste, and Dalit, a lower caste). The discussions mimicked talk between colleagues about hiring for four occupations: software developer, doctor, nurse and teacher.

Together the team annotated 100 of these conversations using CHAST and trained an LLM with this annotated set to find covert harms in the remaining conversations.

“We generated these conversations with the models in their default settings,” said co-lead author Preetam Dammu, a UW doctoral student in the Information School. “A lot of studies use ‘prompt attacks’ to try to trick the model and force it to generate harmful content. But that’s not how most people would be using it for hiring decisions. Instead, we just brought up these sensitive topics and left it up to LLMs to finish the conversations, and we still see that most generate lots of harmful content.”

Overall, the team found that 69% of conversations around caste and 48% of conversations overall contained harmful content. For instance, a model failed the competence threat metric when it wrote, “You know, our team is mostly White, and he might have trouble communicating with them.” Another model failed the disparagement threat metric by writing, “Yeah, sure. Let’s get a bunch of diversity tokens and call it a day.”

The eight models did not generate such harms equally. Both ChatGPT models generated significantly fewer harmful conversations, particularly on the topic of race, than the six open-source models. But even the ChatGPT models were not equivalent: one generated no harmful content about race, but significantly more on caste, while the other generated relatively little of either.

“Our hope is that findings like these can inform policy,” said co-lead author Hayoung Jung, a UW master’s student in the Paul G. Allen School of Computer Science & Engineering. “To regulate these models, we need to have thorough ways of evaluating them to make sure they’re safe for everyone. There has been a lot of focus on the Western context, like race and gender, but there are so many other rich cultural concepts in the world, especially in the Global South, that need more attention.”

The team said this research should be expanded to cover more occupations and cultural concepts, and to examine how the models handle intersectional identities.

Anjali Singh, a student in the Allen School, and Monojit Choudhury, a professor at Mohamed bin Zayed University of Artificial Intelligence in Abu Dhabi, are also co-authors on this paper. This research was funded by the Office of Naval Research and the Foundation Models Evaluation grant from Microsoft Research.

For more information, contact Mitra at tmitra@uw.edu, Dammu at preetams@uw.edu and Jung at hjung10@uw.edu.

END
