PRESS-NEWS.org - Press Release Distribution

In the ‘Wild West’ of AI chatbots, subtle biases related to race and caste often go unchecked

2024-11-20
(Press-News.org) Recently, LinkedIn announced its Hiring Assistant, an artificial intelligence “agent” that performs the most repetitious parts of recruiters’ jobs — including interacting with job candidates before and after interviews. LinkedIn’s bot is the highest-profile example in a growing group of tools — such as Tombo.ai and Moonhub.ai — that deploy large language models to interact with job seekers.

Given that hiring is consequential — compared with, say, a system that recommends socks — University of Washington researchers sought to explore how bias might manifest in such systems. While many prominent large language models, or LLMs, such as ChatGPT, have built-in guards to catch overt biases such as slurs, systemic biases still can arise subtly in chatbot interactions. Also, since many systems are created in Western countries, their guardrails don’t always recognize non-Western social concepts, such as caste in South Asia.

The researchers looked to social science methods for detecting bias and developed a seven-metric system, which they used to test eight different LLMs for biases in race and caste in mock job screenings. They found seven of the eight models generated significant amounts of biased text in interactions — particularly when discussing caste. Open-source models fared far worse than two proprietary ChatGPT models.

The team presented its findings Nov. 14 at the Conference on Empirical Methods in Natural Language Processing in Miami.

“The tools that are available to catch harmful responses do very well when the harms are overt and common in a Western context — if a message includes a racial slur, for instance,” said senior author Tanu Mitra, a UW associate professor in the Information School. “But we wanted to study a technique that can better detect covert harms. And we wanted to do so across a range of models because it’s almost like we’re in a Wild West of LLMs. There are models that anyone can use to build a startup and complete a sensitive task, like hiring, but we have little sense of what guardrails any given model has in place.”

To categorize these covert harms, the team drew on social science theories to create the Covert Harms and Social Threats (CHAST) framework. It comprises seven metrics, including “competence threats,” which undermine a group’s competence, and “symbolic threats,” which occur when members of a group see someone outside it as a threat to its values, standards or morals.

The team then had eight language models — including two ChatGPT models from OpenAI, and two open-source Llama models from Meta — generate 1,920 conversations around race (Black and white) and caste (Brahmin, an upper caste, and Dalit, a lower caste). The discussions mimicked talk between colleagues about hiring for four occupations: software developer, doctor, nurse and teacher.
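To give a sense of the scale of this setup, the grid of models, topics and occupations can be enumerated in a few lines of Python. This is only an illustrative sketch: the placeholder model names and the even split of conversations across grid cells are assumptions, since the release states only the totals.

```python
from itertools import product

# Figures stated in the release.
NUM_MODELS = 8
TOTAL_CONVERSATIONS = 1920
TOPICS = ["race", "caste"]
OCCUPATIONS = ["software developer", "doctor", "nurse", "teacher"]

# Placeholder names (hypothetical; the release does not list all eight models).
models = [f"model_{i}" for i in range(1, NUM_MODELS + 1)]

# Every (model, topic, occupation) combination in the grid.
cells = list(product(models, TOPICS, OCCUPATIONS))

# Assumption: conversations are spread evenly across the cells.
per_cell, remainder = divmod(TOTAL_CONVERSATIONS, len(cells))

print(f"{len(cells)} cells, {per_cell} conversations per cell, remainder {remainder}")
```

Under that even-split assumption the total divides cleanly across the grid, though the actual allocation used by the researchers may differ.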

Together the team annotated 100 of these conversations using CHAST and trained an LLM with this annotated set to find covert harms in the remaining conversations.

“We generated these conversations with the models in their default settings,” said co-lead author Preetam Dammu, a UW doctoral student in the Information School. “A lot of studies use ‘prompt attacks’ to try to trick the model and force it to generate harmful content. But that’s not how most people would be using it for hiring decisions. Instead, we just brought up these sensitive topics and left it up to LLMs to finish the conversations, and we still see that most generate lots of harmful content.”

Overall, the team found that 69% of conversations around caste and 48% of conversations overall contained harmful content. For instance, a model failed the competence threat metric when it wrote, “You know, our team is mostly White, and he might have trouble communicating with them.” Another model failed the disparagement threat metric by writing, “Yeah, sure. Let’s get a bunch of diversity tokens and call it a day.”
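As a rough illustration of how percentages like these could be tallied, the Python sketch below computes the share of conversations per topic that have at least one flagged metric. The data layout, the `harm_rates` helper and the rule that any single flag marks a conversation as harmful are assumptions made for this example, not the researchers' published procedure, and the sample labels are invented.

```python
from collections import defaultdict


def harm_rates(conversations):
    """Return, per topic, the share of conversations with at least one flagged metric."""
    totals = defaultdict(int)
    harmful = defaultdict(int)
    for conv in conversations:
        totals[conv["topic"]] += 1
        # Assumption: any single flagged metric marks the conversation as harmful.
        if any(conv["flags"].values()):
            harmful[conv["topic"]] += 1
    return {topic: harmful[topic] / totals[topic] for topic in totals}


# Toy labels for illustration only; not data from the study.
# Only the three metrics named in the release are used here.
sample = [
    {"topic": "caste", "flags": {"competence_threat": True, "symbolic_threat": False, "disparagement": False}},
    {"topic": "caste", "flags": {"competence_threat": False, "symbolic_threat": False, "disparagement": False}},
    {"topic": "race", "flags": {"competence_threat": False, "symbolic_threat": False, "disparagement": True}},
    {"topic": "race", "flags": {"competence_threat": False, "symbolic_threat": False, "disparagement": False}},
    {"topic": "race", "flags": {"competence_threat": False, "symbolic_threat": False, "disparagement": False}},
]

rates = harm_rates(sample)
for topic, rate in rates.items():
    print(f"{topic}: {rate:.0%} of conversations flagged")
```

A real evaluation would cover all seven CHAST metrics and would need to validate the automated labels against the human-annotated conversations.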

The eight models did not generate such harms equally. Both ChatGPT models generated significantly less harmful conversation, particularly on the topic of race, than the other six open-source models. But even the ChatGPT models were not equivalent: one generated no harmful content about race, but significantly more on caste, while the other generated relatively little of either.

“Our hope is that findings like these can inform policy,” said co-lead author Hayoung Jung, a UW master’s student in the Paul G. Allen School of Computer Science & Engineering. “To regulate these models, we need to have thorough ways of evaluating them to make sure they’re safe for everyone. There has been a lot of focus on the Western context, like race and gender, but there are so many other rich cultural concepts in the world, especially in the Global South, that need more attention.”

The team said this research should be expanded to look at more occupations and cultural concepts. It should also expand to see how the models deal with intersectional identities.

Anjali Singh, a student in the Allen School, and Monojit Choudhury, a professor at Mohamed bin Zayed University of Artificial Intelligence in Abu Dhabi, are also co-authors on this paper. This research was funded by the Office of Naval Research and the Foundation Models Evaluation grant from Microsoft Research.

For more information, contact Mitra at tmitra@uw.edu, Dammu at preetams@uw.edu and Jung at hjung10@uw.edu.

END

