Study: AI models fail to reproduce human judgements about rule violations

Models trained using common data-collection techniques judge rule violations more harshly than humans would, researchers report

2023-05-10

(Press-News.org)

In an effort to improve fairness or reduce backlogs, machine-learning models are sometimes designed to mimic human decision making, such as deciding whether social media posts violate toxic content policies.

But researchers from MIT and elsewhere have found that these models often do not replicate human decisions about rule violations. If models are not trained with the right data, they are likely to make different, often harsher judgements than humans would.

In this case, the “right” data are those that have been labeled by humans who were explicitly asked whether items defy a certain rule. Training involves showing a machine-learning model millions of examples of this “normative data” so it can learn a task.

But data used to train machine-learning models are typically labeled descriptively — meaning humans are asked to identify factual features, such as, say, the presence of fried food in a photo. If “descriptive data” are used to train models that judge rule violations, such as whether a meal violates a school policy that prohibits fried food, the models tend to over-predict rule violations.

This drop in accuracy could have serious implications in the real world. For instance, if a descriptive model is used to make decisions about whether an individual is likely to reoffend, the researchers’ findings suggest it may cast stricter judgements than a human would, which could lead to higher bail amounts or longer criminal sentences.

“I think most artificial intelligence/machine-learning researchers assume that the human judgements in data and labels are biased, but this result is saying something worse. These models are not even reproducing already-biased human judgments because the data they’re being trained on has a flaw: Humans would label the features of images and text differently if they knew those features would be used for a judgment. This has huge ramifications for machine learning systems in human processes,” says Marzyeh Ghassemi, an assistant professor and head of the Healthy ML Group in the Computer Science and Artificial Intelligence Laboratory (CSAIL).

Ghassemi is senior author of a new paper detailing these findings, which will be published in Science Advances. Joining her on the paper are lead author Aparna Balagopalan, an electrical engineering and computer science graduate student; David Madras, a graduate student at the University of Toronto; David H. Yang, a former graduate student who is now co-founder of ML Estimation; Dylan Hadfield-Menell, an MIT assistant professor; and Gillian K. Hadfield, Schwartz Reisman Chair in Technology and Society and professor of law at the University of Toronto.

Labeling discrepancy

This study grew out of a different project that explored how a machine-learning model can justify its predictions. As they gathered data for that study, the researchers noticed that humans sometimes give different answers depending on whether they are asked to provide descriptive or normative labels about the same data.

To gather descriptive labels, researchers ask labelers to identify factual features — does this text contain obscene language? To gather normative labels, researchers give labelers a rule and ask if the data violates that rule — does this text violate the platform’s explicit language policy?

Surprised by this finding, the researchers launched a user study to dig deeper. They gathered four datasets to mimic different policies, such as a dataset of dog images that could be in violation of an apartment’s rule against aggressive breeds. Then they asked groups of participants to provide descriptive or normative labels.

In each case, the descriptive labelers were asked to indicate whether three factual features were present in the image or text, such as whether the dog appears aggressive. Their responses were then used to craft judgements. (If a user said a photo contained an aggressive dog, then the policy was violated.) The labelers did not know the pet policy. On the other hand, normative labelers were given the policy prohibiting aggressive dogs, and then asked whether it had been violated by each image, and why.

The researchers found that humans were significantly more likely to label an object as a violation in the descriptive setting. The disparity, which they computed as the average absolute difference in labels, ranged from 8 percent on a dataset of images used to judge dress code violations to 20 percent for the dog images.
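As a rough illustration of the comparison described above, the sketch below turns descriptive feature labels into a violation judgement and then measures how far those judgements sit from normative labels. The function names, the rule that any flagged feature counts as a violation, and the toy labels are assumptions made for illustration; they are not the researchers' code, data, or exact metric.

```python
from typing import Sequence


def descriptive_judgement(features: Sequence[bool]) -> bool:
    """Assumed rule: the policy counts as violated if any flagged feature is present."""
    return any(features)


def label_disparity(
    descriptive: Sequence[Sequence[bool]],
    normative: Sequence[bool],
) -> float:
    """Mean absolute difference between derived descriptive judgements and normative labels."""
    if len(descriptive) != len(normative):
        raise ValueError("both label sets must cover the same items")
    derived = [descriptive_judgement(f) for f in descriptive]
    return sum(abs(int(d) - int(n)) for d, n in zip(derived, normative)) / len(normative)


# Hypothetical labels for four items, three factual features each.
descriptive_labels = [
    (True, False, False),
    (False, False, False),
    (False, True, False),
    (False, False, False),
]
# Hypothetical answers from labelers who were shown the policy itself.
normative_labels = [True, False, False, False]

disparity = label_disparity(descriptive_labels, normative_labels)
print(f"{disparity:.0%}")  # prints "25%"
```

In this toy case the third item is flagged descriptively but not judged a violation normatively, which is the direction of disagreement the study reports.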

“While we didn’t explicitly test why this happens, one hypothesis is that maybe how people think about rule violations is different from how they think about descriptive data. Generally, normative decisions are more lenient,” Balagopalan says.

Yet data are usually gathered with descriptive labels to train a model for a particular machine-learning task. These data are often repurposed later to train different models that make normative judgements, such as deciding whether a rule has been violated.

Training troubles

To study the potential impacts of repurposing descriptive data, the researchers trained two models to judge rule violations using one of their four data settings. They trained one model using descriptive data and the other using normative data, and then compared their performance.

They found that if descriptive data are used to train a model, it will underperform a model trained to perform the same judgements using normative data. Specifically, the descriptive model is more likely to misclassify inputs by falsely predicting a rule violation. And the descriptive model’s accuracy was even lower when classifying objects that human labelers disagreed about.
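One way to picture "falsely predicting a rule violation" is as a false positive rate measured against normative labels. The sketch below is a minimal example under that assumption; the function name and the toy predictions are hypothetical and do not reproduce the study's models or results.

```python
from typing import Sequence


def false_positive_rate(predictions: Sequence[bool], truth: Sequence[bool]) -> float:
    """Share of non-violating items (per normative labels) that were flagged as violations."""
    if len(predictions) != len(truth):
        raise ValueError("predictions and truth must cover the same items")
    flagged_negatives = [p for p, t in zip(predictions, truth) if not t]
    if not flagged_negatives:
        raise ValueError("no non-violating items to evaluate")
    return sum(flagged_negatives) / len(flagged_negatives)


# Hypothetical normative labels for six items (True means the rule was violated).
normative_truth = [False, False, False, False, True, True]
# Hypothetical predictions from a model trained on descriptive labels.
descriptive_model_preds = [True, True, False, False, True, True]
# Hypothetical predictions from a model trained on normative labels.
normative_model_preds = [True, False, False, False, True, True]

print(false_positive_rate(descriptive_model_preds, normative_truth))  # 0.5
print(false_positive_rate(normative_model_preds, normative_truth))    # 0.25
```

A higher value for the descriptively trained model corresponds to the over-prediction of violations described above.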

“This shows that the data do really matter. It is important to match the training context to the deployment context if you are training models to detect if a rule has been violated,” Balagopalan says.

It can be very difficult for users to determine how data have been gathered; this information can be buried in the appendix of a research paper or not revealed by a private company, Ghassemi says.

Improving dataset transparency is one way this problem could be mitigated. If researchers know how data were gathered, then they know how those data should be used. Another possible strategy is to fine-tune a descriptively trained model on a small amount of normative data. This idea, known as transfer learning, is something the researchers want to explore in future work.

They also want to conduct a similar study with expert labelers, like doctors or lawyers, to see if it leads to the same label disparity.

“The way to fix this is to transparently acknowledge that if we want to reproduce human judgment, we must only use data that were collected in that setting. Otherwise, we are going to end up with systems that are going to have extremely harsh moderations, much harsher than what humans would do. Humans would see nuance or make another distinction, whereas these models don’t,” Ghassemi says.

This research was funded, in part, by the Schwartz Reisman Institute for Technology and Society, Microsoft Research, the Vector Institute, and a Canada Research Council Chair.

###

Written by Adam Zewe, MIT News Office
