PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

UCSB's Haewon Jeong receives an NSF Early CAREER Award

UCSB's Haewon Jeong receives an NSF Early CAREER Award
2024-06-18
(Press-News.org) (Santa Barbara, Calif.) — Haewon Jeong, an assistant professor in UC Santa Barbara’s Electrical and Computer Engineering (ECE) Department, experienced a pivotal moment in her academic career when she was a postdoctoral fellow at Harvard University. She was investigating how machine learning (ML) models can discriminate against students in education-related applications. Discrimination, or bias, occurs when a model used to train algorithms makes incorrect predictions that systematically disadvantage a group of people. Bias in ML models can lead to inaccurate or unfair predictions, which can have serious consequences in fields such as healthcare, finance and criminal justice. For example, an unfair model that relies on historical data reflecting systematic social and economic inequities could result in mortgage applications being rejected more often for women than for men, or skin cancer being detected more for white patients than for Black patients, who might be denied treatment.

“I was working with education-related datasets collected by my collaborator, and I realized there was a lot of missing data,” Jeong recalled. 

Concerned about adding to the bias in the data, she searched for research papers on how to avoid adding more bias when substituting missing entries with new values, a process called imputation. That was when she made a shocking discovery.

“No one had studied the fairness aspect of imputation before, which was surprising because missing data is such a prevalent problem in the real world,” she said. “Nearly all of the research at the time centered around developing better training algorithms to eliminate bias, but not many people thought about addressing the bias that happened during data collection.” 

That realization provided the framework for Jeong’s novel approach to identifying and mitigating the ever-evolving ethical challenges presented by AI-powered systems, launching her study of how various steps in the data-preparation pipeline can introduce bias or fairness. 

“People in my field say, ‘Bad data in, bad algorithm out. Biased data in, biased algorithm out,’” said Jeong, “but I have proposed that if we focus on cleaning the bad data, we could reduce the bias from the start.” 

As a testament to the potential impact of her proposed research, the National Science Foundation (NSF) has granted Jeong an Early CAREER Award, the federal agency’s most highly regarded honor for junior faculty. She said that the five-year, $558,000 grant provided a significant boost to her research group and to her, personally. 

“I am honored and thrilled,” said Jeong. “This award has made me more confident that the direction of my research is meaningful and supported by the NSF.”

Her project, “From Dirty Data to Fair Prediction: Data Preparation Framework for End-to-End Equitable Machine Learning,” targets the data-preparation pipeline as a strategic opportunity for eliminating unwanted bias and bolstering desirable ethical objectives. Typically, Jeong said, data is “dirty” — missing values and entries, and including varying formats that require standardization. There are many steps required to prepare, or clean, the data, and underlying disparities can encode significant inaccuracies along the way. To mitigate the bias early in the process, Jeong has proposed a three-step process to insert fairness in, when addressing missing values, encoding data and balancing data. 

“Right now, AI algorithms learn from examples, and algorithmic interventions can only do so much with the given data,” said Jeong, who earned her Ph.D. in ECE from Carnegie Mellon University. “I propose that supplying better examples and data to the algorithm will result in more fair and ethical learning.”

Datasets are often missing values. For example, when conducting a survey, some questions are not answered completely or are left empty. Before feeding any dataset into an ML algorithm, researchers have two main options for handling missing data: they can exclude the entries that contain missing data, or they can fill in the missing data with an estimate based on the other available information. Jeong’s prior work showed that both methods significantly increased bias. She was the first researcher to publish a paper calling attention to that problem. 

“In that paper, we proposed a simple algorithm to deal with bias created through imputation, but it was not very efficient,” she said. “In this project, I want to dive deeper into the problem to investigate if there are more efficient ways to perform data imputation and consider fairness at the same time.”

The second thread that she will address is data encoding, which is the process of changing raw data into a numerical format than an algorithm can read and interpret. Returning to the survey example, some answers may range from zero to five, while others include text fields. Data encoding involves converting the words into numbers. Encoding also enables computers to process and transmit information that is not numerically based, such as text, audio, and video. 

“The process of encoding text is already known to cause gender bias and perpetuate social stereotypes, but it’s unclear how these biases flow through the subsequent steps,” explained Jeong, who will rely on her training in information theory to address data encoding. “By looking at it from an information-theory perspective, we hope to develop a fairer algorithm to preserve useful information and suppress information related to bias.” 

The third step involves increasing fairness when balancing data, which is the process of ensuring that a ML dataset represents the real-world population from which it is drawn. Having an uneven number of observations among different groups significantly impacts an ML models predictive performance and fairness. This particular thrust is driven by an experiment with education data that Jeong performed as a postdoctoral fellow. In the project, she grouped students into Black/Hispanic/Native American (BHN) and White/Asian (WA). The data was imbalanced, and a majority of the students were in the WA group. Seeking the best way to balance the data and mitigate bias, Jeong varied the proportion of the groups in the training set while keeping the size of the set constant. By varying the percentage of the BHN student data in the training set to range from zero to one hundred percent, she made a surprising discovery.  

“One might intuitively think that the mix of fifty-fifty or one that aligns with national demographics would yield the most equitable model, but it did not,” she explained. “We found that fairness increased most when we included more data points in the set from the majority group and fewer from the minority group.”

As part of her NSF project, Jeong wants to explore what causes the counterintuitive results and establish guidelines for data scientists on the optimal demographic mixture to use. She believes that the amount of noise in the data plays a role in how the data should be balanced. Noise here means inaccuracies in the data, such as people not answering surveys truthfully, giving incorrect answers, or problems created by a language barrier. Jeong hypothesizes that the fairest and least-biased mixture includes more data from the group having the lowest noise level.

Through her novel three-pronged approach to attacking real-world dataset issues, Jeong hopes to create guidelines and best practices in data preparation for equitable and fair ML. Given the skyrocketing use of ML and AI in nearly every sector of society, she believes that her work has significant real-world implications. 

“Data and computer scientists want AI to embody and promote essential societal values, like fairness and diversity, not stereotypes,” said Jeong. “Removing unwanted bias and inserting ethical objectives into the data-preparation pipeline could make that possible.”  

The end goal of Jeong’s project is to develop a software library that any data scientist or AI developer can use for fairness-aware data preparation. The library would include her group’s fair-imputation methods, bias-flow measurement toolkit and algorithms. 

Jeong also proposed an educational agenda that prioritizes the attraction and retention of talented female students to the study of AI. Research shows that only 12% of AI researchers and a mere 6% of professional software developments in the AI field are women. Jeong plans to design and host the “Girls’ AI Bootcamp,” which will be specifically tailored to engaging female high school students and introducing them to the exciting possibilities within CS and AI. 

“I have experienced firsthand the challenges of being in the minority in this field, and I am personally committed to closing the gender gap,” said Jeong. “I not only want to pique the interest among female high school students, but also instill self-confidence in them that they can be leading innovators in the fields of AI and CS.”

END

[Attachments] See images for this press release:
UCSB's Haewon Jeong receives an NSF Early CAREER Award

ELSE PRESS RELEASES FROM THIS DATE:

This new way to recycle steel could reduce the industry’s carbon footprint

This new way to recycle steel could reduce the industry’s carbon footprint
2024-06-18
University of Toronto engineering researchers have designed a new way to recycle steel that has the potential to decarbonize a range of manufacturing industries and usher in a circular steel economy.  The method is outlined in a new paper published in Resources, Conservation & Recycling and co-authored by Jaesuk (Jay) Paeng, William Judge and Professor Gisele Azimi.   It introduces an innovative oxysulfide electrolyte for electrorefining, ...

Journal of Nutrition recognizes distinguished Texas A&M nutrition scientist

2024-06-18
      MEDIA INQUIRES   WRITTEN BY Laura Muntean   Paul Schattenberg laura.muntean@ag.tamu.edu   paschattenberg@ag.tamu.edu 601-248-1891   210-859-5752 FOR ...

Non-native plants and animals expanding ranges 100 times faster than native species, finds new research led by UMass Amherst

2024-06-18
June 18, 2024   Non-native Plants and Animals Expanding Ranges 100 Times Faster than Native Species, Finds New Research Led by UMass Amherst Native species cannot move fast enough on their own to avoid climate-driven chaos   AMHERST, Mass. – An international team of scientists has recently found that non-native species are expanding their ranges many orders of magnitude faster than native ones, in large part due to inadvertent human help. Even seemingly sedentary non-native plants are moving at three times the speed ...

NASA Associate Administrator Jim Free to deliver keynote address at ISSRDC

NASA Associate Administrator Jim Free to deliver keynote address at ISSRDC
2024-06-18
BOSTON (MA), June 18, 2024 – Jim Free, associate administrator for NASA, will deliver a keynote address on Wednesday, July 31, at the International Space Station Research and Development Conference (ISSRDC) in Boston. Free, the senior advisor to Administrator Bill Nelson and Deputy Administrator Pam Melroy, is NASA’s third highest-ranking executive and its highest-ranking civil servant. In addition to leading the agency’s 10 center directors and the mission directorate associate administrators at NASA Headquarters ...

Cost may not keep many people from filling opioid addiction treatment prescriptions

2024-06-18
When people get a prescription for the opioid addiction medication called buprenorphine, they almost always fill it — even if they have to pay more out of their own pocket, a new study shows. Whether it’s their first prescription for the medication, or they’ve been taking it for months, nearly all patients pick up the order from the pharmacy, according to the new findings from a University of Michigan team. Even among those just starting on buprenorphine, higher costs aren’t a deterrent. The researchers say this suggests that removing barriers ...

Fred Hutch announces eight recipients of 2024 Dr. Eddie Méndez Scholar Award

Fred Hutch announces eight recipients of 2024 Dr. Eddie Méndez Scholar Award
2024-06-18
SEATTLE — June 18, 2024 — Fred Hutch Cancer Center announced the recipients of the 2024 Dr. Eddie Méndez Scholar Award, which recognizes outstanding early-career scientists from underrepresented backgrounds who are studying cancer, infectious diseases and basic sciences.   The eight postdoctoral awardees come from research institutions across the U.S. and are experts in a range of subjects including cancer immunology, fungal model systems and craniofacial development. “We enthusiastically congratulate this year’s recipients who were chosen from a very competitive pool of candidates,” said Christina Termini, PhD, MM, co-director ...

NASA selects Lockheed Martin to build next-gen spacecraft for NOAA

2024-06-18
NASA, on behalf of the National Oceanic and Atmospheric Administration (NOAA), has selected Lockheed Martin Corp. of Littleton, Colorado, to build the spacecraft for NOAA’s Geostationary Extended Observations (GeoXO) satellite program. This cost-plus-award-fee contract is valued at approximately $2.27 billion. It includes the development of three spacecraft as well as four options for additional spacecraft. The anticipated period of performance for this contract includes support for 10 years of on-orbit operations and five years of on-orbit storage, for a total of 15 years for each spacecraft. ...

C-Path partners with FARA to fortify RDCA-DAP and further accelerate drug development with new Friedreich’s Ataxia Data

2024-06-18
TUCSON, Ariz., June 18, 2024 — Critical Path Institute (C-Path), a leader in accelerating drug development for rare diseases, today announced the targeted integration of additional Friedreich’s ataxia (FA) datasets into C-Path’s Rare Disease Cures Accelerator-Data and Analytics Platform (RDCA-DAP®) as part of a partnership with Friedreich’s Ataxia Research Alliance (FARA).   This update includes data from two natural history studies; the FA-CHILD study, which focuses on pediatric ...

Rigorous new study debunks misconceptions about anemia, education

Rigorous new study debunks misconceptions about anemia, education
2024-06-18
In low- and middle-income countries, anemia reduction efforts are often touted as a way to improve educational outcomes and reduce poverty. A new study, co-authored by a global health economics expert from the University of Notre Dame, evaluates the relationship between anemia and school attendance in India, debunking earlier research that could have misguided policy interventions. Santosh Kumar, associate professor of development and global health economics at Notre Dame’s Keough School of Global Affairs, is co-author of the study, published in Communications ...

Existing high blood pressure drugs may prevent epilepsy, Stanford Medicine-led study finds

2024-06-18
A class of drugs already on the market to lower blood pressure appears to reduce adults’ risk of developing epilepsy, Stanford Medicine researchers and their colleagues have discovered. The finding comes out of an analysis of the medical records of more than 2 million Americans taking blood pressure medications. The study, published June 17 in JAMA Neurology, suggests that the drugs, called angiotensin receptor blockers, could prevent epilepsy in people at highest risk of the disease, ...

LAST 30 PRESS RELEASES:

UW-led research links wildfire smoke exposure with increased dementia risk

Most U.S. adults surveyed trust store-bought turkey is free of contaminants, despite research finding fecal bacteria in ground turkey

New therapy from UI Health offers FDA-approved treatment option for brittle type 1 diabetes

Alzheimer's: A new strategy to prevent neurodegeneration

A clue to what lies beneath the bland surfaces of Uranus and Neptune

Researchers uncover what makes large numbers of “squishy” grains start flowing

Scientists uncover new mechanism in bacterial DNA enzyme opening pathways for antibiotic development

New study reveals the explosive secret of the squirting cucumber

Vanderbilt authors find evidence that the hunger hormone leptin can direct neural development in a leptin receptor–independent manner

To design better water filters, MIT engineers look to manta rays

Self-assembling proteins can be used for higher performance, more sustainable skincare products

Cannabis, maybe, for attention problems

Building a better path to recovery for OUD

How climate change threatens this iconic Florida bird

Study reveals new factor involved in controlling calorie expenditure

Managing forests with smart technologies

Clinical trial finds that adding the chemotherapy pill temozolomide to radiation therapy improves survival in adult patients with a slow-growing type of brain tumor

H.E.S.S. collaboration detects the most energetic cosmic-ray electrons and positrons ever observed

Novel supernova observations grant astronomers a peek into the cosmic past

Association of severe maternal morbidity with subsequent birth

Herodotus' theory on Armenian origins debunked by first whole-genome study

Women who suffer pregnancy complications have fewer children

Home testing kits and coordinated outreach substantially improve colorectal cancer screening rates

COVID-19 vaccine reactogenicity among young children

Generalizability of clinical trials of novel weight loss medications to the US adult population

Wildfire smoke exposure and incident dementia

Health co-benefits of China's carbon neutrality policies highlighted in new review

Key brain circuit for female sexual rejection uncovered

Electrical nerve stimulation eases long COVID pain and fatigue

ASTRO issues update to clinical guideline on radiation therapy for rectal cancer

[Press-News.org] UCSB's Haewon Jeong receives an NSF Early CAREER Award