PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Not too big: Machine learning tames huge data sets

Using the Summit supercomputer, Los Alamos algorithm breaks the exabyte barrier

2023-09-11
(Press-News.org) LOS ALAMOS, N.M., Sept. 11, 2023 — A machine-learning algorithm demonstrated the capability to process data that exceeds a computer’s available memory by identifying a massive data set’s key features and dividing them into manageable batches that don’t choke computer hardware. Developed at Los Alamos National Laboratory, the algorithm set a world record for factorizing huge data sets during a test run on Oak Ridge National Laboratory’s Summit, the world’s fifth-fastest supercomputer.

Equally efficient on laptops and supercomputers, the highly scalable algorithm solves hardware bottlenecks that prevent processing information from data-rich applications in cancer research, satellite imagery, social media networks, national security science and earthquake research, to name just a few.

“We developed an ‘out-of-memory’ implementation of the non-negative matrix factorization method that allows you to factorize larger data sets than previously possible on a given hardware,” said Ismael Boureima, a computational physicist at Los Alamos National Laboratory. Boureima is first author of the paper in The Journal of Supercomputing on the record-breaking algorithm. “Our implementation simply breaks down the big data into smaller units that can be processed with the available resources. Consequently, it’s a useful tool for keeping up with exponentially growing data sets.”

“Traditional data analysis demands that data fit within memory constraints. Our approach challenges this notion,” said Manish Bhattarai, a machine learning scientist at Los Alamos and co-author of the paper. “We have introduced an out-of-memory solution. When the data volume exceeds the available memory, our algorithm breaks it down into smaller segments. It processes these segments one at a time, cycling them in and out of the memory. This technique equips us with the unique ability to manage and analyze extremely large data sets efficiently.”

 The distributed algorithm for modern and heterogeneous high-performance computer systems can be useful on hardware as small as a desktop computer, or as large and complex as Chicoma, Summit or the upcoming Venado supercomputers, Boureima said.

“The question is no longer whether it is possible to factorize a larger matrix, rather how long is the factorization going to take,” Boureima said.

The Los Alamos implementation takes advantage of hardware features such as GPUs to accelerate computation and fast interconnect to efficiently move data between computers. At the same time, the algorithm efficiently gets multiple tasks done simultaneously.

Non-negative matrix factorization is another installment of the high-performance algorithms developed under the SmartTensors project at Los Alamos.

In machine learning, non-negative matrix factorization can be used as a form of unsupervised learning to pull meaning from data, Boureima said. “That’s very important for machine learning and data analytics because the algorithm can identify explainable latent features in the data that have a particular meaning to the user.”

The record-breaking run In the record-breaking run by the Los Alamos team, the algorithm processed a 340-terabyte dense matrix and an 11-exabyte sparse matrix, using 25,000 GPUs.  

“We’re reaching exabyte factorization, which no one else has done, to our knowledge,” said Boian Alexandrov, a co-author of the new paper and a theoretical physicist at Los Alamos who led the team that developed the SmartTensors artificial intelligence platform.

Decomposing or factoring data is a specialized data-mining technique aimed at extracting pertinent information, simplifying the data into understandable formats.

Bhattarai further emphasized the scalability of their algorithm, remarking, “In contrast, conventional methods often grapple with bottlenecks, mainly due to the lag in data transfer between a computer’s processors and its memory."

“We also showed you don’t necessarily need big computers,” Boureima said. “Scaling to 25,000 GPUs is great if you can afford it, but our algorithm will be useful on desktop computers for something you couldn’t process before.”

The paper: “Distributed Out-of-Memory NMF on CPU/GPU Architectures.” The Journal of Supercomputing. Link: https://link.springer.com/article/10.1007/s11227-023-05587-4

The funding: This research was funded by DNN R&D and by the Laboratory Directed Research and Development program at Los Alamos National Laboratory.

LA-UR-23-29923

END


ELSE PRESS RELEASES FROM THIS DATE:

UArizona Cancer Center researchers discover iron-targeting approaches to halt proliferation of cancer cells

UArizona Cancer Center researchers discover iron-targeting approaches to halt proliferation of cancer cells
2023-09-11
Researchers at the University of Arizona Cancer Center discovered a new class of iron-targeting compounds that hamper the proliferation of cultured malignant cells in a laboratory setting. The results of the study were published in the Journal of the American Chemical Society. “Cancer cells are what we call ‘addicted’ to iron, and so we are making compounds that are able to interfere with the availability of iron in cancer cells,” said Elisa Tomat, PhD, professor in the Department of Chemistry ...

Experimental physicist David Weld to investigate the role of feedback and measurement in quantum systems

2023-09-11
(Santa Barbara, Calif.) — Experimental physicist David Weld’s experimental research interest lies in a question that has been around for a long time, but which we’re only now approaching the ability to investigate.   “There’s a really old interest in the quantum act of measurement,” he said. “It’s something that’s at the foundations of quantum mechanics and has been puzzling people for more than a century.”   Called the “measurement problem” and famously illustrated by Erwin Schrödinger’s ...

ORNL teams receive funding through DOE BRaVE initiative to study biopreparedness

ORNL teams receive funding through DOE BRaVE initiative to study biopreparedness
2023-09-11
The Department of Energy’s Office of Science has selected three Oak Ridge National Laboratory research teams to receive funding through DOE’s new Biopreparedness Research Virtual Environment, or BRaVE, initiative. BRaVE, announced earlier this year, aims to build on biopreparedness research that delivered high-impact results in the fight against COVID-19. In the height of the pandemic, DOE national laboratory scientists combined fields such as biology, high-performance computing and manufacturing to bolster the national supply of personal protective equipment and improve virus testing and treatment. “The advances made ...

Self-reported “night owls” more likely to have unhealthy lifestyle behaviors, significantly increased diabetes risk

2023-09-11
Embargoed for release until 5:00 p.m. ET on Monday 11 September 2023 Annals of Internal Medicine Tip Sheet @Annalsofim Below please find summaries of new articles that will be published in the next issue of Annals of Internal Medicine. The summaries are not intended to substitute for the full articles as a source of information. This information is under strict embargo and by taking it into possession, media representatives are committing to the terms of the embargo not only on their own behalf, but also on behalf of the organization they represent. ---------------------------- 1. Self-reported “night owls” more likely to have ...

Big teeth, bigger data

Big teeth, bigger data
2023-09-11
Virginia Tech researchers in the College of Natural Resources and Environment are assessing the efficacy of shark sanctuaries by developing a modeling system that utilizes publicly accessible fishing data to determine shark catch and mortality rates. Published in the journal Science Advances, their findings represent an important step in utilizing data science to tackle oceanic conservation challenges. “Shark sanctuaries are coastal areas designated by countries as places where the targeted ...

Article: Doctors treating patients with Parkinson’s disease must focus on stigma and emotional impacts as well as motor symptoms

2023-09-11
Even the best treatment approaches for Parkinson’s disease are inadequate if they do not address patients’ feelings of social rejection, isolation, loneliness and other psychosocial effects of stigma, according to a report from experts specializing in Parkinson’s and other movement disorders. A new report co-authored by UCLA Health neurologist and researcher Dr. Indu Subramanian says many misconceptions and biases cause patients with Parkinson’s to be stereotyped, devalued and shunned, which, along with a progressive loss of functionality and independence, often lead to “self-stigma,” with declining self-esteem and increasing anxiety and depression. The ...

LSU Health New Orleans researchers discover a key failure in amd that may lead to progression and vision loss

LSU Health New Orleans researchers discover a key failure in amd that may lead to progression and vision loss
2023-09-11
New Orleans, LA – Research led by Nicolas Bazan, MD, PhD, Boyd Professor, Ernest C. and Yvette C. Villere Chair for the Study of Retinal Degeneration, and Director of the Neuroscience Center of Excellence at LSU Health New Orleans School of Medicine, suggests that age-related macular degeneration (AMD) decreases an essential fatty acid, preventing the formation of a class of protective molecules and reducing repair potential. The discovery may also open new therapeutic avenues for AMD. The findings are published in Experimental Eye Research, ...

Virginia Tech has seismic role in earthquake center

Virginia Tech has seismic role in earthquake center
2023-09-11
A Virginia Tech professor has an integral role in the establishment of a new center to study earthquakes in the Cascadia Subduction Zone off the coast of Oregon. The project will create an earthquake center to study subduction zones — fault lines where one tectonic plate slips beneath another — to enable collaborative research and community connections for increased hazard awareness. The Division of Earth Sciences in the Directorate for Geosciences at the National Science Foundation has awarded a $15 million grant over five years to establish the Cascadia Region ...

3D printing with coffee: Turning used grounds into caffeinated creations

3D printing with coffee: Turning used grounds into caffeinated creations
2023-09-11
Coffee can do a lot of things: Wake you up, warm you up and lessen that existential dread. According to a new study, it could also help reduce the waste from 3D printing.  That’s the vision behind a new project led by Michael Rivera, an assistant professor in the ATLAS Institute and Department of Computer Science at the University of Colorado Boulder. He and his colleagues have developed a method for 3D printing a wide range of objects using a paste made entirely out of old coffee grounds, water and a few other sustainable ingredients.  The team has already experimented with using coffee grounds to craft jewelry, pots for plants and even, ...

Firms address corporate scandal with lengthy codes of ethics, study shows

2023-09-11
Corporate scandals have been on the rise for the past decade. In 2019, Strategy& (the strategy consulting business unit of PricewaterhouseCoopers) found that for the first time in the history of its annual survey, more CEOs were dismissed for ethical concerns than for poor firm performance or internal board struggles. There has been no shortage of highly publicized scandals, including the BP oil spill in 2010, the Target data breach in 2013 and abuses of financial incentives at Wells Fargo in 2016. A number of CEOs have resigned following alleged inappropriate relations, including Brian Krzanich at Intel, Leslie Moonves at CBS, ...

LAST 30 PRESS RELEASES:

Sexual health symptoms may correlate with poor adherence to adjuvant endocrine therapy in Black women with breast cancer

Black patients with triple-negative breast cancer may be less likely to receive immunotherapy than white patients

Affordable care act may increase access to colon cancer care for underserved groups

UK study shows there is less stigma against LGBTQ people than you might think, but people with mental health problems continue to experience higher levels of stigma

Bringing lost proteins back home

Better than blood tests? Nanoparticle potential found for assessing kidneys

Texas A&M and partner USAging awarded 2024 Immunization Neighborhood Champion Award

UTEP establishes collaboration with DoD, NSA to help enhance U.S. semiconductor workforce

Study finds family members are most common perpetrators of infant and child homicides in the U.S.

Researchers secure funds to create a digital mental health tool for Spanish-speaking Latino families

UAB startup Endomimetics receives $2.8 million Small Business Innovation Research grant

Scientists turn to human skeletons to explore origins of horseback riding

UCF receives prestigious Keck Foundation Award to advance spintronics technology

Cleveland Clinic study shows bariatric surgery outperforms GLP-1 diabetes drugs for kidney protection

Study reveals large ocean heat storage efficiency during the last deglaciation

Fever drives enhanced activity, mitochondrial damage in immune cells

A two-dose schedule could make HIV vaccines more effective

Wastewater monitoring can detect foodborne illness, researchers find

Kowalski, Salonvaara receive ASHRAE Distinguished Service Awards

SkAI launched to further explore universe

SLU researchers identify sex-based differences in immune responses against tumors

Evolved in the lab, found in nature: uncovering hidden pH sensing abilities

Unlocking the potential of patient-derived organoids for personalized sarcoma treatment

New drug molecule could lead to new treatments for Parkinson’s disease in younger patients

Deforestation in the Amazon is driven more by domestic demand than by the export market

Demand-side actions could help construction sector deliver on net-zero targets

Research team discovers molecular mechanism for a bacterial infection

What role does a tailwind play in cycling’s ‘Everesting’?

Projections of extreme temperature–related deaths in the US

Wearable device–based intervention for promoting patient physical activity after lung cancer surgery

[Press-News.org] Not too big: Machine learning tames huge data sets
Using the Summit supercomputer, Los Alamos algorithm breaks the exabyte barrier