PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Explainable AI for decoding genome biology

Opening the black box to uncover the rules of the genome's regulatory code

Explainable AI for decoding genome biology
2021-02-18
(Press-News.org) KANSAS CITY, MO--Researchers at the Stowers Institute for Medical Research, in collaboration with colleagues at Stanford University and Technical University of Munich have developed advanced explainable artificial intelligence (AI) in a technical tour de force to decipher regulatory instructions encoded in DNA. In a report published online February 18, 2021, in Nature Genetics, the team found that a neural network trained on high-resolution maps of protein-DNA interactions can uncover subtle DNA sequence patterns throughout the genome and provide a deeper understanding of how these sequences are organized to regulate genes.

Neural networks are powerful AI models that can learn complex patterns from diverse types of data such as images, speech signals, or text to predict associated properties with impressive high accuracy. However, many see these models as uninterpretable since the learned predictive patterns are hard to extract from the model. This black-box nature has hindered the wide application of neural networks to biology, where interpretation of predictive patterns is paramount.

One of the big unsolved problems in biology is the genome's second code--its regulatory code. DNA bases (commonly represented by letters A, C, G, and T) encode not only the instructions for how to build proteins, but also when and where to make these proteins in an organism. The regulatory code is read by proteins called transcription factors that bind to short stretches of DNA called motifs. However, how particular combinations and arrangements of motifs specify regulatory activity is an extremely complex problem that has been hard to pin down.

Now, an interdisciplinary team of biologists and computational researchers led by Stowers Investigator Julia Zeitlinger, PhD, and Anshul Kundaje, PhD, from Stanford University, have designed a neural network--named BPNet for Base Pair Network--that can be interpreted to reveal regulatory code by predicting transcription factor binding from DNA sequences with unprecedented accuracy. The key was to perform transcription factor-DNA binding experiments and computational modeling at the highest possible resolution, down to the level of individual DNA bases. This increased resolution allowed them to develop new interpretation tools to extract the key elemental sequence patterns such as transcription factor binding motifs and the combinatorial rules by which motifs function together as a regulatory code.

"This was extremely satisfying," says Zeitlinger, "as the results fit beautifully with existing experimental results, and also revealed novel insights that surprised us."

For example, the neural network models enabled the researchers to discover a striking rule that governs binding of the well-studied transcription factor called Nanog. They found that Nanog binds cooperatively to DNA when multiples of its motif are present in a periodic fashion such that they appear on the same side of the spiraling DNA helix.

"There has been a long trail of experimental evidence that such motif periodicity sometimes exists in the regulatory code," Zeitlinger says. "However, the exact circumstances were elusive, and Nanog had not been a suspect. Discovering that Nanog has such a pattern, and seeing additional details of its interactions, was surprising because we did not specifically search for this pattern."

"This is the key advantage of using neural networks for this task," says ?iga Avsec, PhD, first author of the paper. Avsec and Kundaje created the first version of the model when Avsec visited Stanford during his doctoral studies in the lab of Julien Gagneur, PhD, at the Technical University in Munich, Germany.

"More traditional bioinformatics approaches model data using pre-defined rigid rules that are based on existing knowledge. However, biology is extremely rich and complicated," says Avsec. "By using neural networks, we can train much more flexible and nuanced models that learn complex patterns from scratch without previous knowledge, thereby allowing novel discoveries."

BPNet's network architecture is similar to that of neural networks used for facial recognition in images. For instance, the neural network first detects edges in the pixels, then learns how edges form facial elements like the eye, nose, or mouth, and finally detects how facial elements together form a face. Instead of learning from pixels, BPNet learns from the raw DNA sequence and learns to detect sequence motifs and eventually the higher-order rules by which the elements predict the base-resolution binding data.

Once the model is trained to be highly accurate, the learned patterns are extracted with interpretation tools. The output signal is traced back to the input sequences to reveal sequence motifs. The final step is to use the model as an oracle and systematically query it with specific DNA sequence designs, similar to what one would do to test hypotheses experimentally, to reveal the rules by which sequence motifs function in a combinatorial manner.

"The beauty is that the model can predict way more sequence designs that we could test experimentally," Zeitlinger says. "Furthermore, by predicting the outcome of experimental perturbations, we can identify the experiments that are most informative to validate the model." Indeed, with the help of CRISPR gene editing techniques, the researchers confirmed experimentally that the model's predictions were highly accurate.

Since the approach is flexible and applicable to a variety of different data types and cell types, it promises to lead to a rapidly growing understanding of the regulatory code and how genetic variation impacts gene regulation. Both the Zeitlinger Lab and the Kundaje Lab are already using BPNet to reliably identify binding motifs for other cell types, relate motifs to biophysical parameters, and learn other structural features in the genome such as those associated with DNA packaging. To enable other scientists to use BPNet and adapt it for their own needs, the researchers have made the entire software framework available with documentation and tutorials.

INFORMATION:

Other contributors to the study included Melanie Weilert, Sabrina Krueger, PhD, Khyati Dalal, Robin Fropf, PhD, and Charles McAnany, PhD, from Stowers; and Avanti Shrikumar, PhD, and Amr Alexandari from Stanford University.

This work was supported in part by the Stowers Institute for Medical Research and the National Human Genome Research Institute (awards R01HG009674 and U01HG009431 to A.K. and R01HG010211 to J.Z.) and National Institute of General Medical Sciences (DP2GM123485 to A.K.) of the National Institutes of Health (NIH). Additional support included the German Bundesministerium für Bildung und Forschung (project MechML 01IS18053F to Z.A.) and a Stanford BioX Fellowship and Howard Hughes Medical Institute International Student Research Fellowship (to A.S). Sequencing was performed at the Stowers Institute for Medical Research and University of Kansas Medical Center Genomics Core supported by the NIH awards from the National Institute of Child Health and Human Development (U54HD090216), Office of the Director (Instrumentation S10OD021743), and National Institute of General Medical Sciences (COBRE P30GM122731). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Lay Summary of Findings

DNA is well known for encoding proteins. It also contains another code--a regulatory code--that directs when and where to make proteins in an organism. In a report published online February 18, 2021, in Nature Genetics, researchers from the lab of Julia Zeitlinger, PhD, an Investigator at the Stowers Institute for Medical Research, and collaborators from Stanford University and Technical University of Munich describe how they have used explainable artificial intelligence to help decipher the genome's regulatory code.

The researchers developed a neural network whose innerworkings can be uncovered to reveal regulatory DNA sequence patterns and their higher-level organizing principles from high-resolution genomics data. The Zeitlinger Lab anticipates that the predictive models, rules, and maps generated using this type of approach will lead to a better understanding of natural and disease-associated genetic variation in regulatory regions of DNA.

About the Stowers Institute for Medical Research

Founded in 1994 through the generosity of Jim Stowers, founder of American Century Investments, and his wife, Virginia, the Stowers Institute for Medical Research is a non-profit, biomedical research organization with a focus on foundational research. Its mission is to expand our understanding of the secrets of life and improve life's quality through innovative approaches to the causes, treatment, and prevention of diseases.

The Institute consists of twenty independent research programs. Of the approximately 500 members, over 370 are scientific staff that includes principal investigators, technology center directors, postdoctoral scientists, graduate students, and technical support staff. Learn more about the Institute at http://www.stowers.org and about its graduate program at http://www.stowers.org/gradschool.


[Attachments] See images for this press release:
Explainable AI for decoding genome biology

ELSE PRESS RELEASES FROM THIS DATE:

Like it or not, history shows that taxes and bureaucracy are cornerstones of democracy

Like it or not, history shows that taxes and bureaucracy are cornerstones of democracy
2021-02-18
The media has been rife with stories about democracy in decline: the recent coup in Myanmar, the ascent of strongman Narendra Modi in India, and of course ex-President Trump's attempts to overturn the U.S. presidential election--all of which raise alarms about the current status of democracies worldwide. Such threats to the voices of the people are often attributed to the excesses of individual leaders. But while leadership is certainly important, over the past decade, as established democracies like Venezuela and Turkey fell and others slid toward greater authoritarianism, political scientists and pundits have largely overlooked a key factor: ...

Is odor the secret to bats' sex appeal?

Is odor the secret to bats sex appeal?
2021-02-18
When falling in love, humans often pay attention to looks. Many non-human animals also choose a sexual partner based on appearance. Male birds may sport flashy feathers to attract females, lionesses prefer lions with thicker manes and colorful male guppies with large spots attract the most females. But bats are active in the dark. How do they attract mates? Mariana Muñoz-Romo, a senior Latin American postdoctoral fellow at the Smithsonian Tropical Research Institute (STRI) and National Geographic explorer, pioneers research to understand the role of odors in bat mating behavior. "Aside from their genitalia, most male and female bat species look identical at first glance. However, a detailed examination during mating season reveals odor-producing glands or structures that are only present ...

Chatter between cell populations drives progression of gastrointestinal tumors

Chatter between cell populations drives progression of gastrointestinal tumors
2021-02-18
Gastrointestinal stromal tumors (GISTs) are a subytpe of cancers known as sarcomas. GIST is the most common type of sarcoma with approximately 5,000 to 6,000 new patient cases annually in the United States. GIST cannot be cured by drugs alone, and targeted therapies are only modestly effective, with a high rate of drug resistance. In a recent study, researchers at University of California San Diego School of Medicine identified new therapeutic targets that could lead to new treatment options for patients. The study, published in the February 18, 2021 online edition of Oncogene, found that specific cell-to-cell communication influences GIST biology and is strongly associated with cancer ...

Stents or bypass surgery more effective for stable patients with high-risk cardiac anatomy

Stents or bypass surgery more effective for stable patients with high-risk cardiac anatomy
2021-02-18
A recent study by University of Alberta cardiologists at the Canadian VIGOUR Centre shows that a particular group of patients with stable ischemic heart disease have better outcomes with percutaneous coronary intervention (also called angioplasty with stent) or coronary artery bypass surgery and medication, versus conservative management with medication alone. In a study published in the Journal of the American Heart Association, associate professor of medicine and academic interventional cardiologist Kevin Bainey and his team reviewed the patient information of more than 9,000 Albertans with stable ischemic heart disease. While able to function as outpatients, ...

Study suggests link between DNA and marriage satisfaction in newlyweds

Study suggests link between DNA and marriage satisfaction in newlyweds
2021-02-18
FAYETTEVILLE, Ark. -Variation in a specific gene could be related to traits that are beneficial to bonding and relationship satisfaction in the first years of a marriage, according to a new study by a University of Arkansas psychologist. Recent research indicates that a variation called "CC" in the gene CD38 is associated with increased levels of gratitude. Extending that line of work, U of A psychologist Anastasia Makhanova and her colleagues used data from a study of genotyped newlyweds to explore whether a correlation existed between the CD38 CC variation and levels of trust, forgiveness and marriage satisfaction. They found that individuals with the CC variation did report higher levels of perceptions considered beneficial to successful relationships, particularly trust. Marriage ...

Songbirds' reproductive success reduced by natural gas compressor noise

Songbirds reproductive success reduced by natural gas compressor noise
2021-02-18
Some songbirds are not dissuaded by constant, loud noise emitted by natural gas pipeline compressors and will establish nests nearby. The number of eggs they lay is unaffected by the din, but their reproductive success ultimately is diminished. That's the conclusion of a team of Penn State researchers who conducted an innovative, elaborate study that included unceasing playback of recorded compressor noise, 80 new, never-before-used nest boxes occupied by Eastern bluebirds and tree swallows, and behavioral observations with video cameras placed within boxes. Importantly, the birds did not preferentially select quiet boxes over noisy boxes, suggesting they do not recognize the reduction ...

UCI researchers eavesdrop on cellular conversations

UCI researchers eavesdrop on cellular conversations
2021-02-18
Irvine, Calif. -- An interdisciplinary team of biologists and mathematicians at the University of California, Irvine has developed a new tool to help decipher the language cells use to communicate with one another. In a paper published today in Nature Communications, the researchers introduce CellChat, a computational platform that enables the decoding of signaling molecules that transmit information and commands between the cells that come together to form biological tissues and even entire organs. "To properly understand why cells do certain things, and to predict their future actions, we need to be able to listen ...

The messenger matters in safe gun storage, suicide prevention education

2021-02-18
Law enforcement and those in the military, rather than doctors and celebrities, are the most preferred messengers on firearm safety, a Rutgers study found. The findings, published in the journal Preventive Medicine, can help communicate the importance of safe firearm storage and reduce the rate of suicides, Rutgers researchers say. "We know that safe firearm storage is a key component to suicide prevention, but that belief is not widespread among firearm owners," said lead author Michael Anestis, executive director of the New Jersey Gun Violence Research Center and an associate professor of Urban-Global Public Health at Rutgers School ...

Fuel for earliest life forms: Organic molecules found in 3.5 billion-year-old rocks

Fuel for earliest life forms: Organic molecules found in 3.5 billion-year-old rocks
2021-02-18
A research team including the geobiologist Dr. Helge Missbach from the University of Cologne has detected organic molecules and gases trapped in 3.5 billion-year-old rocks. A widely accepted hypothesis says that the earliest life forms used small organic molecules as building materials and energy sources. However, the existence of such components in early habitats on Earth was as yet unproven. The current study, published in the journal 'Nature Communications', now shows that solutions from archaic hydrothermal vents contained essential components that formed a basis for the earliest life on our planet. Specifically, the scientists examined about ...

The mass of Cygnus X-1's black hole challenges stellar evolution models

2021-02-18
Weighing in at roughly 21 solar masses, the black hole in the X-ray binary system Cygnus X-1 is so massive that it challenges current stellar evolution models, a new study reveals. Ultimately, the mass of a black hole is determined by its parent star's properties and is generally constrained by the mass lost to stellar winds throughout its lifetime. If a black hole interacts with a binary companion star, the system emits X-rays and can sometimes form radio jets, which make the systems visible to electromagnetic observations as an X-ray binary. Measurements from known x-ray binaries have shown that black holes in these systems all have masses below 20 solar masses (M?), with the largest being 15-17 M?. However, gravitational wave detections of black hole merger events have found ...

LAST 30 PRESS RELEASES:

Study challenges assumptions about how tuberculosis bacteria grow

NASA Goddard Lidar team receives Center Innovation Award for Advancements

Can AI improve plant-based meats?

How microbes create the most toxic form of mercury

‘Walk this Way’: FSU researchers’ model explains how ants create trails to multiple food sources

A new CNIC study describes a mechanism whereby cells respond to mechanical signals from their surroundings

Study uncovers earliest evidence of humans using fire to shape the landscape of Tasmania

Researchers uncover Achilles heel of antibiotic-resistant bacteria

Scientists uncover earliest evidence of fire use to manage Tasmanian landscape

Interpreting population mean treatment effects in the Kansas City Cardiomyopathy Questionnaire

Targeting carbohydrate metabolism in colorectal cancer: Synergy of therapies

Stress makes mice’s memories less specific

Research finds no significant negative impact of repealing a Depression-era law allowing companies to pay workers with disabilities below minimum wage

Resilience index needed to keep us within planet’s ‘safe operating space’

How stress is fundamentally changing our memories

Time in nature benefits children with mental health difficulties: study

In vitro model enables study of age-specific responses to COVID mRNA vaccines

Sitting too long can harm heart health, even for active people

International cancer organizations present collaborative work during oncology event in China

One or many? Exploring the population groups of the largest animal on Earth

ETRI-F&U Credit Information Co., Ltd., opens a new path for AI-based professional consultation

New evidence links gut microbiome to chronic disease outcomes

Family Heart Foundation appoints Dr. Seth Baum as Chairman of the Board of Directors

New route to ‘quantum spin liquid’ materials discovered for first time

Chang’e-6 basalts offer insights on lunar farside volcanism

Chang’e-6 lunar samples reveal 2.83-billion-year-old basalt with depleted mantle source

Zinc deficiency promotes Acinetobacter lung infection: study

How optogenetics can put the brakes on epilepsy seizures

Children exposed to antiseizure meds during pregnancy face neurodevelopmental risks, Drexel study finds

Adding immunotherapy to neoadjuvant chemoradiation may improve outcomes in esophageal cancer

[Press-News.org] Explainable AI for decoding genome biology
Opening the black box to uncover the rules of the genome's regulatory code