
Human pangenome reference will enable more complete and equitable understanding of genomic diversity

2023-05-10
(Press-News.org) UC Santa Cruz scientists, along with a consortium of researchers, have released a draft of the first human pangenome—a new, usable reference for genomics that combines the genetic material of 47 individuals from different ancestral backgrounds to allow for a deeper, more accurate understanding of worldwide genomic diversity. 

By adding 119 million bases—the “letters” in DNA sequences—to the existing genomics reference, the pangenome provides a representation of human genetic diversity that was not possible with a single reference genome. It is highly accurate, more complete and dramatically increases the detection of variants in the human genome, as shown in a collection of groundbreaking papers published today in the journals Nature, Genome Research, Nature Biotechnology, and Nature Methods.  

The pangenome was produced by the Human Pangenome Reference Consortium (HPRC), which is co-led by UCSC’s Associate Professor of Biomolecular Engineering Benedict Paten and Assistant Professor of Biomolecular Engineering Karen Miga. It is now available for use in an assembly hub on the UCSC Genome Browser. More than a dozen UCSC researchers and students are contributors to this project, which will continue into 2024, when the researchers plan to release a final pangenome with genomic information from 350 individuals.

“We are introducing more diversity and equity into the reference by sampling diverse human beings and including them in this structure that everyone can use,” said Paten, who is the senior author on the main marker paper. “One genome isn’t enough to represent everybody—the pangenome will ultimately be something that is inclusive and representative.”

Understanding genomic variation 

Each person’s genome varies slightly—by about 0.4 percent compared to the next person, on average—and understanding these differences can provide insight into their health, help to diagnose disease, predict medical outcomes, and guide treatments. Using the pangenome reference will improve scientists’ ability to detect and understand variation in future studies. 

Typically, when scientists and clinicians study an individual’s genome to look for variation, they compare that individual’s DNA to that of a standard reference to determine where there are differences of one or more base pairs. Until now, the reference genome has primarily been represented by a single sequence for each human chromosome, mostly sourced from one individual. But this reference is nearly 20 years old and fundamentally limited in that it cannot represent the wealth of genetic variation present in the human population. This introduces an issue called reference bias into genome analysis.
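
As a rough illustration of comparing an individual's DNA to a single reference, the sketch below lists the positions where two sequences differ. It assumes the sequences are already aligned and of equal length, which real genomes are not; the function name and the example sequences are hypothetical and do not come from the HPRC tools.

```python
def find_differences(reference, sample):
    """Return (position, reference_base, sample_base) for each mismatch (0-based)."""
    if len(reference) != len(sample):
        raise ValueError("this sketch assumes pre-aligned sequences of equal length")
    return [
        (i, ref_base, sample_base)
        for i, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


# Made-up sequences for illustration only.
reference = "ACGTTGCA"
sample = "ACGATGCA"
differences = find_differences(reference, sample)
print(differences)  # [(3, 'T', 'A')]
```

A comparison like this can only report differences relative to what the single reference contains, which is one way to picture the limitation described above.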

In contrast, the new pangenome is a reference that combines the genomes of 47 individuals from various ancestral backgrounds. The pangenome looks like a linear reference in areas where the sequences have the same bases, and expands to show the areas where there are differences. It represents many different versions of the human genome sequence at the same time, and gives scientists a more accurate point of comparison for variation that is present in some populations but not others.
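
The idea of shared stretches that expand where sequences differ can be sketched with a toy graph. This is a minimal illustration under stated assumptions, not the HPRC data model: the node ids, segments, and haplotype names are invented, and each genome is represented simply as a path through labeled nodes.

```python
# Toy sequence graph: node id -> DNA segment (all values are made up).
nodes = {1: "ACGT", 2: "A", 3: "G", 4: "TTCA"}

# Each haplotype is a path: an ordered list of node ids through the graph.
paths = {
    "hap_A": [1, 2, 4],
    "hap_B": [1, 3, 4],
    "hap_C": [1, 2, 4],
}


def spell(path):
    """Reconstruct the linear sequence that a path spells out."""
    return "".join(nodes[node_id] for node_id in path)


def shared_nodes(all_paths):
    """Nodes visited by every path: the 'linear-looking' parts of the graph."""
    return set.intersection(*(set(p) for p in all_paths.values()))


def variable_nodes(all_paths):
    """Nodes visited by only some paths: where the graph expands."""
    every_node = set().union(*all_paths.values())
    return every_node - shared_nodes(all_paths)


print(spell(paths["hap_A"]))  # ACGTATTCA
print(spell(paths["hap_B"]))  # ACGTGTTCA
print(sorted(shared_nodes(paths)))    # [1, 4]
print(sorted(variable_nodes(paths)))  # [2, 3]
```

Here nodes 1 and 4 are common to all three haplotypes, while nodes 2 and 3 form the diverging region that distinguishes hap_B from the other two.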

“One genome can't possibly represent all of the rich variation we know can be observed and studied around the world,” said Miga, Director of the HPRC Production Center at UCSC. “The No. 1 goal of the human pangenome reference is to try to broaden the representation of a reference resource to be more inclusive and more equitable for studying the human species, as a collection of references and not just one.” 

Genomic variation can be small, consisting of differences of just one or a few DNA bases, or it can be large structural variants, classified as variants that are 50 base pairs or larger. These larger, structural variants can have important health implications. Until now, researchers have been unable to identify more than 70 percent of the structural variants that exist in human genomes due to limited technologies and the bias of using a single reference sequence.
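
The 50-base-pair threshold can be expressed as a small classification rule. The sketch below is a simplification: it measures a variant by the length of its longer allele, which is an assumption made for illustration, and the function and constant names are hypothetical.

```python
SV_MIN_LENGTH = 50  # threshold in base pairs, as stated in the text


def classify_variant(ref_allele, alt_allele):
    """Label a variant 'structural' or 'small' by the length of its longer allele.

    Using the longer allele as the variant size is an assumption of this sketch.
    """
    size = max(len(ref_allele), len(alt_allele))
    return "structural" if size >= SV_MIN_LENGTH else "small"


print(classify_variant("A", "G"))             # small
print(classify_variant("A", "A" + "T" * 60))  # structural
```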

Of the 119 million new bases added to the reference with the pangenome, roughly 90 million of these derive from structural variation. Structural variants are complex and may be inversions of sequences, insertions, deletions, or tandem repeats—a segment of two or more bases repeated numerous times. These new bases will help researchers to study regions in the genome for which there was previously no reference, and potentially be able to associate structural variants with disease in future studies.

“Now, we can map to more structural variants, so we're finding features and areas in the genome that just weren't there before,” Miga said. “That’s exciting because it's allowing us to look at gene regulation in a unique way that we couldn't study before, because those areas probably would have been inappropriately mapped or just ignored altogether.”

Using the pangenome reference for genomic analysis increases the detection of structural variants by 104 percent as compared to detection using the standard reference. The pangenome reference also increases the accuracy of calling small variants, those just a few bases long, by about 34 percent because of the increased amount of data present in the pangenome.

Each human carries a paired set of chromosomes: one set inherited from the mother and one from the father. The individual genomes present in the pangenome reference contain haplotype-resolved information, meaning the two parental sets of chromosomes can be confidently distinguished, which is a major scientific feat. Having this information will help scientists better understand how various genes and diseases are inherited.

This also means the current reference actually includes 94 distinct genome sequences, two for each of the 47 individuals, with the goal of getting to 700 (two for each of 350 individuals) by 2024.

Creating the pangenome 

The pangenome was made possible through the development of advanced computational techniques to align the multiple genome sequences into one usable reference in a structure called a pangenome graph. Paten and researchers in the UCSC Computational Genomics lab helped lead the HPRC efforts to develop the algorithmic methods needed to create this pangenome graph structure.

Because of the methods used in this project, all of the genomes within the pangenome reference are of extremely high quality and accuracy, covering more than 99 percent of each human genome with more than 99 percent accuracy.

“In the linear reference, we had only one sequence, one representation of each gene,” said Mobin Asri, a bioinformatics Ph.D. candidate at UCSC and co-first author on the main paper. “But we know that our genes have different variations in the human population. Using the pangenome graph, we want to have all of those variations in a single structure—and a graph is a natural way to do this.” 

The HPRC project relies heavily on long-read and ultra-long-read sequencing technology to read DNA from biological samples. With recent advances, these techniques can now decode thousands to millions of base pairs of the genome at once. The long stretches of DNA reads are then assembled via specialized algorithms into more complete genomic sequences. Ideally, each assembled sequence should represent the sequence of one chromosome.

Long reads contain errors about one percent of the time, and current assembly algorithms are not perfect, which can cause the assembled sequences to be erroneous in some locations. To check for and correct these errors, the individual genomes that have been sequenced and assembled move through multiple tools, including a reliability pipeline developed by Asri. Once the assemblies have been processed by these tools, the researchers can be confident that they are accurate and complete.

After moving through Asri’s pipeline, the various genomes are compiled via complex algorithmic methods into the pangenome graph structure. Visually, the graph genome allows researchers to view differences in the various reference sequences as diverging areas in otherwise shared paths.

Building an accessible resource

All of the first 47 diploid genomes in the draft pangenome were sourced from individuals who participated in the 1000 Genomes Project (1000G), an influential effort that created a catalog of common human genetic variation from openly consented samples and was completed in 2015. The open consent status of these samples allows any researcher to access the resource without the privacy barriers that typically accompany genome research, with the aim of making the pangenome accessible to as many people as possible.

“Becoming a common resource is something that’s fundamental to the success of a human pangenome reference,” Miga said. “It has to have the ability to be accessible and open around the world to all researchers so we can use it as the foundation.”

The HPRC team is focused on outreach to ensure that the pangenome is a useful resource that will be utilized in clinics around the world. This means facilitating annotations, feedback, and input from the researchers carrying out studies using the pangenome reference. 

“The draft pangenome is an important proof of principle that we hope is going to influence a lot of people and get them thinking about the pangenome and how it might affect their work,” Paten said. “Looking ahead, we see a lot of engagement with other groups—it takes a lot of different people to build something that is going to become a big community resource.”

Along with a focus on accessibility, the HPRC project has a dedicated ethics team focused on the social and legal implications of this project. They are working to anticipate challenging issues and help guide informed consent, prioritize the study of different samples, explore possible regulatory issues pertaining to clinical adoption, and work with international and Indigenous communities to incorporate their genome sequences in these broader efforts.

Continuing the legacy and future work 

The human pangenome is a continuation of decades-long efforts from scientists at UC Santa Cruz to understand the biological code that underlies human life. 

In 2000, Jim Kent, then a UCSC graduate student and now a research scientist at the Genomics Institute and director of the UCSC Genome Browser, wrote the code that assembled the first working draft of the human genome. UCSC scientists published it with open access to anyone who wanted to use it. Since then, UCSC has been at the forefront of genomics research.

In April 2022, UCSC’s Karen Miga co-led the Telomere-to-Telomere consortium in assembling the first complete sequence of a human genome, filling in missing, complex regions of the reference that had long eluded scientists.

“Since 2000, we’ve had a series of increasingly more accurate representations of one genome,” said David Haussler, Scientific Director of the UCSC Genomics Institute, who led the UCSC team on the original Human Genome Project and advises on the pangenome project. “But no matter how accurately you represent one genome, that’s not going to represent all of humanity. Now is a turning point: no longer genomics of the one standard human genome, but genomics for everybody.”

The researchers are making progress toward the goal of completing the full pangenome by 2024. The team is in the process of recruiting new individuals to represent some populations not included in the 1000 Genomes Project, particularly people of Middle Eastern and African ancestry. Miga, as the director of the Data Production Center at UCSC, will spearhead these efforts going forward.

In addition to completing the final pangenome reference, the researchers are working toward forming an international human pangenome project that would establish partnerships with researchers across the world. These partnerships would include a two-way exchange of skills and knowledge, aimed at putting the skills and technology needed to create high-quality reference genomes into the hands of researchers worldwide so they can carry out their own research.

Other UCSC researchers on the main paper include Marina Haukness, Glenn Hickey, Julian Lucas, Jean Monlong, Xian Chang, Jordan Eizenga, Charles Markello, Adam Novak, Hugh Olsen, and Trevor Pesout. 

Other institutions involved in the Human Pangenome Reference Consortium may be found on the project’s main page.

Funding for the HPRC was primarily provided by the National Human Genome Research Institute.
