PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Introducing GPMeta: Ultrarapid GPU-accelerated pathogen identification approach

GPMeta speeds up classification of metagenomic sequences is a critical procedure for pathogen identification in the dry-lab step of mNGS tests

Introducing GPMeta: Ultrarapid GPU-accelerated pathogen identification approach
2023-04-26
(Press-News.org)

Metagenomic sequencing (mNGS) is a powerful diagnostic tool to detect causative pathogens in clinical microbiological testing. Rapid and accurate classification of metagenomic sequences is a critical procedure for pathogen identification in the dry-lab step of mNGS tests. However, this crucial step may be improved by classifying sequences within a clinically relevant timeframe.

To address this challenge, a BGI Genomics team led by Xuebin Wang has recently launched GPMeta, an ultra-fast pathogen detection approach, and published these highlights in Briefings in Bioinformatics.

GPMeta can quickly and accurately identify pathogens through complex and massive mNGS sequencing data. Using simulated datasets and metagenomic sequencing datasets from clinical samples, results were benchmarked against tools used by the bioinformatics research community such as Bowtie2, Bwa, Kraken2, and Centrifuge.

Results show that GPMeta not only has higher accuracy but also exhibits significantly faster speed. In addition, GPMeta offers a GPMetaC clustering algorithm, a statistical model for clustering and re-scoring ambiguous alignments to improve the discrimination of highly homologous sequences from microbial genomes with average nucleotide identity >95%. These results underline GPMeta's key role in the development of the mNGS test in infectious diseases that require rapid turnaround times.

 

Background

The faster and earlier detection of causative pathogens is critical for precise antibiotic therapy instead of empiric treatment. It can simultaneously detect almost all new and known pathogenic microorganisms in the patient's body in one test and has huge potential applications in infection diagnosis.

mNGS detection comprises two components: wet-lab experimental manipulations involving clinical sample preprocessing, total nucleic acid extraction, library preparation and sequencing, and dry-lab bioinformatics analysis which includes raw sequencing data preprocessing, removal of human host sequences, sequence alignment to the curated pathogen database and taxonomic classification of microbial sequences.

Bioinformatics analysis is the final crucial step in mNGS detection, which needs to be completed quickly and accurately to accelerate the entire detection process. However, there is an urgent need for new strategies to accelerate the bioinformatics analysis of pathogen identification.

To meet this challenge, GPMeta uses a succinct hash index scheme and supports multiple GPUs to carry out on split databases simultaneously, which meets a growing need for the ability to deal with a rapidly expanding number of microbial genomes.

 

Method design highlights

In terms of the alignment process: Firstly, the sequenced reads are sliced into different k-mers, and a hash index algorithm is used to query the hashed pathogen database Hash DB to obtain the corresponding positions of each k-mer. Then, a DetectChain algorithm is used to associate k-mers to determine candidate alignment positions.

The highest corrected score is selected as the optimal sequence alignment position. Finally, an optional clustering model was used to optimize and re-score the alignment scores. In addition, GPMeta allowed for multi-GPU support and the whole alignment process could be distributed to multiple GPUs to carry out.

GPMetaC, as an optional module of GPMeta, adjusted the alignment scores by clustering within batches to improve the discrimination of highly homologous sequences.

 

Key results

GPMeta and GPMetaC can accurately remove human sequences and detect pathogenic sequences.

On the accuracy of removing human source sequences (as shown in Figures A and B; Figure 2):

(a) Compared with GPMeta and Bowtie2, Bwa has the highest PHR, but it also introduces higher PPA, which means that more pathogenic sequences are mistakenly removed.

(b) GPMeta and Bowtie2 have similar PHR and PPA, superior to Bwa.

On the accuracy of pathogen detection (Figures C and D; Figure 2):

(a) GPMetaC and Bwa have the highest accuracy and recall

(b) GPMetaC is slightly higher than GPMeta, indicating the impact of GPMetaC on the clustering correction function

GPMetaC can accurately detect and distinguish highly homologous species:

► Select three genera with a similarity of 85-97%, Staphylococcus (Figure A; Figure S3)

Escherichia coli (Figure B; Figure S3) and Yersinia (Figure C; Figure S3) as the simulation data set

The accuracy of GPMetaC is better than that of Bwa in Figures A and C, while the performance of both is consistent in Figure B.

In terms of accuracy and recall, GPMetaC has improved by about 2% compared to GPMeta, demonstrating the effective performance of the clustering correction function in correcting ambiguity alignment sequences.

Comparison of software running speed:

On the 25M reads dataset, GPMeta and GPMetaC only need less than 3 minutes to complete the entire detection analysis.

On the 110M reads dataset (conventional mNGS detection data volume), GPMeta and GPMetaC only need 4 minutes to complete the entire detection analysis.

When applied to the entire 190Gb pathogen library, GPMeta and GPMetaC accelerated it by 39-50 and 12-35 times respectively compared to Bwa and Bowtie2.

The entire detection and analysis GPMeta are 18 times and 12 times faster than Bwa and Bowtie2 respectively.

GPMeta has the highest accuracy for clinical samples:

Dataset evaluation of three types of tissue samples - cerebrospinal fluid, blood, and nasopharyngeal swabs - was separately conducted.

The standard for determining the true microbial sequence in the evaluation is: BLASTn compares it to the pathogen sequence library, and sequences that meet the E-value cutoff of 1x10-20, identity>95%, and match rate>0.8 are determined to come from the true microbial species. The results show that GPMeta and Bwa display the highest accuracy and recall, outperforming other commonly used software (Figures A and B; Figure 4).

► In the skin microbial sample data set, GPMeta displays the best species richness estimation, and its abundance results are closer to the species richness reported in the original literature (Figure C; Figure 4).

GPMeta is superior to the existing GPU-accelerated microbial detection tool cuCLARK:

Figure A shows that on a 20M reads simulated dataset, both GPMeta and cuCLARK full mode can complete pathogen library comparison in half a minute, while cuCLARK light mode is faster. In terms of accuracy, as shown in Figures A and B; Figure S7, GPMeta/GPMetaC can achieve an accuracy and recall rate of 96 to 98%. However, the accuracy and recall rate of both cuCLARK operating modes is less than 80%.

 

Conclusion

GPMeta is a powerful tool to timely and accurately identify pathogens from mNGS data, which is of great importance in eliminating the threat from severe acute infections and in targeting precise and effective antibiotic therapy.

Moreover, GPMeta supports multiple GPUs to perform alignment and taxonomic classification of microbial sequences on split databases simultaneously and automatically merges results from multiple sub-databases, which is significant to keep up with the rapidly expanding microbial genome database. To make the best use of GPMeta, how to best and easily integrate it into clinical practices needs further study.

We welcome non-commercial users to test-drive GPMeta at https://github.com/Bgi-LUSH/GPMeta.

 

About BGI Genomics

BGI Genomics, headquartered in Shenzhen China, is the world's leading integrated solutions provider of precision medicine. Our services cover over 100 countries and regions, involving more than 2,300 medical institutions. In July 2017, as a subsidiary of BGI Group, BGI Genomics (300676.SZ) was officially listed on the Shenzhen Stock Exchange.

 

END


[Attachments] See images for this press release:
Introducing GPMeta: Ultrarapid GPU-accelerated pathogen identification approach Introducing GPMeta: Ultrarapid GPU-accelerated pathogen identification approach 2 Introducing GPMeta: Ultrarapid GPU-accelerated pathogen identification approach 3

ELSE PRESS RELEASES FROM THIS DATE:

Alarming rates of teen suicide continue to increase in the US

Alarming rates of teen suicide continue to increase in the US
2023-04-26
In the United States suicide has become the second leading cause of premature death among those ages 10 to 24; it is the leading cause of death among teens ages 13 to 14. Researchers from Florida Atlantic University’s Schmidt College of Medicine and collaborators conducted a study exploring trends in rates of suicide among 13 to 14 year olds in the U.S. from 1999 to 2018. They also explored possible modifications by sex, race, level of urbanization, census region, month of the year and day of the week.    Results, published online ahead of print in the journal Annals of Pediatrics and Child Health, showed that among children ages 13 to 14, suicide rates ...

Thinking About an Unconventional Spelling for Your New Product or Service? You May Want to Reconsider

2023-04-26
Researchers from University of Notre Dame and The Ohio State University published a new Journal of Marketing study that examines how the use of unconventional spellings of a brand name impacts consumers’ inferences about and willingness to support the brand. The study, forthcoming in the Journal of Marketing, is titled “‘Choozing’ the Best Spelling: Consumer Response to Unconventionally Spelled Brand Names” and is authored by John P. Costello, Jesse Walker, and Rebecca Walker Reczek. Choosing a brand ...

Degrading viral RNA to treat SARS-CoV-2 infection

2023-04-26
Development of vaccines against SARS-CoV-2 has been rapid, but the rise of variants forces scientists to frequently modify treatments. Ideally, therapies would target mutation-resistant viral proteins, but this has proven difficult. Researchers reporting in ACS Central Science, however, have now developed a system that directly targets and degrades the viral RNA genome, reducing infection in mice. The method could be adapted to fight off many viruses, as well as treat various diseases. Vaccines and antiviral drugs typically target proteins critical to viral infection and replication. This ...

U.S. adults who felt discrimination at work faced increased risk of high blood pressure

2023-04-26
Research Highlights: U.S. adults who reported feeling highly discriminated against at work had an increased risk of developing high blood pressure than those who reported low discrimination at work. Researchers suggest government and employer anti-discrimination policies and interventions may help to eliminate discrimination in the workplace. Embargoed until 4 a.m. CT/5 a.m. ET Wednesday, April 26, 2023 DALLAS, April 26, 2023 — U.S. adults who reported feeling discriminated against at work had a higher risk for developing high ...

Innovative treatment targets blood clots without increased bleeding risk

2023-04-26
Safer and more effective blood thinners could be on the way following a groundbreaking discovery by researchers at UBC and the University of Michigan, published today in Nature Communications. By combining their expertise in blood clotting systems and chemical synthesis, the researchers have designed a new compound called MPI 8 that offers the potential to prevent blood clots without any increased risk of bleeding—a common side effect of existing blood thinners. “The development of MPI 8 represents a major breakthrough in the field of blood clot prevention and treatment,” said Dr. Jay Kizhakkedathu, a professor and Canada Research ...

Researchers show genetic basis of facial changes in Down Syndrome

2023-04-26
Researchers at the Francis Crick Institute, King’s College London and University College London have shed light on the genetics behind changes in the structure and shape of the face and head in a mouse model of Down Syndrome. Described in a paper published today in Development, the researchers found that having a third copy of the gene Dyrk1a and at least three other genes were responsible for these changes taking place in development – called craniofacial dysmorphology – which involve shortened back-to-front length and widened diameter of the head. Affecting ...

Gestational weight gain z scores, standardized by pre-pregnancy BMI, associated with susceptibility to autism-related traits

2023-04-26
ROCKVILLE, Md.—Gestational weight gain may be associated with autism-related behaviors among children who have a greater pre-disposition to these behaviors and who have mothers with pre-pregnancy overweight or obesity, according to a new study in Obesity, The Obesity Society’s (TOS) flagship journal. Excessive gestational weight gain has been associated with neurodevelopmental outcomes in children, including autism spectrum disorder and related traits. However, it is unclear how pre-pregnancy body mass index (BMI) or familial susceptibility to autism spectrum disorder influences the gestational weight gain-autism traits association, ...

Longer siestas linked to higher risk of obesity, metabolic syndrome, and high blood pressure

2023-04-26
It is a common custom in some countries for individuals to take a siesta or midday nap. Sleeping during the middle of the day has the potential to affect sleep quality, cognitive function, and metabolic processes. However, the relationship between siestas and metabolic health is not well understood. A new study led by investigators from Brigham and Women’s Hospital, a founding member of the Mass General Brigham healthcare system, assessed more than 3,000 adults from a Mediterranean population, examining the relationship of siestas and siesta duration with obesity and metabolic syndrome. The researchers found that those who took siestas of 30 minutes or longer (long siestas) were more ...

The hidden power of Japanese food ― inhibiting the development of liver fibrosis

The hidden power of Japanese food ― inhibiting the development of liver fibrosis
2023-04-26
Japanese food is popular worldwide and has been registered as a UNESCO Intangible Cultural Heritage. There is a scoring system named “the 12-component modified Japanese Diet Index (mJDI12),” which focuses on the intake of the Japanese diet pattern. It includes 12 foods and food groups: rice, miso soup, pickles, soy products, green and yellow vegetables, fruits, seafood, mushrooms, seaweed, green tea, coffee, and beef and pork. Scores range from 0 to 12, with higher scores indicating a diet that conforms to the Japanese food pattern. A research group led by Dr. Hideki Fujii ...

Even as SARS-CoV-2 mutates, some human antibodies fight back

Even as SARS-CoV-2 mutates, some human antibodies fight back
2023-04-26
LA JOLLA, CA—An anonymous San Diego resident has become a fascinating example of how the human immune system fights SARS-CoV-2. In a new investigation, scientists from La Jolla Institute for Immunology (LJI) have shown how antibodies, collected from this clinical study volunteer, bind to the SARS-CoV-2 "Spike" protein to neutralize the virus. Although studies have shown antibodies bound to Spike before, this new research reveals how the original Moderna SARS-CoV-2 vaccine could prompt the body to produce antibodies against the later Omicron variants of SARS-CoV-2. The researchers ...

LAST 30 PRESS RELEASES:

Firms that read more perform better

Tightly tied waist cord of saree underskirt may pose cancer risk, warn doctors

10% of children in high-burden tuberculosis settings may develop the disease by age 10

Health experts push for the elimination of a ‘remarkably harmful toxin’

University of Tennessee, Lockheed Martin expand Master Research Agreement

Testing thousands of RNA enzymes helps find first ‘twister ribozyme’ in mammals

Groundbreaking study provides new evidence of when Earth was slushy

International survey of more than 1600 biomedical researchers on the perceived causes of irreproducibility of research results

Integrating data from different experimental approaches into one model is challenging – this study presents a community-based, full-scale in silico model of the rat hippocampal CA1 region that integra

SwRI awarded grant to characterize Las Moras Springs watershed

Water overuse in MATOPIBA could mean failure to meet up to 40% of local demand for crop irrigation

An extra year of education does not protect against brain aging

Researchers from Uppsala and Magdeburg obtain an ERC Synergy Grant to advance cancer immunotherapy

Deaf male mosquitoes don’t mate

Recognizing traumatic brain injury as a chronic condition fosters better care over the survivor’s lifetime

SwRI’s Dr. James Walker receives Distinguished Scientist Award from Hypervelocity Impact Society

A mother’s health problems pose a risk to her children

Ensuring a bright future for diamond electronics and sensors

The American Pediatric Society selects Dr. Maria Trent as the Recipient of the 2025 David G. Nichols Health Equity Award

The first 3D view of the formation and evolution of globular clusters

Towards a hydrogen-powered future: highly sensitive hydrogen detection system

Scanning synaptic receptors: A game-changer for understanding psychiatric disorders

High-quality nanomechanical resonators with built-in piezoelectricity

ERC Synergy Grants for 57 teams tackling major scientific challenges

Nordic research team receives €13 million to explore medieval book culture 

The origin of writing in Mesopotamia is tied to designs engraved on ancient cylinder seals

Explaining science through dance

Pioneering neuroendocrinologist's century of discovery launches major scientific tribute series

Gendered bilingualism in post-colonial Korea

Structural safety monitoring of buildings with color variations

[Press-News.org] Introducing GPMeta: Ultrarapid GPU-accelerated pathogen identification approach
GPMeta speeds up classification of metagenomic sequences is a critical procedure for pathogen identification in the dry-lab step of mNGS tests