(Press-News.org) CAMBRIDGE, Mass. -- In 2001, the Human Genome Project and Celera Genomics announced that after 10 years of work at a cost of some $400 million, they had completed a draft sequence of the human genome. Today, sequencing a human genome is something that a single researcher can do in a couple of weeks for less than $10,000.
Since 2002, the rate at which genomes can be sequenced has been doubling every four months or so, whereas computing power doubles only every 18 months. Without the advent of new analytic tools, biologists' ability to generate genomic data will soon outstrip their ability to do anything useful with it.
In the latest issue of Nature Biotechnology, MIT and Harvard University researchers describe a new algorithm that drastically reduces the time it takes to find a particular gene sequence in a database of genomes. Moreover, the more genomes it's searching, the greater the speedup it affords, so its advantages will only compound as more data is generated.
In some sense, this is a data-compression algorithm — like the one that allows computer users to compress data files into smaller zip files. "You have all this data, and clearly, if you want to store it, what people would naturally do is compress it," says Bonnie Berger, a professor of applied math and computer science at MIT and senior author on the paper. "The problem is that eventually you have to look at it, so you have to decompress it to look at it. But our insight is that if you compress the data in the right way, then you can do your analysis directly on the compressed data. And that increases the speed while maintaining the accuracy of the analyses."
Exploiting redundancy
The researchers' compression scheme exploits the fact that evolution is stingy with good designs. There's a great deal of overlap in the genomes of closely related species, and some overlap even in the genomes of distantly related species: That's why experiments performed on yeast cells can tell us something about human drug reactions.
Berger; her former grad student Michael Baym PhD '09, who's now a visiting scholar in the MIT math department and a postdoc in systems biology at Harvard Medical School; and her current grad student Po-Ru Loh developed a way to mathematically represent the genomes of different species — or of different individuals within a species — such that the overlapping data is stored only once. A search of multiple genomes can thus concentrate on their differences, saving time.
"If I want to run a computation on my genome, it takes a certain amount of time," Baym explains. "If I then want to run the same computation on your genome, the fact that we're so similar means that I've already done most of the work."
In experiments on a database of 36 yeast genomes, the researchers compared their algorithm to one called BLAST, for Basic Local Alignment Search Tool, one of the most commonly used genomic-search algorithms in biology. In a search for a particular genetic sequence in only 10 of the yeast genomes, the new algorithm was twice as fast as BLAST; but in a search of all 36 genomes, it was four times as fast. That discrepancy will only increase as genomic databases grow larger, Berger explains.
Matchmaking
The new algorithm would be useful in any application where the central question is, as Baym puts it: "I have a sequence; what is it similar to?" Identifying microbes is one example. The new algorithm could help clinicians determine causes of infections, or it could help biologists characterize "microbiomes," collections of microbes found in animal tissue or particular microenvironments; variations in the human microbiome have been implicated in a range of medical conditions. It could be used to characterize the microbes in particularly fertile or infertile soil, and it could even be used in forensics, to determine the geographical origins of physical evidence by its microbial signatures.
Berger's group is currently working to extend the technique to information on proteins and RNA sequences, where it could pay even bigger dividends. Now that the human genome has been mapped, the major questions in biology are what genes are active when, and how the proteins they code for interact. Searches of large databases of biological information are crucial to answering both questions.
### Written by Larry Hardesty, MIT News Office
Searching genomic data faster
Biologists' capacity for generating genomic data is increasing more rapidly than computing power - A new algorithm will help them keep up
2012-07-11
ELSE PRESS RELEASES FROM THIS DATE:
Tobacco use more prevalent among African-American adolescents living in public housing communities
2012-07-11
COLUMBIA, Mo. – Today, nearly 4,000 adolescents in the United States will smoke their first cigarette, and about a fourth of those youth will become daily smokers, the U.S. Department of Health and Human Services reports. A recent study by a University of Missouri researcher found that African-American youths who live in public housing communities are 2.3 times more likely to use tobacco than other African-American youths.
"Compared to their same-aged peers, youth living in public housing were more likely to use tobacco and have positive attitudes about using tobacco," ...
U-M researchers identify new genetic cause for chronic kidney disease
2012-07-11
Ann Arbor, Mich. — A new single-gene cause of chronic kidney disease has been discovered that implicates a disease mechanism not previously believed to be related to the disease, according to new research from the University of Michigan.
The research was published July 8 in the journal Nature Genetics.
"In developed countries, the frequency of chronic kidney disease is continually increasing for unknown reasons. The disease is a major health burden," says Friedhelm Hildebrandt, M.D., the paper's senior author and professor of pediatrics and of human genetics at C.S. ...
First seabed sonar to measure marine energy effect on environment and wildlife
2012-07-11
UK scientists will measure the effect on the marine environment and wildlife of devices that harness tide and wave energy using sonar technology that has, for the first time, been successfully deployed on the seabed.
Renewable energy from tidal currents can be generated using turbines in the tidal flow, and wave energy can be captured in a number of different ways. FLOWBEC (Flow and Benthic Ecology 4D) is a three-year, £1.2 million project that brings together a consortium of researchers to investigate the effects of such devices by monitoring environment and wildlife ...
Climate change may lead to fewer but more violent thunderstorms
2012-07-11
Researchers are working to identify exactly how a changing climate will impact specific elements of weather, such as clouds, rainfall, and lightning. A Tel Aviv University researcher has predicted that for every one degree Celsius of warming, there will be approximately a 10 percent increase in lightning activity.
This could have negative consequences in the form of flash floods, wild fires, or damage to power lines and other infrastructure, says Prof. Colin Price, Head of the Department of Geophysics, Atmospheric and Planetary Sciences at Tel Aviv University. In an ongoing ...
Researchers develop new possibilities for solar power
2012-07-11
Two Queen's researchers have contributed to a significant breakthrough in solar technology. Their research has led to a new solar photovoltaic thermal (PVT) system that generates both electricity and heat.
Solar PVTs are normally made with crystal silicon cells which generate electricity, but little heat. Stephen Harrison and Joshua Pearce (Mechanical and Materials Engineering) designed and tested amorphous silicon cells in a PVT system. Their research shows increased heat generation because of higher operating temperatures and 10 per cent more solar electric output.
"These ...
New biofuel process dramatically improves energy recovery
2012-07-11
EAST LANSING, Mich. — A new biofuel production process created by Michigan State University researchers produces energy more than 20 times higher than existing methods.
The results, published in the current issue of Environmental Science and Technology, showcase a novel way to use microbes to produce biofuel and hydrogen, all while consuming agricultural wastes.
Gemma Reguera, MSU microbiologist, has developed bioelectrochemical systems known as microbial electrolysis cells, or MECs, using bacteria to breakdown and ferment agricultural waste into ethanol. Reguera's ...
Toward achieving 1 million times increase in computing efficiency
2012-07-11
Modern-day computers are based on logic circuits using semiconductor transistors. To increase computing power, smaller transistors are required. Moore's Law states that the number of transistors that can fit on an integrated circuit should double every two years due to scaling. But as transistors reach atomic dimensions, achieving this feat is becoming increasingly difficult. Among the most significant challenges is heat dissipation from circuits created using today's standard semiconductor technology, complementary metal-oxide semiconductor (CMOS), which give off more ...
New CDC study on racial disparities in infant mortality published in Journal of Women's Health
2012-07-11
New Rochelle, NY, July 10, 2012—Improving access to health care for minority women of childbearing age could improve pregnancy outcomes and reduce racial differences in infant mortality, according to an article in Journal of Women's Health, a peer-reviewed publication from Mary Ann Liebert, Inc., publishers. The article is available free on the Journal of Women's Health website at http://www.liebertpub.com/jwh.
Infant mortality rates for non-Hispanic blacks and other minorities are much higher than for non-Hispanic whites. Better preconception heath care for women is ...
Waste to watts: Improving microbial fuel cells
2012-07-11
Some of the planet's tiniest inhabitants may help address two of society's biggest environmental challenges: how to deal with the vast quantities of organic waste produced and where to find clean, renewable energy.
According to César Torres and Sudeep Popat, researchers at Arizona State University's Biodesign Institute, certain kinds of bacteria are adept at converting waste into useful energy. These microorganisms are presently being applied to the task, through an innovative technology known as a microbial fuel cell or MFC.
As Torres explains, "the great advantage ...
Do people want to know if they are at risk for Alzheimer's disease?
2012-07-11
New Rochelle, NY, July 9, 2012—Genetic tests exist to identify risk for the rare inherited form of early-onset Alzheimer's disease (AD) and to predict susceptibility to the more common, late-onset form of AD, but do people want to know, and how do they react? The answers can be found in the article published in Genetic Testing and Molecular Biomarkers, a peer-reviewed journal from Mary Ann Liebert, Inc. The article is available free on the Genetic Testing and Molecular Biomarkers website.
"This article addresses a major disease of tremendous impact on increasing numbers ...
LAST 30 PRESS RELEASES:
Cocoa or green tea could protect you from the negative effects of fatty foods during mental stress - study
A new model to explore the epidermal renewal
Study reveals significant global disparities in cancer care across different countries
Proactively screening diabetics for heart disease does not improve long-term mortality rates or reduce future cardiac events, new study finds
New model can help understand coexistence in nature
National Poll: Some parents need support managing children's anger
Political shadows cast by the Antarctic curtain
Scientists lead study on ‘spray on, wash off’ bandages for painful EB condition
A new discovery about pain signalling may contribute to better treatment of chronic pain
Migrating birds have stowaway passengers: invasive ticks could spread novel diseases around the world
Diabetes drug shows promise in protecting kidneys
Updated model reduces liver transplant disparities for women
Risk of internal bleeding doubles when people on anticoagulants take NSAID painkiller
‘Teen-friendly’ mindfulness therapy aims to help combat depression among teenagers
Innovative risk score accurately calculates which kidney transplant candidates are also at risk for heart attack or stroke, new study finds
Kidney outcomes in transthyretin amyloid cardiomyopathy
Partial cardiac denervation to prevent postoperative atrial fibrillation after coronary artery bypass grafting
Finerenone in women and men with heart failure with mildly reduced or preserved ejection fraction
Finerenone, serum potassium, and clinical outcomes in heart failure with mildly reduced or preserved ejection fraction
Hormone therapy reshapes the skeleton in transgender individuals who previously blocked puberty
Evaluating performance and agreement of coronary heart disease polygenic risk scores
Heart failure in zero gravity— external constraint and cardiac hemodynamics
Amid record year for dengue infections, new study finds climate change responsible for 19% of today’s rising dengue burden
New study finds air pollution increases inflammation primarily in patients with heart disease
AI finds undiagnosed liver disease in early stages
The American Society of Tropical Medicine and Hygiene and the Bill & Melinda Gates Foundation announce new research fellowship in malaria genomics in honor of professor Dominic Kwiatkowski
Excessive screen time linked to early puberty and accelerated bone growth
First nationwide study discovers link between delayed puberty in boys and increased hospital visits
Traditional Mayan practices have long promoted unique levels of family harmony. But what effect is globalization having?
New microfluidic device reveals how the shape of a tumour can predict a cancer’s aggressiveness
[Press-News.org] Searching genomic data fasterBiologists' capacity for generating genomic data is increasing more rapidly than computing power - A new algorithm will help them keep up