(Press-News.org) CAMBRIDGE, Mass. (January 17, 2013) – Using only a computer, an Internet connection, and publicly accessible online resources, a team of Whitehead Institute researchers has been able to identify nearly 50 individuals who had submitted personal genetic material as participants in genomic studies.
Intent on conducting an exercise in “vulnerability research”—a common practice in the field of information security—the team took a multi-step approach to prove that under certain circumstances, the full names and identities of genomic research participants can be determined, even when their genetic information is held in databases in de-identified form.
“This is an important result that points out the potential for breaches of privacy in genomics studies,” says Whitehead Fellow Yaniv Erlich, who led the research team. A description of the group’s work is published in this week’s Science magazine.
Erlich and colleagues began by analyzing unique genetic markers known as short tandem repeats on the Y chromosomes (Y-STRs) of men whose genetic material was collected by the Center for the Study of Human Polymorphisms (CEPH) and whose genomes were sequenced and made publicly available as part of the 1000 Genomes Project. Because the Y chromosome is transmitted from father to son, as are family surnames, there is a strong correlation between surnames and the DNA on the Y chromosome.
Recognizing this correlation, genealogists and genetic genealogy companies have established publicly accessible databases that house Y-STR data by surname. In a process known as “surname inference,” the Erlich team was able to discover the family names of the men by submitting their Y-STRs to these databases. With surnames in hand, the team queried other information sources, including Internet record search engines, obituaries, genealogical websites, and public demographic data from the National Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository at New Jersey’s Coriell Institute, to identify nearly 50 men and women in the United States who were CEPH participants.
Previous studies have contemplated the possibility of genetic identification by matching the DNA of a single person, assuming the person’s DNA were cataloged in two separate databases. This work, however, exploits data between distant paternally-related individuals. As a result, the team notes that the posting of genetic data from a single individual can reveal deep genealogical ties and lead to the identification of a distantly-related person who may have no acquaintance with the person who released that genetic data.
“We show that if, for example, your Uncle Dave submitted his DNA to a genetic genealogy database, you could be identified,” says Melissa Gymrek, a member of the Erlich lab and first author of the Science paper. “In fact, even your fourth cousin Patrick, whom you’ve never met, could identify you if his DNA is in the database, as long as he is paternally related to you.”
Aware of the sensitivity of his work, Erlich emphasizes that he has no intention of revealing the names of those identified, nor does he wish to see public sharing of genetic information curtailed.
“Our aim is to better illuminate the current status of identifiability of genetic data,” he says. “More knowledge empowers participants to weigh the risks and benefits and make more informed decisions when considering whether to share their own data. We also hope that this study will eventually result in better security algorithms, better policy guidelines, and better legislation to help mitigate some of the risks described.”
To that end, Erlich shared his findings with officials at the National Human Genome Research Institute (NHGRI) and NIGMS prior to publication. In response, NIGMS and NHGRI moved certain demographic information from the publicly-accessible portion the NIGMS cell repository to help reduce the risk of future breaches. In the same issue of Science in which the Erlich study appears, Judith H. Greenberg and Eric D. Green, the Directors of NIGMS and NHGRI, and colleagues author a perspective on this latest research in which they advocate for an examination of approaches to balance research participants’ privacy rights with the societal benefits to be realized from the sharing of biomedical research data.
“Yaniv’s work is a timely reminder that in this era in which massive amounts of genomic data are being generated rapidly and shared in the interest of scientific advancement, there is an increasing likelihood of privacy breaches,” says Whitehead Institute Director David Page. “I’m delighted that, thanks to Yaniv’s overture to NIH, we at Whitehead Institute have the opportunity to join policymakers at NHGRI and elsewhere in what will be a critical, ongoing dialog about the importance of safeguarding data, of sharing data, and the implications of failure in either endeavor.”
###
This work was supported by the National Defense Science & Engineering Graduate Fellowship, the Edmond J. Safra Center for Bioinformatics at Tel-Aviv University, and a gift from James and Cathleen Stone.
Written by Matt Fearer
Yaniv Erlich is the Andria and Paul Heafy Fellow of Whitehead Institute for Biomedical Research, where his laboratory is located and all his research is conducted.
Full Citation:
"Identifying Personal Genomes by Surname Inference"
Science, January 18, 2012
Melissa Gymrek (1,2,3,4), Amy L. McGuire (5), David Golan (6), Eran Halperin (7,8,9), and Yaniv Erlich (1)
1. Whitehead Institute for Biomedical Research, Nine Cambridge Center, Cambridge, MA 02142, USA.
2. Harvard-MIT Division of Health Sciences and Technology, MIT, Cambridge, MA 02139, USA.
3. Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge,
MA, 02142, USA.
4. Department of Molecular Biology and Diabetes Unit, Massachusetts General Hospital, Boston,
MA 02114, USA.
5. Center for Medical Ethics and Health Policy, Baylor College of Medicine, Houston, TX 77030, USA.
6. Department of Statistics and Operations Research, Tel Aviv University, Tel Aviv 69978, Israel.
7. School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel.
8. Department of Molecular Microbiology and Biotechnology, Tel-Aviv University, Tel Aviv 69978, Israel.
9. The International Computer Science Institute, Berkeley, CA 94704, USA.
Scientists expose new vulnerabilities in the security of personal genetic information
2013-01-18
ELSE PRESS RELEASES FROM THIS DATE:
Mouse research links adolescent stress and severe adult mental illness
2013-01-18
Working with mice, Johns Hopkins researchers have established a link between elevated levels of a stress hormone in adolescence — a critical time for brain development — and genetic changes that, in young adulthood, cause severe mental illness in those predisposed to it.
The findings, reported in the journal Science, could have wide-reaching implications in both the prevention and treatment of schizophrenia, severe depression and other mental illnesses.
"We have discovered a mechanism for how environmental factors, such as stress hormones, can affect the brain's physiology ...
Feed a cold, starve a fever…. and your worms!
2013-01-18
Contact:Gina Alvino
(415) 568-3173
plospathogens@plos.org
Disclaimer
This press release refers to an upcoming article in PLOS Pathogens. The release is provided by the article authors. Any opinions expressed in these releases or articles are the personal views of the journal staff and/or article contributors, and do not necessarily represent the views or policies of PLOS. PLOS expressly disclaims any and all warranties and liability in connection with the information found in the releases and articles and your use of such information.
Media Permissions
PLOS Journals ...
How the brain copes with multi tasking alters with age
2013-01-18
The pattern of blood flow in the prefrontal cortex in the brains alters with age during multi-tasking, finds a new study in BioMed Central's open access journal BMC Neuroscience. Increased blood volume, measured using oxygenated haemoglobin (Oxy-Hb) increased at the start of multitasking in all age groups. But to perform the same tasks, healthy older people had a higher and more sustained increase in Oxy-Hb than younger people.
Age related changes to the brain occur earliest in the prefrontal cortex, the area of the brain associated with memory, emotion, and higher decision ...
It's a dog's life: Doggy database aims to define pet health
2013-01-18
Using data collected about Labrador Retrievers, research published in BioMed Central's open access journal BMC Veterinary Research is beginning to quantify the health, illnesses, and veterinary care of dogs.
The UK is a nation of pet lovers – but what do we know about the health of our pets? To date the long term (longitudinal) study of canine diseases has been patchy, relying on information from referral centers and details about pet illnesses which are not reported to a vet have never been studied before.
The Dogslife internet-based project was organized in conjunction ...
Savanna study highlights African fuelwood crisis
2013-01-18
The dwindling reserves of fuelwood in Africa have been illuminated in a new study published today, which shows a bleak outlook for supplies across savannas in South Africa.
Presenting their findings in IOP Publishing's journal Environmental Research Letters, researchers have found that at current consumption levels in the communal areas of Lowveld, South Africa, reserves of fuelwood could be totally exhausted within 13 years.
The consequences are significant, with around half of the 2.4 million rural households in the country using wood as their primary fuel source, ...
Molecular twist helps regulate the cellular message to make histone proteins
2013-01-18
(Embargoed) CHAPEL HILL, N.C. – Histone proteins are the proteins that package DNA into chromosomes. Every time the cell replicates its DNA it must make large amounts of newly made histones to organize DNA within the nucleus.
An imbalance in the production of DNA and histones is usually lethal for the cell, which is why the levels of the messenger RNA (mRNA) encoding the histone proteins must be tightly controlled to ensure the proper amounts of histones (not too many and not too few) are made.
In a collaborative effort published online in the January 18, 2013 issue ...
Inadequate food facilities in NC migrant camps could cause illness
2013-01-18
WINSTON-SALEM, N.C. – Jan. 17, 2013 – Farmworkers are at potential risk from food and waterborne illnesses because of the condition of cooking and eating facilities available to them, according to a new study from Wake Forest Baptist Medical Center.
Researchers from Wake Forest Baptist are the first to evaluate cooking and eating facilities in migrant farmworker camps to compare against established housing regulations. They found that the facilities fail to comply with regulations in a substantial number of camps. The study, which appears online today in the January issue ...
A global approach to monitoring biodiversity loss
2013-01-18
In contrast to climate change, there is no coordinated global system in place for measuring and reporting on biodiversity change or loss. An international team of biologists is now addressing this gap.
In Science today, 30 researchers led by Henrique Miguel Pereira, from the Centre for Environmental Biology of the University of Lisbon, proposed a global biodiversity monitoring system based on a set of essential variables.
By determining the most essential measurements to accurately and usefully report on biodiversity loss, known as essential biodiversity variables (EBVs), ...
Weight loss helps to oust worms
2013-01-18
Scientists from The University of Manchester have discovered that weight loss plays an important role in the body's response to fighting off intestinal worms.
The findings have been published in the journal PLOS Pathogens and show that the immune system hijacks the natural feeding pathways causing weight loss. This then drives the defense mechanisms down the correct pathway to expel the worms.
Nearly one quarter of the world's population is infected with gastrointestinal parasites. These prevalent infections often result in a period of reduced appetite resulting in ...
Sniffing immune cells
2013-01-18
This press release is available in German.
Immune cells constantly patrol our body to check for foreign invaders, such as bacteria or viruses. To do so they leave the blood stream, actively crawl through tissues and finally re-enter the circulation via lymphatic vessels. Research from the laboratory of Michael Sixt elucidates how the cells are guided through tissues like the skin. It is thought that cells either sense their environment by 'touching' or 'smelling': They adhere to structural molecules like connective tissue proteins using adhesion receptors. Or ...