PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

Collecting just the right data

When you can't collect all the data you need, a new algorithm tells you which to target

2014-07-25
(Press-News.org) Much artificial-intelligence research addresses the problem of making predictions based on large data sets. An obvious example is the recommendation engines at retail sites like Amazon and Netflix.

But some types of data are harder to collect than online click histories —information about geological formations thousands of feet underground, for instance. And in other applications — such as trying to predict the path of a storm — there may just not be enough time to crunch all the available data.

Dan Levine, an MIT graduate student in aeronautics and astronautics, and his advisor, Jonathan How, the Richard Cockburn Maclaurin Professor of Aeronautics and Astronautics, have developed a new technique that could help with both problems. For a range of common applications in which data is either difficult to collect or too time-consuming to process, the technique can identify the subset of data items that will yield the most reliable predictions. So geologists trying to assess the extent of underground petroleum deposits, or meteorologists trying to forecast the weather, can make do with just a few, targeted measurements, saving time and money.

Levine and How, who presented their work at the Uncertainty in Artificial Intelligence conference this week, consider the special case in which something about the relationships between data items is known in advance. Weather prediction provides an intuitive example: Measurements of temperature, pressure, and wind velocity at one location tend to be good indicators of measurements at adjacent locations, or of measurements at the same location a short time later, but the correlation grows weaker the farther out you move either geographically or chronologically.

Graphic Content

Such correlations can be represented by something called a probabilistic graphical model. In this context, a graph is a mathematical abstraction consisting of nodes — typically depicted as circles — and edges — typically depicted as line segments connecting nodes. A network diagram is one example of a graph; a family tree is another. In a probabilistic graphical model, the nodes represent variables, and the edges represent the strength of the correlations between them.

Levine and How developed an algorithm that can efficiently calculate just how much information any node in the graph gives you about any other — what in information theory is called "mutual information." As Levine explains, one of the obstacles to performing that calculation efficiently is the presence of "loops" in the graph, or nodes that are connected by more than one path.

Calculating mutual information between nodes, Levine says, is kind of like injecting blue dye into one of them and then measuring the concentration of blue at the other. "It's typically going to fall off as we go further out in the graph," Levine says. "If there's a unique path between them, then we can compute it pretty easily, because we know what path the blue dye will take. But if there are loops in the graph, then it's harder for us to compute how blue other nodes are because there are many different paths."

So the first step in the researchers' technique is to calculate "spanning trees" for the graph. A tree is just a graph with no loops: In a family tree, for instance, a loop might mean that someone was both parent and sibling to the same person. A spanning tree is a tree that touches all of a graph's nodes but dispenses with the edges that create loops.

Betting the Spread

Most of the nodes that remain in the graph, however, are "nuisances," meaning that they don't contain much useful information about the node of interest. The key to Levine and How's technique is a way to use those nodes to navigate the graph without letting their short-range influence distort the long-range calculation of mutual information.

That's possible, Levine explains, because the probabilities represented by the graph are Gaussian, meaning that they follow the bell curve familiar as the model of, for instance, the dispersion of characteristics in a population. A Gaussian distribution is exhaustively characterized by just two measurements: the average value — say, the average height in a population — and the variance — the rate at which the bell spreads out.

"The uncertainty in the problem is really a function of the spread of the distribution," Levine says. "It doesn't really depend on where the distribution is centered in space." As a consequence, it's often possible to calculate variance across a probabilistic graphical model without relying on the specific values of the nodes. "The usefulness of data can be assessed before the data itself becomes available," Levine says.

INFORMATION: Written by Larry Hardesty, MIT News Office


ELSE PRESS RELEASES FROM THIS DATE:

Could age of first period influence development of diseases in older women?

2014-07-25
BOSTON—A novel study shows that the age girls reach puberty is influenced by 'imprinted genes'—a subset of genes whose activity differs depending on which parent contributes the gene. This is the first evidence that imprinted genes can control the rate of development after birth and details of this study were published today in the journal Nature. Age of the first period, known as menarche, is a marker for the timing of puberty in females. Medical evidence shows that the onset of menses varies between girls, is an inherited trait, and is linked to breast cancer, diabetes ...

It takes two to court

It takes two to court
2014-07-25
KANSAS CITY, MO—Researchers at the Stowers Institute for Medical Research have identified the functions of two classes of pheromone receptors, and found pheromones crucial to triggering the mating process in mice. They found one class of receptors helps a male mouse detect pheromones that indicate when a female is present. The other class of receptors lets him know if the female mouse is ovulating and ready to mate. Both sets of pheromones are critical to trigger mating. Stowers' researchers believe mice developed this system through evolution to maximize the chance of ...

Experiences at every stage of life contribute to cognitive abilities in old age

2014-07-25
Early life experiences, such as childhood socioeconomic status and literacy, may have greater influence on the risk of cognitive impairment late in life than such demographic characteristics as race and ethnicity, a large study by researchers with the UC Davis Alzheimer's Disease Center and the University of Victoria, Canada, has found. "Declining cognitive function in older adults is a major personal and public health concern," said Bruce Reed, professor of neurology and associate director of the UC Davis Alzheimer's Disease Center. "But not all people lose cognitive ...

Why do men prefer nice women?

2014-07-25
People's emotional reactions and desires in initial romantic encounters determine the fate of a potential relationship. Responsiveness may be one of those initial "sparks" necessary to fuel sexual desire and land a second date. However, it may not be a desirable trait for both men and women on a first date. Does responsiveness increase sexual desire in the other person? Do men perceive responsive women as more attractive, and does the same hold true for women's perceptions of men? A study published in Personality and Social Psychology Bulletin seeks to answer those questions. ...

Heart attack patients could be treated more quickly after Manchester research

2014-07-25
Heart attack patients could be treated more quickly after Manchester research Clinical judgement, combined with an electrocardiogram (ECG) and blood test on arrival, is effective in reducing unnecessary hospital admissions for chest pain, a new study shows. The findings of a research group in Manchester, published in the Emergency Medicine Journal, could potentially make a huge difference to a large number of patients. Chest pain is the most common reason for emergency hospital admission. In Manchester, the incidence of premature death due to heart disease and stroke ...

Test increases odds of correct surgery for thyroid cancer patients

2014-07-25
PITTSBURGH -- The routine use of a molecular testing panel developed at UPMC greatly increases the likelihood of performing the correct initial surgery for patients with thyroid nodules and cancer, report researchers from the University of Pittsburgh Cancer Institute (UPCI), partner with UPMC CancerCenter. The test, available at the UPMC/UPCI Multidisciplinary Thyroid Center and other diagnostic testing agencies, improved the chances of patients getting the correct initial surgery by 30 percent, according to the study published this month in the Annals of Surgery. "Before ...

Brain tumor causes and risk factors elude scientists

2014-07-25
Today, nearly 700,000 people in the U.S. are living with a brain tumor, and yet, when it comes to pinpointing causes or risk factors, scientists are still searching for answers. "Right now, we don't know who, we don't know when, and we don't know why people develop brain tumors," said Elizabeth M. Wilson, MNA, President and CEO, American Brain Tumor Association. "It's frustrating for the brain tumor community, and it's why the American Brain Tumor Association funds research to pursue answers to these questions, and it's why we host this national conference to provide ...

Is Europe putting cancer research at risk?

2014-07-25
The European Society for Medical Oncology (ESMO), the leading pan-European association representing medical oncology professionals, has expressed concern that the proposed EU General Data Protection Regulation [1] could make cancer research impossible and add a significant burden to both doctors and cancer patients. The proposed wording of the regulation [2] stipulates 'explicit and specific patient consent', meaning that researchers would have to approach patients every single time research is planned in order to consult their data or use tissue samples stored for research ...

Informed consent: False positives not a worry in lung cancer study

Informed consent: False positives not a worry in lung cancer study
2014-07-25
PROVIDENCE, R.I. [Brown University] — The U.S. Preventive Services Task Force recently recommended computerized tomography (CT) lung screening for people at high risk for cancer, but a potential problem with CT is that many patients will have positive results on the screening test, only to be deemed cancer-free on further testing. Many policymakers have expressed concern that this high false-positive rate will cause patients to become needlessly upset. A new study of National Lung Screening Trial participant responses to false positive diagnoses, however, finds that those ...

Exposure to dim light at night may make breast cancers resistant to tamoxifen

2014-07-25
PHILADELPHIA — For rats bearing human breast tumors, exposure to dim light at night made the tumors resistant to the breast cancer drug tamoxifen, according to data published in Cancer Research, a journal of the American Association for Cancer Research. The negative effects of dim light exposure on tamoxifen treatment were overcome by giving rats a melatonin supplement during the night. "Resistance to tamoxifen is a growing problem among patients with hormone receptor-positive breast cancer," said Steven M. Hill, PhD, professor of structural and cellular biology and the ...

LAST 30 PRESS RELEASES:

Scientists engineer substrates hostile to bacteria but friendly to cells

New tablet shows promise for the control and elimination of intestinal worms

Project to redesign clinical trials for neurologic conditions for underserved populations funded with $2.9M grant to UTHealth Houston

Depression – discovering faster which treatment will work best for which individual

Breakthrough study reveals unexpected cause of winter ozone pollution

nTIDE January 2025 Jobs Report: Encouraging signs in disability employment: A slow but positive trajectory

Generative AI: Uncovering its environmental and social costs

Lower access to air conditioning may increase need for emergency care for wildfire smoke exposure

Dangerous bacterial biofilms have a natural enemy

Food study launched examining bone health of women 60 years and older

CDC awards $1.25M to engineers retooling mine production and safety

Using AI to uncover hospital patients’ long COVID care needs

$1.9M NIH grant will allow researchers to explore how copper kills bacteria

New fossil discovery sheds light on the early evolution of animal nervous systems

A battle of rafts: How molecular dynamics in CAR T cells explain their cancer-killing behavior

Study shows how plant roots access deeper soils in search of water

Study reveals cost differences between Medicare Advantage and traditional Medicare patients in cancer drugs

‘What is that?’ UCalgary scientists explain white patch that appears near northern lights

How many children use Tik Tok against the rules? Most, study finds

Scientists find out why aphasia patients lose the ability to talk about the past and future

Tickling the nerves: Why crime content is popular

Intelligent fight: AI enhances cervical cancer detection

Breakthrough study reveals the secrets behind cordierite’s anomalous thermal expansion

Patient-reported influence of sociopolitical issues on post-Dobbs vasectomy decisions

Radon exposure and gestational diabetes

EMBARGOED UNTIL 1600 GMT, FRIDAY 10 JANUARY 2025: Northumbria space physicist honoured by Royal Astronomical Society

Medicare rules may reduce prescription steering

Red light linked to lowered risk of blood clots

Menarini Group and Insilico Medicine enter a second exclusive global license agreement for an AI discovered preclinical asset targeting high unmet needs in oncology

Climate fee on food could effectively cut greenhouse gas emissions in agriculture while ensuring a social balance

[Press-News.org] Collecting just the right data
When you can't collect all the data you need, a new algorithm tells you which to target