(Press-News.org) Chart captions that explain complex trends and patterns are important for improving a reader’s ability to comprehend and retain the data being presented. And for people with visual disabilities, the information in a caption often provides their only means of understanding the chart.
But writing effective, detailed captions is a labor-intensive process. While autocaptioning techniques can alleviate this burden, they often struggle to describe cognitive features that provide additional context.
To help people author high-quality chart captions, MIT researchers have developed a dataset to improve automatic captioning systems. Using this tool, researchers could teach a machine-learning model to vary the level of complexity and type of content included in a chart caption based on the needs of users.
The MIT researchers found that machine-learning models trained for autocaptioning with their dataset consistently generated captions that were precise, semantically rich, and described data trends and complex patterns. Quantitative and qualitative analyses revealed that their models captioned charts more effectively than other autocaptioning systems.
The team’s goal is to provide the dataset, called VisText, as a tool researchers can use as they work on the thorny problem of chart autocaptioning. These automatic systems could help provide captions for uncaptioned online charts and improve accessibility for people with visual disabilities, says co-lead author Angie Boggust, a graduate student in electrical engineering and computer science at MIT and member of the Visualization Group in the Computer Science and Artificial Intelligence Laboratory (CSAIL).
“We’ve tried to embed a lot of human values into our dataset so that when we and other researchers are building automatic chart-captioning systems, we don’t end up with models that aren’t what people want or need,” she says.
Boggust is joined on the paper by co-lead author and fellow graduate student Benny J. Tang and senior author Arvind Satyanarayan, an associate professor of computer science at MIT who leads the Visualization Group in CSAIL. The research will be presented at the Annual Meeting of the Association for Computational Linguistics.
Human-centered analysis
The researchers were inspired to develop VisText from prior work in the Visualization Group that explored what makes a good chart caption. In that study, researchers found that sighted users and blind or low-vision users had different preferences for the complexity of semantic content in a caption.
The group wanted to bring that human-centered analysis into autocaptioning research. To do that, they developed VisText, a dataset of charts and associated captions that could be used to train machine-learning models to generate accurate, semantically rich, customizable captions.
Developing effective autocaptioning systems is no easy task. Existing machine-learning methods often try to caption charts the way they would an image, but people and models interpret natural images differently from the way they read charts. Other techniques skip the visual content entirely and caption a chart using its underlying data table. However, such data tables are often not available after charts are published.
Given the shortfalls of using images and data tables, VisText also represents charts as scene graphs. Scene graphs, which can be extracted from a chart image, contain all the chart data but also include additional image context.
“A scene graph is like the best of both worlds — it contains almost all the information present in an image while being easier to extract from images than data tables. As it’s also text, we can leverage advances in modern large language models for captioning,” Tang explains.
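To make Tang's point concrete, the short Python sketch below shows one way a nested scene graph could be flattened into plain text that a language model can read. The node layout (the "role", "text", and "children" keys) and the toy chart are assumptions made for illustration, not the format VisText actually uses.

```python
def linearize(node, depth=0):
    """Depth-first walk that turns a nested scene graph into indented text lines."""
    label = node.get("role", "node")  # hypothetical key naming the element type
    text = node.get("text")           # hypothetical key holding visible text, if any
    line = "  " * depth + label + (f": {text}" if text is not None else "")
    lines = [line]
    for child in node.get("children", []):
        lines.extend(linearize(child, depth + 1))
    return lines


# Toy scene graph for a two-bar chart (invented data, not from the dataset).
scene_graph = {
    "role": "chart",
    "children": [
        {"role": "title", "text": "Sales by year"},
        {
            "role": "x-axis",
            "children": [
                {"role": "tick", "text": "2020"},
                {"role": "tick", "text": "2021"},
            ],
        },
        {"role": "mark", "text": "bar 2020 = 10"},
        {"role": "mark", "text": "bar 2021 = 14"},
    ],
}

scene_text = "\n".join(linearize(scene_graph))
print(scene_text)
```

The resulting string keeps both the data values and the visual context (title, axis ticks), which is the property the researchers highlight.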
They compiled a dataset that contains more than 12,000 charts — each represented as a data table, image, and scene graph — as well as associated captions. Each chart has two separate captions: a low-level caption that describes the chart’s construction (like its axis ranges) and a higher-level caption that describes statistics, relationships in the data, and complex trends.
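As a rough picture of what one entry in such a dataset holds, the sketch below models a single chart with its three representations and two captions. The class name, field names, and sample values are hypothetical and are not taken from the VisText files.

```python
from dataclasses import dataclass


@dataclass
class ChartRecord:
    """One chart with three representations and two captions (hypothetical schema)."""
    image_path: str           # rendered chart image
    data_table: list          # rows of the underlying data
    scene_graph: dict         # structured description of the rendered chart
    low_level_caption: str    # construction: chart type, axes, ranges
    high_level_caption: str   # statistics, relationships, trends

    def representations(self):
        """Return the three input representations keyed by name."""
        return {
            "image": self.image_path,
            "data_table": self.data_table,
            "scene_graph": self.scene_graph,
        }


# Invented example values for illustration only.
record = ChartRecord(
    image_path="charts/0001.png",
    data_table=[{"year": 2020, "sales": 10}, {"year": 2021, "sales": 14}],
    scene_graph={"role": "chart", "children": []},
    low_level_caption="A bar chart of sales by year, with the x-axis from 2020 to 2021.",
    high_level_caption="Sales rose between 2020 and 2021.",
)
print(sorted(record.representations()))
```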
The researchers generated low-level captions using an automated system and crowdsourced higher-level captions from human workers.
“Our captions were informed by two key pieces of prior research: existing guidelines on accessible descriptions of visual media and a conceptual model from our group for categorizing semantic content. This ensured that our captions featured important low-level chart elements like axes, scales, and units for readers with visual disabilities, while retaining human variability in how captions can be written,” says Tang.
Translating charts
Once they had gathered chart images and captions, the researchers used VisText to train five machine-learning models for autocaptioning. They wanted to see how each representation — image, data table, and scene graph — and combinations of the representations affected the quality of the caption.
“You can think about a chart captioning model like a model for language translation. But instead of saying, translate this German text to English, we are saying translate this ‘chart language’ to English,” Boggust says.
Their results showed that models trained with scene graphs performed as well as or better than those trained using data tables. Since scene graphs are easier to extract from existing charts, the researchers argue that they might be a more useful representation.
They also trained models with low-level and high-level captions separately. This technique, known as semantic prefix tuning, enabled them to teach the model to vary the complexity of the caption’s content.
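One common way to let a single text-to-text model switch between output styles is to prepend a short task prefix to the input. The sketch below builds training pairs that way. It is a minimal illustration under stated assumptions: the prefix strings and the function name are invented here, and the article does not specify the exact wording or pipeline the researchers used.

```python
# Hypothetical prefixes; the wording is an assumption for illustration.
PREFIXES = {
    "low": "translate chart to low-level caption: ",
    "high": "translate chart to high-level caption: ",
}


def make_examples(chart_text, captions):
    """Pair each caption with the chart text, tagged by the caption's semantic level."""
    return [
        (PREFIXES[level] + chart_text, caption)
        for level, caption in captions.items()
    ]


examples = make_examples(
    "title: Sales by year | x: 2020, 2021 | y: 10, 14",
    {
        "low": "A bar chart of sales by year.",
        "high": "Sales rose between 2020 and 2021.",
    },
)
for source, target in examples:
    print(source, "->", target)
```

At generation time, choosing one prefix or the other would then ask the model for the corresponding level of detail.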
In addition, they conducted a qualitative examination of captions produced by their best-performing method and categorized six types of common errors. For instance, a directional error occurs if a model says a trend is decreasing when it is actually increasing.
This fine-grained, robust qualitative evaluation was important for understanding how the model was making its errors. For example, using quantitative methods, a directional error might incur the same penalty as a repetition error, where the model repeats the same word or phrase. But a directional error could be more misleading to a user than a repetition error. The qualitative analysis helped them understand these types of subtleties, Boggust says.
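The point about equal penalties can be seen with a toy word-overlap score. The sketch below uses a simple unigram F1 as a stand-in for automatic text metrics in general; it is not one of the metrics reported in the paper, and the sentences are invented.

```python
from collections import Counter


def unigram_f1(candidate, reference):
    """Word-overlap F1 with clipped counts; a simplified stand-in for automatic metrics."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # each word counts at most as often as in the reference
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


reference = "sales increased steadily from 2010 to 2020"
directional = "sales decreased steadily from 2010 to 2020"     # wrong trend direction
repetition = "sales increased steadily steadily 2010 to 2020"  # repeated word

directional_score = unigram_f1(directional, reference)
repetition_score = unigram_f1(repetition, reference)
print(directional_score, repetition_score)
```

Both candidates differ from the reference by one word, so the score treats them alike, even though only the first one reverses the meaning of the chart.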
These sorts of errors also expose limitations of current models and raise ethical questions that researchers must consider as they work to develop autocaptioning systems, she adds.
Generative machine-learning models, such as those that power ChatGPT, have been shown to hallucinate or give incorrect information that can be misleading. While there is a clear benefit to using these models for autocaptioning existing charts, it could lead to the spread of misinformation if charts are captioned incorrectly.
“Maybe this means that we don’t just caption everything in sight with AI. Instead, perhaps we provide these autocaptioning systems as authorship tools for people to edit. It is important to think about these ethical implications throughout the research process, not just at the end when we have a model to deploy,” she says.
Boggust, Tang, and their colleagues want to continue optimizing the models to reduce some common errors. They also want to expand the VisText dataset to include more charts, and more complex charts, such as those with stacked bars or multiple lines. And they would also like to gain insights into what these autocaptioning models are actually learning about chart data.
This research was supported, in part, by a Google Research Scholar Award, the National Science Foundation, the MLA@CSAIL Initiative, and the United States Air Force Research Laboratory.
###
Written by Adam Zewe
Paper: “VisText: A Benchmark for Semantically Rich Chart Captioning”
https://vis.mit.edu/pubs/vistext.pdf
Researchers teach an AI to write better chart captions
A new dataset can help scientists develop automatic systems that generate richer, more descriptive captions for online charts.
2023-06-29