PRESS-NEWS.org - Press Release Distribution

AI vision, reinvented: The power of synthetic data

New, open-source tool creates synthetic diagrams, charts and documents to help vision-language models “see” more clearly

2025-07-21
(Press-News.org) In the race to develop AI that understands complex images like financial forecasts, medical diagrams and nutrition labels — essential for AI to operate independently in everyday settings — closed-source systems like ChatGPT and Claude currently set the pace. But no one outside their makers knows how those models were trained or what data they used, leaving open-source alternatives scrambling to catch up. 

Now, researchers at Penn Engineering and the Allen Institute for AI (Ai2) have developed a new approach to train open-source models: using AI to create scientific figures, charts and tables that teach other AI systems how to interpret complex visual information. 

Their tool, CoSyn (short for Code-Guided Synthesis), taps open-source AI models’ coding skills to render text-rich images and generate relevant questions and answers, giving other AI systems the data they need to learn how to “see” and understand scientific figures.
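The idea behind code-guided synthesis can be illustrated with a minimal sketch. It is not CoSyn's actual code: the function names are hypothetical, the "model" is a hard-coded stand-in for a text-only language model call, and the chart is drawn as a plain SVG string so the example runs without extra libraries. The point it shows is that when an image is rendered from text (data plus code), the questions and answers can be derived from that same text rather than from pixels.

```python
def fake_model_generate_chart_spec(topic: str) -> dict:
    """Hypothetical stand-in for a language model call.

    A real pipeline would ask the model to write the data and rendering code;
    here we return fixed chart data so the sketch is runnable offline.
    """
    return {
        "title": f"{topic} by quarter",
        "labels": ["Q1", "Q2", "Q3", "Q4"],
        "values": [12, 18, 9, 24],
    }


def render_bar_chart_svg(spec: dict, width: int = 400, height: int = 200) -> str:
    """Render the spec as a simple SVG bar chart (the synthetic 'image')."""
    top = max(spec["values"])
    bar_w = width // len(spec["values"])
    parts = []
    for i, (label, value) in enumerate(zip(spec["labels"], spec["values"])):
        bar_h = int((height - 40) * value / top)  # scale bars to the tallest value
        x = i * bar_w + 10
        y = height - 20 - bar_h
        parts.append(f'<rect x="{x}" y="{y}" width="{bar_w - 20}" height="{bar_h}"/>')
        parts.append(f'<text x="{x}" y="{height - 5}">{label}</text>')
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" width="{width}" height="{height}">'
        f'<title>{spec["title"]}</title>' + "".join(parts) + "</svg>"
    )


def make_qa_pairs(spec: dict) -> list[dict]:
    """Derive question-answer pairs from the underlying data, not from the image."""
    pairs = list(zip(spec["labels"], spec["values"]))
    best_label, _ = max(pairs, key=lambda p: p[1])
    first_label, first_value = pairs[0]
    return [
        {"question": "Which category has the highest value?", "answer": best_label},
        {"question": f"What is the value for {first_label}?", "answer": str(first_value)},
        {"question": "What is the total across all categories?", "answer": str(sum(spec["values"]))},
    ]


# One synthetic training example: an image plus grounded questions and answers.
spec = fake_model_generate_chart_spec("Revenue")
example = {"image_svg": render_bar_chart_svg(spec), "qa": make_qa_pairs(spec)}
```

Because the answers come from the same specification that produced the image, they are correct by construction, which is what makes this style of data generation attractive for training.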

As the researchers detail in a paper for ACL 2025, one of the world’s leading AI conferences, CoSyn-trained models match or outperform their proprietary peers. “This is like taking a student who’s great at writing and asking them to teach someone how to draw, just by describing what the drawing should look like,” says Yue Yang (GrEng’25), co-first author and Research Scientist at Ai2’s PRIOR: Perceptual Reasoning and Interaction Research group. “We’re essentially transferring the strengths of open-source AI from text to vision.”

Synthetic Images, Real Results

The resulting dataset, called CoSyn-400K, includes more than 400,000 synthetic images and 2.7 million sets of corresponding instructions, in categories as varied as scientific charts, chemical structures and user-interface screenshots. CoSyn-trained models outperformed top proprietary systems like GPT-4V and Gemini 1.5 Flash on a suite of seven benchmark tests.

In one particularly striking case, the researchers synthetically generated just 7,000 nutrition labels to train a model for a new benchmark they created, NutritionQA. That small, targeted dataset enabled their model to beat others trained on millions of real images.

“Training AI with CoSyn is incredibly data efficient,” says Mark Yatskar, Assistant Professor in Computer and Information Science (CIS) and Yang’s doctoral co-advisor. “We’re showing that synthetic data can help models generalize to real-world scenarios that could be unique to a person’s needs, like reading a nutrition label for someone with low vision.”

Scaling and Diversifying the Dataset

Creating hundreds of thousands of useful, varied training examples posed its own challenges. 

To reach the scale required, co-first author Ajay Patel, a doctoral student in Computer and Information Science (CIS), developed a software library called DataDreamer that automated the entire process of generating data. This allowed the team to prompt language models in parallel, enabling large-scale production of synthetic images and instructions.
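Prompting models in parallel can be sketched with Python's standard library alone. The example below does not use or represent DataDreamer's API; `fake_model_call` is a hypothetical stand-in for a network request to a language model, and the prompts are invented for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def fake_model_call(prompt: str) -> str:
    """Hypothetical stand-in for a (slow, I/O-bound) request to a language model."""
    return f"response to: {prompt}"


def run_prompts_in_parallel(prompts: list[str], max_workers: int = 8) -> list[str]:
    """Send many prompts concurrently; results come back in the input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(fake_model_call, prompts))


prompts = [f"Write plotting code for chart #{i}" for i in range(5)]
responses = run_prompts_in_parallel(prompts)
```

Threads suit this kind of workload because each worker spends most of its time waiting on a remote model rather than computing locally.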

To avoid repetition, the team used “personas,” short character profiles such as “a sci-fi novelist” or “a chemistry teacher,” which guided the AI’s responses and shaped the content and tone of each example. Embedding these personas into prompts led CoSyn to produce richer, more varied training data across a wide range of domains.

“AI models tend to repeat themselves unless you nudge them into different perspectives,” explains Patel. “Personas give us a scalable way to do that, and the results speak for themselves.”
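The persona technique described above can be sketched in a few lines. The prompt wording, the function names and the last two personas are assumptions made for illustration; only “a sci-fi novelist” and “a chemistry teacher” come from the researchers' description.

```python
import random

# The first two personas are quoted in the article; the others are hypothetical.
PERSONAS = [
    "a sci-fi novelist",
    "a chemistry teacher",
    "a financial analyst",
    "a nutritionist",
]


def build_prompt(persona: str, image_type: str) -> str:
    """Embed a persona in the prompt so the model writes from that perspective."""
    return (
        f"You are {persona}. Propose a topic and the data for a {image_type} "
        f"that someone like you would create."
    )


def sample_prompts(image_type: str, n: int, seed: int = 0) -> list[str]:
    """Draw a random persona for each of n prompts (seeded for repeatability)."""
    rng = random.Random(seed)
    return [build_prompt(rng.choice(PERSONAS), image_type) for _ in range(n)]


prompts = sample_prompts("bar chart", 4)
```

Varying the persona changes the starting point of each request, which is one inexpensive way to push a model away from producing near-identical examples.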

Leveling the Playing Field for Open-Source AI

By building CoSyn entirely with open-source tools, the researchers hope to democratize access to powerful vision-language training methods without the ethical and legal challenges surrounding web scraping and copyrighted content.

“This is a step towards AI helping us make new scientific discoveries,” adds Chris Callison-Burch, Professor in CIS, who co-advised Yang and currently advises Patel. “It opens the door to AI systems that can reason about scientific documents, which could help a wide range of people, from college students to researchers.”

From Understanding to Action

The team has released the full CoSyn code and dataset to the public, inviting the global research community to build upon their work. 

Yang is already looking ahead to synthetic data that can help AI not only understand images, but also interact with them, serving as intelligent digital agents that can click buttons, fill out forms and assist users in daily tasks. 

“In the long run, we want AI that can act in the world, not just describe it,” Yang says. “This is one way to teach it how.”

This research was conducted during Yang’s internship with the PRIOR team at Ai2 and supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA) via the HIATUS Program contract #2022-22072200005, the Defense Advanced Research Projects Agency’s (DARPA) SciFy program (Agreement No. HR00112520300), and the Penn ASSET center and Ai2.

END

