PRESS-NEWS.org - Press Release Distribution
PRESS RELEASES DISTRIBUTION

AI vision, reinvented: The power of synthetic data

New, open-source tool creates synthetic diagrams, charts and documents to help vision-language models “see” more clearly

2025-07-21
(Press-News.org) In the race to develop AI that understands complex images like financial forecasts, medical diagrams and nutrition labels — essential for AI to operate independently in everyday settings — closed-source systems like ChatGPT and Claude currently set the pace. But no one outside their makers knows how those models were trained or what data they used, leaving open-source alternatives scrambling to catch up. 

Now, researchers at Penn Engineering and the Allen Institute for AI (Ai2) have developed a new approach to train open-source models: using AI to create scientific figures, charts and tables that teach other AI systems how to interpret complex visual information. 

Their tool, CoSyn (short for Code-Guided Synthesis), taps open-source AI models’ coding skills to render text-rich images and generate relevant questions and answers, giving other AI systems the data they need to learn how to “see” and understand scientific figures.

As the researchers detail in a paper for ACL 2025, one of the world’s leading AI conferences, CoSyn-trained models match or outperform their proprietary peers. “This is like taking a student who’s great at writing and asking them to teach someone how to draw, just by describing what the drawing should look like,” says Yue Yang (GrEng’25), co-first author and Research Scientist at Ai2’s PRIOR: Perceptual Reasoning and Interaction Research group. “We’re essentially transferring the strengths of open-source AI from text to vision.”

Synthetic Images, Real Results

The resulting dataset, called CoSyn-400K, includes more than 400,000 synthetic images and 2.7 million sets of corresponding instructions, in categories as varied as scientific charts, chemical structures and user-interface screenshots. CoSyn-trained models outperformed top proprietary systems like GPT-4V and Gemini 1.5 Flash on a suite of seven benchmark tests.

In one particularly striking case, the researchers synthetically generated just 7,000 nutrition labels to train a model for a new benchmark they created, NutritionQA. That small, targeted dataset enabled their model to beat others trained on millions of real images.

“Training AI with CoSyn is incredibly data efficient,” says Mark Yatskar, Assistant Professor in CIS and Yang’s doctoral co-advisor. “We’re showing that synthetic data can help models generalize to real-world scenarios that could be unique to a person's needs, like reading a nutrition label for someone with low vision.”

Scaling and Diversifying the Dataset

Creating hundreds of thousands of useful, varied training examples posed its own challenges. 

To reach the scale required, co-first-author Ajay Patel, a doctoral student in Computer and Information Science (CIS), developed a software library called DataDreamer that automated the entire process of generating data. This allowed the team to prompt language models in parallel, enabling large-scale production of synthetic images and instructions.

In order to avoid repetition, the team leveraged “personas,” short character profiles like “a sci-fi novelist” or “a chemistry teacher,” which guided the AI’s responses and shaped the content and tone of each example. Embedding these personas into prompts led CoSyn to produce richer, more varied training data across a wide range of domains.

“AI models tend to repeat themselves unless you nudge them into different perspectives,” explains Patel. “Personas give us a scalable way to do that, and the results speak for themselves.”

Leveling the Playing Field for Open-Source AI

By building CoSyn entirely with open-source tools, the researchers hope to democratize access to powerful vision-language training methods without the ethical and legal challenges surrounding web scraping and copyrighted content.

“This is a step towards AI helping us make new scientific discoveries,” adds Chris Callison-Burch, Professor in CIS, who co-advised Yang and currently advises Patel. “It opens the door to AI systems that can reason about scientific documents, which could help a wide range of people, from college students to researchers.”

From Understanding to Action

The team has released the full CoSyn code and dataset to the public, inviting the global research community to build upon their work. 

Yang is already looking ahead to synthetic data that can help AI not only understand images, but also interact with them, serving as intelligent digital agents that can click buttons, fill out forms and assist users in daily tasks. 

“In the long run, we want AI that can act in the world, not just describe it,” Yang says. “This is one way to teach it how.”

This research was conducted during Yang’s internship with the PRIOR team at Ai2 and supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA) via the HIATUS Program contract #2022-22072200005, the Defense Advanced Research Projects Agency’s (DARPA) SciFy program (Agreement No. HR00112520300), and the Penn ASSET center and Ai2.

END


ELSE PRESS RELEASES FROM THIS DATE:

Chemical shield stops stressed DNA from triggering disease

2025-07-21
When environmental stress harms DNA, it can set off a cascade of failures linked to heart conditions, neurodegeneration, and chronic inflammation. A new chemical tool developed at UC Riverside interrupts that process, helping preserve DNA before the damage leads to disease. The study, published in the German Chemical Society journal Angewandte Chemie International Edition, focused on mitochondrial DNA, which is separate from the DNA housed in a cell’s nucleus. While nuclear DNA contains the vast majority of the genetic code, mitochondria carry their own smaller genomes that are essential for ...

Genetic test predicts obesity in childhood

2025-07-21
What if we could prevent people from developing obesity? The World Obesity Federation expects more than half the global population to develop overweight or obesity by 2035. However, treatment strategies such as lifestyle change, surgery and medications are not universally available or effective. By drawing on genetic data from over five million people, an international team of researchers has created a genetic test called a polygenic risk score (PGS) that predicts adulthood obesity already in early childhood. This finding could help to identify children ...

Arctic winter reaches melting point: scientists witness dramatic thaw in Svalbard

2025-07-21
A new commentary published in Nature Communications by Dr James Bradley, Reader in Environmental Science at Queen Mary University of London, and his team reveals a dramatic and concerning shift in the Arctic winter. During a fieldwork campaign in Svalbard in February 2025, researchers encountered exceptionally high temperatures, widespread snowmelt, and blooming vegetation.  Svalbard, warming at six to seven times the global average rate, is at the forefront of the climate crisis, with winter ...

New genetic analysis predicts risk of adult obesity from childhood

2025-07-21
A new genetic analysis using data from over five million people has provided a clearer understanding of the risk of going on to live with obesity.  New research led by the Universities of Copenhagen and Bristol shows analysing genes at a young age may support early strategies to prevent obesity developing later in life. The World Obesity Federation expects more than half the global population to become overweight or obese by 2035. However, treatment strategies such as lifestyle change, surgery and medications are not universally available or effective. By drawing on genetic data from over five million people, ...

Gecko-inspired cancer therapy could lead to fewer side-effects, better patient outcomes

2025-07-21
As far back as the 4th Century B.C., Aristotle marveled at the nimble gecko's ability to “run up and down a tree in any way, even with the head downwards.”  Its grippy toes, able to latch on to even the slipperiest surface with extraordinary force, have inspired everything from super glues to “Superman” climbing suits to sponges for soaking up environmental toxins. Now CU Boulder scientists have taken a cue from the reptile to develop a material able to stick to tumors inside the body, pumping out chemotherapy drugs for days. The technology, developed ...

How accurately are racial minorities represented in US cancer registration systems?

2025-07-21
Tracking race-specific rates of cancer incidence and mortality is important for identifying racial differences in these outcomes and for monitoring efforts aimed at achieving the highest level of health for all. Researchers have assessed how well US race data collection standards and their revisions have captured cancer burdens for various racial groups over the years. The findings are published by Wiley online in CANCER, a peer-reviewed journal of the American Cancer Society. Race data collection has followed recommendations from the US Office of ...

Bench-pressing cells

2025-07-21
Immune responses rely on the efficient movement of immune cells within the complex and geometrically unpredictable three-dimensional tissues that make up our bodies. Recent research by the Sixt group at the Institute of Science and Technology Austria (ISTA) unveils how immune cells use their cytoskeleton to exert forces on their surrounding environment to push their way through tissues. The findings were published in Nature Immunology. “Eww; what, inside of me?” A common response when Patricia Reis-Rodrigues, a PhD student in the Sixt group at ISTA, reveals ...

Potty pressure: 1 in 5 parents report struggles with toilet training

2025-07-21
ANN ARBOR, Mich. – Transitioning from diapers to the toilet is a major step for young children — and their parents. Now a new report shines a light on just how bumpy that journey can be. One in five parents say their child had potty anxiety during toilet training and another one in five say the process was harder than they expected, according to the University of Michigan Health C.S. Mott Children’s Hospital National Poll on Children’s Health. “Learning to use the toilet is a major step in a young child’s development and requires time, patience, and consistency,” said Mott Poll Co-Director and Mott pediatrician Susan Woolford, M.D. “Our ...

Tumor-targeting fluorescent bacteria illuminate cancer for precision surgery

2025-07-21
Accurate removal of tumors is the most critical aspect of cancer surgery, yet it remains a significant challenge in clinical practice. In breast cancer, for example, the positive margin rate—where cancer cells remain at the surgical boundary—can reach up to 35%, often requiring reoperation and increasing the risk of recurrence. Preoperative imaging or ultrasound is often insufficient to fully identify tumor boundaries, forcing surgeons to rely heavily on experience. These limitations highlight the urgent need for technologies that can provide real-time tumor visualization during surgery. A joint research team led by ...

Global study of more than 100,000 young people latest to link early smartphone ownership with poorer mental health in young adults

2025-07-21
Owning a smartphone before age 13 is associated with poorer mind health and wellbeing in early adulthood, according to a global study of more than 100,000 young people. Published today in the peer-reviewed Journal of Human Development and Capabilities, the study found that 18- to 24-year-olds who had received their first smartphone at age 12 or younger were more likely to report suicidal thoughts, aggression, detachment from reality, poorer emotional regulation, and low self-worth. The data also shows ...

LAST 30 PRESS RELEASES:

Solvent selection tool boosts thermoelectric devices

Collecting large-scale data from impoverished communities

Neuroanatomy of social dominance

Reference genomes for rice’s wild relatives may boost future crops

How AI can enhance early detection of emerging viruses: UNLV study

Surface structure engineering of PtCu clusters enhances the performance of propane dehydrogenation

Gemini North discovers long-predicted stellar companion of Betelgeuse

Hollow molecules offer sustainable hydrocarbon separation

High-performance near-Infrared computational spectrometer enabled by finely-tuned PbS quantum dots

Hyaluronidase nanogel-armed CAR-T cell for improving efficacy against solid tumors

Tailored hard/soft magnetic heterostructure anchored on 2D carbon nanosheet for efficient microwave absorption and anti-corrosion property

A novel strategy for modulating the crystalline-amorphous composites and electronic structure to enhance hydrogen evolution reaction

Metal-free catalysts break through in green H2O2 synthesis! Novel organic semiconductors enable high-efficiency interfacial reactions

Do these two cancer drugs have what it takes to beat Alzheimer’s?

Genome editing corrected rare brain mutations in mice. Could it help fight neurological diseases?

Prime editing treats childhood brain disease in mice

Estimated out-of-pocket costs for patients with common cancers and private insurance

Finding human brain genes in duplicated DNA

SwRI experiments may explain mysterious distribution of hydrogen peroxide on Europa

New research reveals how autistic teens’ brains respond in some social settings, helping them ‘pass’ as non-autistic

GLP-1 drugs fail to provide key long-term health benefit

FloodPlanet dataset enhances global inundation monitoring

Focus in flashes: How the brain handles overload

Breaking the crystalline barrier: Amorphous nanomaterials in advanced photocatalysis

SwRI’s Sidney Chocron named Ballistics Science Fellow

Turning waste alkaline water directly into clean hydrogen!

Astronomers witness newborn planet sculpting the dust around it

AI vision, reinvented: The power of synthetic data

Chemical shield stops stressed DNA from triggering disease

Genetic test predicts obesity in childhood

[Press-News.org] AI vision, reinvented: The power of synthetic data
New, open-source tool creates synthetic diagrams, charts and documents to help vision-language models “see” more clearly