(Press-News.org) Nashville, TN & Williamsburg, VA – 24 Nov 2025 – A new study published in Artif. Intell. Auton. Syst. delivers the first systematic cross-model analysis of prompt engineering for structured data generation, offering actionable guidance for developers, data scientists, and organizations leveraging large language models (LLMs) in healthcare, e-commerce, and beyond. Led by Ashraf Elnashar from Vanderbilt University, alongside co-authors Jules White (Vanderbilt University) and Douglas C. Schmidt (William & Mary), the research benchmarks six prompt styles across three leading LLMs to solve a critical challenge: balancing accuracy, speed, and cost in structured data workflows.
Structured data—from medical records and receipts to business analytics—powers essential AI-driven tasks, but its quality and efficiency depend heavily on how prompts are designed. “Prior research only scratched the surface, testing a limited set of prompts on single models,” said Elnashar, the study’s corresponding author and a researcher in Vanderbilt’s Department of Computer Science. “Our work expands the horizon by evaluating six widely used prompt formats across ChatGPT-4o, Claude, and Gemini, revealing clear trade-offs that let practitioners tailor their approach to real-world needs.”
Key Findings: Accuracy vs. Efficiency—A Clear Choice for Every Use Case
The team’s rigorous experiment, conducted across three datasets (personal stories, medical records, and receipts), measured accuracy, token cost (a key driver of API expenses), and generation time for each prompt style-LLM combination. The results uncovered distinct strengths in each model:
Claude emerged as the accuracy leader (85% overall), excelling with hierarchical prompt formats like JSON and YAML—ideal for complex, high-stakes tasks such as medical record generation where data integrity is non-negotiable.
ChatGPT-4o stood out for efficiency, delivering the lowest token usage (under 100 tokens for lightweight formats) and fastest processing times (4–6 seconds on average), making it perfect for cost-sensitive or real-time applications like e-commerce receipt processing.
Gemini offered a balanced middle ground, with solid performance across all metrics—though it showed variability with mixed-format prompts like Hybrid CSV/Prefix.
“Hierarchical formats like JSON and YAML boost accuracy but come with higher token costs, while lightweight options like CSV and simple prefixes cut latency without sacrificing much precision,” Elnashar explained. “For example, a healthcare provider handling patient data might prioritize Claude + JSON for accuracy, while an e-commerce platform could opt for ChatGPT-4o + CSV to process thousands of receipts efficiently.”
The study also highlighted a universal challenge: all LLMs struggled with narrative-style unstructured data (e.g., personal stories), with accuracy dropping to ~40% across prompt styles—underscoring the need for tailored approaches for different data types.
Practical Tools for Developers: Reusable Resources to Accelerate AI Workflows
Beyond insights, the research provides tangible value for the AI community. The team has made datasets, prompt templates, validation scripts, and design guidelines publicly available on GitHub (https://github.com/elnashara/EfficientStructuringMethods/tree/main), enabling reproducibility and immediate adoption.
“We wanted to move beyond theory—these resources let developers skip the trial-and-error and directly apply our findings to their pipelines,” said Jules White, co-author and professor at Vanderbilt’s Department of Computer Science. “Whether you’re building a medical data system or an e-commerce analytics tool, our work gives you a roadmap to choose the right prompt style and LLM.”
Looking Ahead: Expanding the Boundaries of Prompt Engineering
The study builds on the authors’ prior work focused on GPT-4o, now generalized to multiple models and prompt formats. Future research will explore LLMs’ robustness to noisy instructions, missing fields, and unseen schemas—critical considerations for real-world deployments. “As AI becomes more integrated into critical systems, we need to understand how these models perform when faced with the messiness of real data,” noted Schmidt, a professor in William & Mary’s Department of Computer Science.
This research was conducted without specific grant funding. The authors acknowledge the support of LLMs ChatGPT-4o, Claude, and Gemini for code generation, data visualization, and comparative evaluation.
About the Authors
Ashraf Elnashar: Department of Computer Science, Vanderbilt University (ashraf.elnashar@vanderbilt.edu)
Jules White: Department of Computer Science, Vanderbilt University
Douglas C. Schmidt: Department of Computer Science, William & Mary
About the Publication
Title: Prompt engineering for structured data: a comparative evaluation of styles and LLM performanceJournal: Artif. Intell. Auton. Syst.DOI: 10.55092/aias2025009
License: Creative Commons Attribution 4.0 International License
END
Groundbreaking research compares prompt styles and LLMs for structured data generation - Unveiling key trade-offs for real-world AI applications
2025-12-02
ELSE PRESS RELEASES FROM THIS DATE:
Beat the bugs, enjoy the beats
2025-12-02
As summer festivals and youth gatherings return in full swing, new research from Flinders University is revealing the hidden health risks that come with multi-day events, and how to avoid them.
A comprehensive review led by public health experts to identify and understand the risks that occur at multi-day events reveals that infectious disease outbreaks and foodborne illnesses are the most common public health threats at youth-focused mass gatherings.
The global study examined 19 multi-day events attended predominantly by young people, ranging from music festivals and cultural ...
Genome advancement puts better Wagyu marbling on the menu
2025-12-02
Researchers from the University of Adelaide’s Davies Livestock Research Centre (DLRC) have described the most complete cattle genome yet, in a study that will lead to improvements in Wagyu breeding and result in better beef marbling.
“We have presented a near complete cattle genome that is 16 per cent longer than the current reference genome,” said Dr Lloyd Low, from the DLRC and senior author of the study published in Nature Communications.
“This new Wagyu genome provides a much more complete and accurate view of the genetic blueprint behind one of the world’s most ...
Developing a new electric vehicle sound
2025-12-02
HONOLULU, Dec. 1, 2025 — One of the many benefits of electric vehicles is that they are much quieter than traditional gasoline-powered vehicles. In some cases, though, they are too quiet. Automakers are required to design their vehicles so they emit sounds at low speeds to alert pedestrians to their presence.
However, aside from some basic regulations regarding volume, automakers are free to choose whatever noise they wish their vehicles to emit. This freedom gives researchers a unique opportunity to design custom sounds to maximize their effectiveness.
Graduate ...
Elephant seals recognize their rivals from years prior
2025-12-02
HONOLULU, Dec. 1, 2025 — How would you react if you overheard the voice of a long-lost friend or old co-worker? Chances are, just the sound of their voice will bring back memories of times you spent together. Humans are not the only animals that can remember the voices of their old acquaintances. Elephant seals, too, can remember the calls of their rivals even a year later.
Caroline Casey, research scientist and adjunct professor at the University of California, Santa Cruz, will present her team’s research on elephant seal memory Monday, Dec. 1, at 2:45 p.m. HST as part of the Sixth ...
Fossils reveal anacondas have been giants for over 12 million years
2025-12-02
A University of Cambridge-led team has analysed giant anaconda fossils from South America to deduce that these tropical snakes reached their maximum size 12.4 million years ago and have remained giants ever since.
Many animal species that lived 12.4 to 5.3 million years ago, in the period known as the ‘Middle to Upper Miocene’, were much bigger than their modern relatives due to warmer global temperatures, extensive wetlands and an abundance of food.
While other Miocene giants - like the 12-metre caiman (Purussaurus) and the 3.2-metre giant freshwater turtle (Stupendemys) - have since gone extinct, anacondas (Eunectes) bucked the trend by surviving as a giant species.
Anacondas ...
Sylvester researchers lead major treatment overhauls for acute myeloid leukemia
2025-12-01
MIAMI, FLORIDA (DEC. 1, 2025) – A new generation of targeted treatments and gentler chemotherapy options for older adults with a new diagnosis of acute myeloid leukemia (AML) is driving better survival and cure rates. Led by Mikkael Sekeres, M.D., M.S., chief of the Division of Hematology at Sylvester Comprehensive Cancer Center, part of the University of Miami Miller School of Medicine, the updated 2025 American Society of Hematology (ASH) AML treatment guidelines, appear Dec. 1, 2025, in the journal Blood Advances.In addition, the updated guidelines will be presented Dec . 7 at the American Society of Hematology (ASH) annual ...
New global guidelines streamline environmental microbiome research
2025-12-01
Microbiomes, the communities of microorganisms that live in and around us, play a vital role in everything from human health to soil fertility and climate regulation. But studying these tiny life forms, especially outside the human body, presents a major challenge: how do scientists share complex data across such a wide range of environments and disciplines?
To help solve this problem, a team of nearly 250 researchers from 28 countries has developed a new set of guidelines called STREAMS, short for Standards for Technical Reporting in Environmental and host-Associated Microbiome Studies. STREAMS builds on ...
Small changes make some AI systems more brain-like than others
2025-12-01
Artificial intelligence systems that are designed with a biologically inspired architecture can simulate human brain activity before ever being trained on any data, according to new research from Johns Hopkins University.
The findings, published in Nature Machine Intelligence, challenge conventional approaches to building AI by prioritizing architectural design over the type of deep learning and training that takes months, costs billions of dollars and requires thousands of megawatts of energy.
“The way that the AI field is moving right now is to throw a bunch of data at the models and build compute resources the size of small cities. That ...
Asia PGI and partners unveil preview of PathGen: New AI-powered outbreak intelligence tool
2025-12-01
SINGAPORE, 1 December 2025 – Asia Pathogen Genomics Initiative (Asia PGI) today offered the first public preview of PathGen, an AI-powered sense-making and decision-making support platform of pathogen genomics and contextual data. Designed for public health practitioners, clinicians and industry, it can help detect emerging disease threats earlier, assess risks faster, and coordinate responses within and across borders, all without compromising countries’ ownership of their respective sovereign data. The objective is to strengthen health security across Asia and beyond, ...
Groundbreaking technique unlocks secrets of bacterial shape-shifting
2025-12-01
Scientists have long known that bacteria come in many shapes and sizes, but understanding what those differences mean has remained a major challenge, especially for species that can’t be grown in the lab. Now, a new study led by Nina Wale, an Assistant Professor in MSU’s Department of Microbiology, Genetics, & Immunology, introduces a groundbreaking method that could change how researchers study bacterial diversity.
The research, published in mSphere, focuses on a tiny, unculturable pathogen called Pasteuria ramosa, which infects water-dwelling ...