PRESS-NEWS.org - Press Release Distribution

Groundbreaking research compares prompt styles and LLMs for structured data generation - Unveiling key trade-offs for real-world AI applications

2025-12-02
(Press-News.org) Nashville, TN & Williamsburg, VA – 24 Nov 2025 – A new study published in Artif. Intell. Auton. Syst. delivers the first systematic cross-model analysis of prompt engineering for structured data generation, offering actionable guidance for developers, data scientists, and organizations leveraging large language models (LLMs) in healthcare, e-commerce, and beyond. Led by Ashraf Elnashar from Vanderbilt University, alongside co-authors Jules White (Vanderbilt University) and Douglas C. Schmidt (William & Mary), the research benchmarks six prompt styles across three leading LLMs to solve a critical challenge: balancing accuracy, speed, and cost in structured data workflows.

Structured data—from medical records and receipts to business analytics—powers essential AI-driven tasks, but its quality and efficiency depend heavily on how prompts are designed. “Prior research only scratched the surface, testing a limited set of prompts on single models,” said Elnashar, the study’s corresponding author and a researcher in Vanderbilt’s Department of Computer Science. “Our work expands the horizon by evaluating six widely used prompt formats across ChatGPT-4o, Claude, and Gemini, revealing clear trade-offs that let practitioners tailor their approach to real-world needs.”

Key Findings: Accuracy vs. Efficiency, a Clear Choice for Every Use Case

The team’s rigorous experiment, conducted across three datasets (personal stories, medical records, and receipts), measured accuracy, token cost (a key driver of API expenses), and generation time for each prompt style-LLM combination. The results uncovered distinct strengths in each model:

Claude emerged as the accuracy leader (85% overall), excelling with hierarchical prompt formats like JSON and YAML. This makes it a strong fit for complex, high-stakes tasks such as medical record generation, where data integrity is non-negotiable.

ChatGPT-4o stood out for efficiency, delivering the lowest token usage (under 100 tokens for lightweight formats) and the fastest processing times (4–6 seconds on average), which suits cost-sensitive or real-time applications like e-commerce receipt processing.

Gemini offered a balanced middle ground, with solid performance across all metrics, though it showed variability with mixed-format prompts like Hybrid CSV/Prefix.

“Hierarchical formats like JSON and YAML boost accuracy but come with higher token costs, while lightweight options like CSV and simple prefixes cut latency without sacrificing much precision,” Elnashar explained. “For example, a healthcare provider handling patient data might prioritize Claude + JSON for accuracy, while an e-commerce platform could opt for ChatGPT-4o + CSV to process thousands of receipts efficiently.”
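To make the format trade-off concrete, the sketch below renders one record in a hierarchical style (JSON) and two lightweight styles (CSV and simple prefixes) and compares their sizes. It is an illustration only, not the study's method: the record is invented, the function names are hypothetical, and character count is used as a rough stand-in for token cost because real token counts depend on each model's tokenizer.

```python
import csv
import io
import json

# Hypothetical receipt record, invented for illustration (not from the study's datasets).
RECORD = {"store": "Corner Market", "date": "2025-11-24", "total": "18.40"}


def as_json(record: dict) -> str:
    """Hierarchical style: keys and values as an indented JSON object."""
    return json.dumps(record, indent=2)


def as_csv(record: dict) -> str:
    """Lightweight style: one header row followed by one value row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(record.keys())
    writer.writerow(record.values())
    return buffer.getvalue().strip()


def as_prefix(record: dict) -> str:
    """Simple prefix style: one 'Key: value' line per field."""
    return "\n".join(f"{key.capitalize()}: {value}" for key, value in record.items())


def size_by_format(record: dict) -> dict:
    """Character count per format; an assumed proxy for token cost, not a real token count."""
    renderers = {"json": as_json, "csv": as_csv, "prefix": as_prefix}
    return {name: len(render(record)) for name, render in renderers.items()}


if __name__ == "__main__":
    for name, size in size_by_format(RECORD).items():
        print(f"{name}: {size} characters")
```

For this record the CSV rendering is shorter than the JSON rendering because field names appear once in a header instead of being wrapped in quotes and braces. That is consistent with the direction of the cost difference reported above, though the size of the gap will vary with the data and the tokenizer.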

The study also highlighted a universal challenge: all LLMs struggled with narrative-style unstructured data (e.g., personal stories), with accuracy dropping to ~40% across prompt styles—underscoring the need for tailored approaches for different data types.

Practical Tools for Developers: Reusable Resources to Accelerate AI Workflows

Beyond insights, the research provides tangible value for the AI community. The team has made datasets, prompt templates, validation scripts, and design guidelines publicly available on GitHub (https://github.com/elnashara/EfficientStructuringMethods/tree/main), enabling reproducibility and immediate adoption.
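For readers who want a sense of what checking generated output can look like, here is a minimal sketch of a field-level accuracy check for a JSON response. It is not taken from the team's repository and does not claim to reproduce the study's accuracy metric; the function name and the exact-match scoring rule are assumptions made for illustration.

```python
import json


def field_accuracy(expected: dict, generated_text: str) -> float:
    """Fraction of expected fields reproduced exactly in a JSON response.

    Hypothetical scoring rule: a field counts only if the key is present
    and its value equals the expected value. Returns 0.0 when the response
    is not a JSON object or when there is nothing to compare.
    """
    try:
        generated = json.loads(generated_text)
    except json.JSONDecodeError:
        return 0.0
    if not isinstance(generated, dict) or not expected:
        return 0.0
    matches = sum(
        1
        for key, value in expected.items()
        if key in generated and generated[key] == value
    )
    return matches / len(expected)


if __name__ == "__main__":
    # Invented example values, not drawn from the published datasets.
    reference = {"name": "Ana", "age": 42}
    print(field_accuracy(reference, '{"name": "Ana", "age": 41}'))
```

A score like this rewards exact matches only, so a production pipeline might add normalization for dates, whitespace, or numeric formats before comparing.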

“We wanted to move beyond theory—these resources let developers skip the trial-and-error and directly apply our findings to their pipelines,” said Jules White, co-author and professor at Vanderbilt’s Department of Computer Science. “Whether you’re building a medical data system or an e-commerce analytics tool, our work gives you a roadmap to choose the right prompt style and LLM.”

Looking Ahead: Expanding the Boundaries of Prompt Engineering

The study builds on the authors’ prior work focused on GPT-4o, now generalized to multiple models and prompt formats. Future research will explore LLMs’ robustness to noisy instructions, missing fields, and unseen schemas, all critical considerations for real-world deployments. “As AI becomes more integrated into critical systems, we need to understand how these models perform when faced with the messiness of real data,” noted Schmidt, a professor in William & Mary’s Department of Computer Science.

This research was conducted without specific grant funding. The authors acknowledge the support of the LLMs ChatGPT-4o, Claude, and Gemini for code generation, data visualization, and comparative evaluation.

About the Authors

Ashraf Elnashar: Department of Computer Science, Vanderbilt University (ashraf.elnashar@vanderbilt.edu)
Jules White: Department of Computer Science, Vanderbilt University
Douglas C. Schmidt: Department of Computer Science, William & Mary

About the Publication

Title: Prompt engineering for structured data: a comparative evaluation of styles and LLM performance
Journal: Artif. Intell. Auton. Syst.
DOI: 10.55092/aias2025009

License: Creative Commons Attribution 4.0 International License

END

