PRESS-NEWS.org - Press Release Distribution

New video dataset to advance AI for health care

Multimodal data captures the human dynamics of care to improve AI and clinical practice

2025-12-16
(Press-News.org) Researchers at the University of Pennsylvania have launched Observer, the first multimodal medical dataset to capture anonymized, real-time interactions between patients and clinicians. Much like the medical drama The Pitt, which portrays life in the emergency room, Observer lets outsiders peer inside primary care clinics — only, in this case, none of the filmed interactions are fictional. 

Until now, the data available to health care researchers has been limited to traces left behind after a visit: qualitative information like clinician notes and quantitative measurements like patient vital signs. None of these sources captures subtleties like body language and vocal tone, or the environmental factors, including computer use, that affect how providers and patients engage with one another.

“So much of what shapes medical visits and their outcomes has been invisible to researchers,” says Kevin B. Johnson, David L. Cohen University Professor and the lead author of a new paper describing Observer in the Journal of the American Medical Informatics Association. “Thanks to technology that anonymizes our recordings, enabling HIPAA compliance, Observer lets us watch care unfold. That kind of evidence isn’t just the foundation for improving clinical practice, it’s crucial for developing responsible AI tools to augment care.”

Already, the researchers have awarded pilot grants to other teams to begin using Observer, with the goal of expanding the dataset into a national resource for improving health care. “These early projects are the start of a flywheel,” says Johnson. “As researchers generate new insights and recordings, the dataset will grow, letting us ask even more ambitious questions.”

Why Clinical Data Matters

For decades, researchers have leveraged data about medical visits to study how to improve health care. The Medical Information Mart for Intensive Care (MIMIC), an MIT-affiliated project begun in the 1990s, now contains tens of thousands of records of ICU visits, and has been cited in thousands of research papers covering topics like clinical decision making and hospital operations.

More recently, such data has also played a key role in AI training, since it allows AI models to identify patterns connecting diagnoses, treatments and outcomes across large patient populations. “We’ve learned a tremendous amount from what gets documented in the medical record,” Johnson says. “But if we want to understand the full experience of care, we need data that shows what happens in the room.”

With Observer linking video, audio and transcripts to clinical data and electronic health records (EHR), researchers can now ask new questions: when laughter appears during a visit and whether it affects outcomes; how often clinicians look at patients versus their computer screens; how room layout or digital scribing technology changes communication; and how patients respond to explanations of diagnoses.

“This kind of multimodal evidence — combining video, audio and medical records — creates opportunities across so many fields,” says Karen O’Connor, Associate Director of Johnson’s Artificial Intelligence for Ambulatory Care Innovation (AI-4-AI) Lab. “By making this data available, we’re democratizing medical research and opening new paths to improving care.”

Ensuring Patient Privacy

In the United States, patient health information is protected by the Health Insurance Portability and Accountability Act (HIPAA), which requires that any data used for research be stripped of identifying details. 

For video and audio, that standard has historically been almost impossible to meet. Until recently, creating a dataset of real clinical encounters would have required manually reviewing and editing every second of footage and sound, a labor-intensive and error-prone process.

Enter MedVidDeID, a tool the Penn researchers developed to automatically anonymize video and audio recordings from clinical settings, which they describe in a separate paper in the Journal of Biomedical Informatics. In tests, MedVidDeID successfully de-identified more than 90% of video frames without human intervention and reduced total review time by over 60%.

The multi-stage system extracts transcripts, removes identifying text, scrubs audio, transforms voices, and automatically detects and blurs faces and other visual identifiers using state-of-the-art computer-vision models. A human reviewer performs final quality control to ensure total removal of protected health information.

“We built a modular pipeline that automates most of the audio-video de-identification process. By keeping a human in the loop, we’re able to protect patient privacy while enabling video-informed research at scale,” says Sriharsha Mopidevi, Senior Application Developer in the AI-4-AI Lab and co-author of both papers. 
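The modular, human-in-the-loop design described above can be illustrated with a minimal Python sketch. This is not MedVidDeID's code: the `Recording` class, the stage names, and the toy regular expression are all hypothetical, and the audio and video stages are empty placeholders standing in for the real models. The sketch shows only the general shape of such a pipeline, in which independent stages run in sequence and a reviewer signs off at the end.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Recording:
    """Hypothetical container for one recorded visit (not the Observer schema)."""
    transcript: str
    completed_stages: List[str] = field(default_factory=list)
    approved_by_human: bool = False


Stage = Callable[[Recording], Recording]

# Toy pattern (an assumption for illustration): titled names and phone-like
# numbers. Real de-identification needs far broader coverage than this.
_IDENTIFIER = re.compile(
    r"\b(?:Dr|Mr|Ms|Mrs)\.\s+[A-Z][a-z]+|\b\d{3}-\d{3}-\d{4}\b"
)


def redact_transcript(rec: Recording) -> Recording:
    """Replace text matching the toy identifier pattern with a marker."""
    rec.transcript = _IDENTIFIER.sub("[REDACTED]", rec.transcript)
    rec.completed_stages.append("redact_transcript")
    return rec


def make_placeholder(name: str) -> Stage:
    """Build a stand-in stage that only records that it ran.

    A real system would scrub audio, transform voices, or blur faces here.
    """
    def stage(rec: Recording) -> Recording:
        rec.completed_stages.append(name)
        return rec
    return stage


def run_pipeline(rec: Recording, stages: List[Stage]) -> Recording:
    """Apply each stage in order; stages can be added or swapped freely."""
    for stage in stages:
        rec = stage(rec)
    return rec


def human_review(rec: Recording,
                 reviewer_ok: Callable[[Recording], bool]) -> Recording:
    """Final quality control: a reviewer's decision gates approval."""
    rec.approved_by_human = reviewer_ok(rec)
    return rec


# Hypothetical stage order, loosely following the description in the text.
STAGES: List[Stage] = [
    redact_transcript,
    make_placeholder("scrub_audio"),
    make_placeholder("transform_voice"),
    make_placeholder("blur_faces"),
]

rec = Recording("Dr. Smith called 215-555-0100 about the results.")
rec = run_pipeline(rec, STAGES)
# Stand-in for a human reviewer: approve only if no toy identifier remains.
rec = human_review(rec, lambda r: _IDENTIFIER.search(r.transcript) is None)
print(rec.transcript)
print(rec.completed_stages, rec.approved_by_human)
```

Keeping each stage as a separate function is one way to get the modularity Mopidevi describes: a stage can be replaced or re-run without touching the others, and approval stays a distinct, explicit step rather than a side effect of automation.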

Before collecting data, the researchers ensured that patients, patients’ families and clinicians had the opportunity to opt in and later provide feedback on the process. As a result, the team deployed multiple cameras in participating clinics: a fixed room camera to capture the overall visit, a head-mounted camera worn by the clinician to show their perspective, and — when participants opted in — a patient-mounted camera to record the visit from the patient’s point of view. 

Future Directions

With the first phase of data collection complete and pilot studies underway, the Observer team is preparing to expand the dataset and make it available to a wider research community. The team plans to adopt an access model similar to the one used by MIMIC, allowing qualified investigators to apply for permission to use the multimodal recordings for their own studies.

“This is ultimately about changing the health care system,” Johnson says. “You cannot improve care or build meaningful clinical AI without understanding the encounter itself. When you can see what happens across hundreds or thousands of visits, transformation becomes possible.”

This work was supported by the National Library of Medicine and the NIH Office of the Director under project number 5DP1LM014558-03 (Former Number: 1DP1OD035237-01) for the project entitled “Helping Doctors Doctor: Using AI to Automate Documentation and ‘De-Autonomate’ Health Care.”

Kevin Johnson, M.D., M.S., is the David L. Cohen University Professor of Biomedical Informatics, Computer and Information Science, Pediatrics, and Science Communication at the University of Pennsylvania. 

Additional co-authors include Basam Alasaly, Kuk Jin Jang, Eric Eaton and Ross Koppel, all of the University of Pennsylvania.

END

