Omni-modal language models: Paving the way toward artificial general intelligence
2025-11-11
(Press-News.org) The survey “A Survey on Omni-Modal Language Models” offers a systematic overview of the technological evolution, structural design, and performance evaluation of omni-modal language models (OMLMs). The work highlights how OMLMs enable unified perception, reasoning, and generation across modalities, contributing to the ongoing progress toward Artificial General Intelligence (AGI).
Recently, Lu Chen, a master’s student at the School of Computer and Artificial Intelligence, Shandong Jianzhu University, in collaboration with Dr. Zheyun Qin, a postdoctoral researcher at the School of Computer Science and Technology, Shandong University, published a comprehensive review entitled “A Survey on Omni-Modal Language Models” in AI+ Journal.
The paper provides an in-depth analysis of the core technological evolution, representative architectures, and multi-level evaluation frameworks of omni-modal language models (OMLMs)—a new generation of AI systems that integrate and reason across multiple modalities, including text, image, audio, and video.
Unlike traditional multimodal systems dominated by a single input form, OMLMs achieve modality alignment, semantic fusion, and joint representation learning, enabling dynamic collaboration among modalities within a unified semantic space. This paradigm allows end-to-end task processing—from perception to reasoning and generation—bringing AI one step closer to human-like cognition.
The study also introduces lightweight adaptation strategies, such as modality pruning and adaptive scheduling, to improve deployment efficiency in real-time medical and industrial scenarios. Furthermore, it explores domain-specific applications of OMLMs in healthcare, education, and industrial quality inspection, demonstrating their versatility and scalability.
“Omni-modal models represent a paradigm shift in artificial intelligence,” said Lu Chen, the first author of the paper.
“By integrating perception, understanding, and reasoning within a unified framework, they bring AI closer to the characteristics of human cognition.”
Corresponding author Dr. Zheyun Qin added:
“Our survey not only summarizes the current progress of omni-modal research but also provides forward-looking insights into structural flexibility and efficient deployment.”
This work offers a comprehensive reference for researchers and practitioners in the field of multimodal intelligence and contributes to the convergence of large language models and multimodal perception technologies.
This paper was published in AI Plus (Chen L., Mu J., Wang J., Kang X., Xi X., Qin Z., A Survey on Omni-Modal Language Models, AI Plus, 2026, 1:0001. DOI: 10.55092/aiplus20260001).
END
ELSE PRESS RELEASES FROM THIS DATE:
2025-11-11
WASHINGTON, Nov. 11, 2025 — Car enthusiasts will pay hundreds of dollars for stereo systems that will improve the sound quality in their cars. However, the inherent directionality of speakers and complex shapes of car cabins can exacerbate sound disparities between the drivers and passengers, no matter how advanced a speaker system.
In the Journal of Applied Physics, by AIP Publishing, researchers from a collaboration of institutions in China created a prototype using a fractal to mitigate the sound differences.
Fractals are unique shapes that can be split into infinitely smaller ...
2025-11-11
About The Study: In this population-based cross-sectional study, infant mortality risk was inversely associated with county-level access to maternity care, with the highest risk in counties with no access. When examined by race and ethnicity, differences in mortality risk between living in a full access and no access county were observed among white infants only, indicating that unmeasured barriers may limit the protective effect of access for some racial and ethnic groups.
Corresponding Author: To contact the corresponding author, Ripley Lucas, MPH, email RLucas@marchofdimes.org.
To ...
2025-11-11
About The Study: In this randomized clinical trial, the clinical hypnosis group experienced significantly greater reductions of hot flash scores and daily interference from hot flashes compared with the active control condition at week 6. This study suggests that hypnosis delivered through self-administered audio files is a clinically significant and effective method to reduce hot flashes in postmenopausal women.
Corresponding Author: To contact the corresponding author, Gary Elkins, PhD, email gary_elkins@baylor.edu.
To ...
2025-11-11
(Vienna, 11 November 2025) Using sophisticated RNA sequencing technology, biomedical researchers can measure the activity of our genes across millions of single cells, creating detailed maps of tissues, organs, and diseases. Analysing these datasets requires a rare combination of skills: deep understanding of the biology, and the ability to develop computer code that turns data into insights. What if we could equip biomedical researchers with an AI assistant that sees the data, supports the analysis, ...
2025-11-11
Novel study used whole genome sequencing to combine monogenetic and polygenetic testing, which are often siloed in research and practice
More physicians should order genetic testing but much of the workforce isn’t trained in it
Findings lay the groundwork for developing targeted therapies
CHICAGO --- In a new Northwestern Medicine study, scientists have developed a more precise genetic risk score to determine whether a person is likely to develop arrhythmia, an irregular heartbeat that ...
2025-11-11
Recent epic leaps in genetics have created a biodiversity library. As the genetic make-up of animals, plants, fungi, bacteria, and viruses has been mapped, researchers racing to develop solutions to today’s global challenges run into a question:
Why be limited to a mouse?
Electric eels, octopi, birds, sponges, and plastic-gobbling bacteria are amongst the new stages on which discovery is unfolding. The challenge is to refine the theater to optimize performance.
In today’s Nature Reviews Biodiversity, Michigan State University evolutionary biologist Jason Gallant presses a case for research – from classrooms and laboratories to funding agencies ...
2025-11-11
“These findings suggest a physiological interaction between exercise and L-BAIBA supplementation to improve soleus muscle and bone properties and reduce bone marrow adiposity.”
BUFFALO, NY — November 11, 2025 — A new research paper was published in Volume 17, Issue 10 of Aging-US on October 1, 2025, titled “L-β-aminoisobutyric acid (L-BAIBA) in combination with voluntary wheel running exercise enhances musculoskeletal properties in middle-age male mice.”
In this study led by first author ...
2025-11-11
(Toronto and Little Rock, November 11, 2025) JMIR Publications, a premier open access publisher of digital health research, and The MidSouth Computational Biology and Bioinformatics Society (MCBIOS), a leading professional organization for computational biology and bioinformatics, today announced a strategic, long-term partnership. This agreement formally designates JMIR Bioinformatics and Biotechnology as the official journal of MCBIOS.
This landmark Memorandum of Understanding (MOU) creates a stable, high-impact venue for MCBIOS members to publish their research, particularly the output from the Society's annual conference. Both organizations are now ...
2025-11-11
OAK BROOK, Ill. – Studying brain patterns in soccer fans, researchers found that certain circuit regions of the brain were activated while viewing soccer matches involving their favorite team, triggering positive and negative emotions and behaviors, according to a new study published today in Radiology, a journal of the Radiological Society of North America (RSNA). The researchers say these patterns could apply to other types of fanaticism as well, and that the circuits are forged early in life.
Soccer is a global phenomenon, and its followers ...
2025-11-11
Computed tomography (CT) scans of the chest, abdomen, and spine, taken originally to detect problems such as kidney stones or growths on the lungs, can be repurposed through artificial intelligence (AI) to catch signs of bone loss, a new study shows.
NYU Langone Health radiologists who developed the AI tool with experts at Visage say their new tool will soon be ready to provide “opportunistic screening” in NYU Langone hospitals for osteoporosis. The effort will be part of a clinical trial to diagnose those with unknown low bone density, using CT scans taken for other purposes.
Publishing ...
LAST 30 PRESS RELEASES:
[Press-News.org] Omni-modal language models: Paving the way toward artificial general intelligence