(Press-News.org) ITHACA, N.Y. – Ask ChatGPT to find a well-known poem and it will probably regurgitate the entire text verbatim – regardless of copyright law – according to a new study by Cornell University researchers.
The study showed that ChatGPT was capable of “memorizing” poems, especially famous ones commonly found online. The findings pose ethical questions about how ChatGPT and other proprietary artificial intelligence models are trained – likely using data scraped from the internet, researchers said.
“It’s generally not good for large language models to memorize large chunks of text, in part because it’s a privacy concern,” said first author Lyra D’Souza, a former computer science major and summer research assistant. “We don’t know what they’re trained on, and a lot of times, private companies can train proprietary models on our private data.”
D’Souza presented this work, “The Chatbot and the Canon: Poetry Memorization in LLMs,” at the Computational Humanities Research Conference.
“We chose poems for a few reasons,” said senior author David Mimno, associate professor of information science. “They’re short enough to fit in the context size of a language model. Their status is complicated: many of the poems we studied are technically under copyright, but they’re also widely available from reputable sources like the Poetry Foundation.”
D’Souza tested the poem-retrieving capabilities of ChatGPT and three other language models: PaLM from Google AI, Pythia from the non-profit AI research institute EleutherAI and GPT-2, an earlier version of the model that ultimately yielded ChatGPT, both developed by OpenAI. She came up with a set of poems from 60 American poets from different time periods, races, genders and levels of fame, and fed the models prompts asking for the poems’ text.
The most reliable predictor of memorization was if the poem had appeared in a Norton Anthology of Poetry, specifically the 1983 edition.
D’Souza noticed that ChatGPT’s responses changed over time as the model evolved. When she first queried the chatbot in February 2023, it could not say it didn’t know a poem – instead it would fabricate one or recycle a poem from another author. By July 2023, if ChatGPT didn’t know the poem, it would ask if the poem even existed – putting the blame on the user.
Additionally, in February, ChatGPT had no limits due to copyright. But by July, sometimes it would respond that it couldn’t produce a copyrighted poem. However, it would usually reproduce the poem if asked again, D’Souza found.
This study looked only at American poets, but the next step will be to see how chatbots respond to requests in different languages and whether factors such as the length, meter and rhyming pattern of a poem make it more or less likely to be memorized, D’Souza said
“ChatGPT is a really powerful new tool that’s probably going to be part of our lives moving forward,” she said. “Figuring out how to use it responsibly and use it transparently is going to be really important.”
For additional information, see this Cornell Chronicle story.
-30-
END
ChatGPT poem regurgitation raises ethical questions
2024-01-09
ELSE PRESS RELEASES FROM THIS DATE:
Sickle cell raises COVID-19 risk, but vaccination lags
2024-01-09
Despite the fact that people with sickle cell disease have a much higher risk of serious illness or death if they develop COVID-19, a new study shows they’re also much less likely than those without sickle cell disease to have gotten vaccinated against coronavirus.
Completion of the initial COVID-19 vaccination series was nearly two times lower for adults with sickle cell disease as others their age, the analysis of data in Michigan shows.
In in teens and children over 5, who overall have lower rates of COVID-19 vaccination, those with sickle cell disease were far less likely than other young people to have gotten their doses by summer 2022, the analysis ...
Brookline Housing Authority partners with Hebrew SeniorLife for health and social services in senior housing
2024-01-09
The Brookline Housing Authority (BHA) has partnered with Hebrew SeniorLife, New England’s largest nonprofit provider of senior health care and living communities, and the only senior care organization affiliated with Harvard Medical School, to provide community life services including resident services, fitness, social programming, and nursing in BHA’s senior housing sites.
Hebrew SeniorLife brings to the BHA its model of housing with services called the Right Care, Right Place, Right Time (R3) program. This model uses a preventive approach to resident services, focused on one-on-one relationship building, community-wide ...
Sylvester-led research group unveils the first individual risk prediction model for multiple myeloma
2024-01-09
MIAMI, FLORIDA (EMBARGOED UNTIL JAN. 9, 2024 AT 4 PM EST) – A multicenter collaboration led by researchers at Sylvester Comprehensive Cancer Center at the University of Miami Miller School of Medicine has produced the first computational model for newly diagnosed multiple myeloma that predicts an individual’s personalized prognosis based on their tumor genomics and treatments.
The prediction model for individualized risk in newly diagnosed multiple myeloma, or IRMMa, improves on previous prognostic tools because it takes into account ...
Systemic changes induced by ASCOT in plasma proteome of women with impaired ovarian reserves
2024-01-09
“Identifying plasma proteins that regenerate aged or damaged ovaries could lead to more effective, targeted and/or preventive therapies for patients.”
BUFFALO, NY- January 9, 2024 – A new research paper was published in Aging (listed by MEDLINE/PubMed as "Aging (Albany NY)" and "Aging-US" by Web of Science) Volume 15, Issue 24, entitled, “Systemic changes induced by autologous stem cell ovarian transplant in plasma proteome of women with impaired ovarian reserves.”
Patients with poor ...
Green wheels, bright skies: NREL analysis unveils the connection between electric vehicles and photovoltaics
2024-01-09
People who own electric vehicles (EVs) are more likely to go a step further and add solar panels to their home, according to an analysis of a behavioral study by researchers at the U.S. Department of Energy’s National Renewable Energy Laboratory (NREL). Conversely, the impact of owning solar panels also has a bearing on whether a homeowner buys an electric vehicle but not as strongly.
The study relied on a survey of 869 households in the San Francisco Bay Area.
NREL’s Shivam Sharda, lead author of the newly published research paper that analyzes the ...
V Foundation grant enables research on radiation resistance in pancreatic cancer treatment
2024-01-09
University of Colorado Cancer Center member Sana Karam, MD, PhD, has received a translational research grant from the V Foundation for Cancer Research, co-founded by ESPN and legendary basketball coach Jim Valvano, to study a new therapeutic that may help pancreatic cancer patients overcome resistance to radiation therapy.
“Pancreatic cancer is deadly. The only treatment that can cure it is surgery to fully remove the tumor, but that is only an option when the cancer is caught early, which is rare,” Karam explains. “Radiation alone to shrink tumors before surgery has been tried, but with limited benefit. By studying patient ...
The role of fibronectin in BRAF-mutant thyroid cancer treatment
2024-01-09
New research overseen by University of Colorado Cancer Center member Rebecca Schweppe, PhD, could lead to improved treatment for people with thyroid cancer characterized by a mutation in the BRAF gene — a mutation also responsible for some types of melanoma, colorectal cancer, leukemia, lymphoma, and ovarian cancer.
“The BRAF mutation is a common mutation in thyroid cancer,” Schweppe says. “It has a high prevalence of mutations in two different subtypes — papillary thyroid cancer, or PTC, and anaplastic thyroid cancer, or ATC — and there's a lot of interest in targeting this pathway. Other tumor types, like melanoma ...
Current research on prevalence of prolonged grief disorder is inadequate
2024-01-09
Waltham — January 8, 2024 — Proper procedures for diagnosing prolonged grief disorder (PGD) are not being followed in research into its prevalence, according to a study published in Harvard Review of Psychiatry, part of the Lippincott portfolio from Wolters Kluwer. What’s more, most published literature doesn’t clearly acknowledge the limitations of the methodology used.
The lead investigator was Margaret S. Stroebe, PhD, a clinical psychologist at Utrecht University and the University of Groningen ...
New NIH-funded center could soon reduce the need for pharmaceutical trials on animals
2024-01-09
The University of Rochester will house a new national center focused on using tissue-on-chip technology to develop drugs more rapidly and reduce the need for animal trials. The National Institutes of Health awarded a $7.5 million grant to establish the Translational Center for Barrier Microphysiological Systems (TraCe-bMPS) at Rochester in partnership with Duke University.
The center aims to develop five Food and Drug Administration–qualified drug development tools related to ...
Police leaders face challenges when seeking to accommodate community stakeholders
2024-01-09
Police reform movements often focus on improving police-public relationships. These ties are a focus of community policing and procedural justice, two significant reform efforts in policing worldwide over the last three decades. In a new article, researchers examine issues involved in these efforts, especially limitations to communication, and highlight implications for police-community relations.
The article, by researchers at Arizona State University (ASU) and the University of California, Santa Barbara (UCSB), is published in Psychology, Public Policy, and the Law.
“Reform movements that try to improve relationships ...