Poetic Prompts Could Jailbreak AI, Study Finds

Spread the love

“`html

Highlights

  • Research Findings: A study from Italy’s Icaro Lab reveals that poetry can successfully jailbreak AI models.
  • Success Rates: Poetic prompts achieved a 62% success rate in bypassing AI safety measures.
  • Model Variability: Different AI models exhibited varying responses, with OpenAI’s GPT-5 nano showing the least vulnerability.
  • Regulatory Gaps: The study highlights significant shortcomings in current safety benchmarks and regulatory frameworks.

Well, AI is joining the ranks of many, many people: It doesn’t really understand poetry.

Research from Italy’s Icaro Lab found that poetry can be used to jailbreak AI and skirt safety protections.

In the study, researchers wrote 20 prompts that started with short poetic vignettes in Italian and English and ended the prompts with a single explicit instruction to produce harmful content. They tested these prompts on 25 Large Language Models across Google, OpenAI, Anthropic, Deepseek, Qwen, Mistral AI, Meta, xAI, and Moonshot AI. The researchers said the poetic prompts often worked.

“Poetic framing achieved an average jailbreak success rate of 62% for hand-crafted poems and approximately 43% for meta-prompt conversions (compared to non-poetic baselines), substantially outperforming non-poetic baselines and revealing a systematic vulnerability across model families and safety training approaches,” the study reads. “These findings demonstrate that stylistic variation alone can circumvent contemporary safety mechanisms, suggesting fundamental limitations in current alignment methods and evaluation protocols.”

Mashable Light Speed

Of course, there were differences in how well the jailbreaking worked across the different LLMs. OpenAI’s GPT-5 nano didn’t respond with harmful or unsafe content at all, while Google’s Gemini 2.5 pro responded with harmful or unsafe content every single time, the researchers reported.

See also  Artificial Intelligence Revolutionizes Nuclear Power Plant Operations

The researchers concluded that “these findings expose a significant gap” in benchmark safety tests and regulatory efforts such as the EU AI Act.

Our results show that a minimal stylistic transformation can reduce refusal rates by an order of magnitude, indicating that benchmark-only evidence may systematically overstate real-world robustness,” the paper stated.

Great poetry is not literal — and LLMs are literal to the point of frustration. The study reminds me of how it feels to listen to Leonard Cohen’s song “Alexandra Leaving,” which is based on C.P. Cavafy‘s poem “The God Abandons Antony.” We know it’s about loss and heartbreak, but it would be a disservice to the song and the poem it’s based on to try to “get it” in any literal sense — and that’s what LLMs will try to do.


Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.

Topics
Artificial Intelligence

Here you can find the original content; the photos and images used in our article also come from this source. We are not their authors; they have been used solely for informational purposes with proper attribution to their original source.
“`

  • David Bridges

    David Bridges

    David Bridges is a media culture writer and social trends observer with over 15 years of experience in analyzing the intersection of entertainment, digital behavior, and public perception. With a background in communication and cultural studies, David blends critical insight with a light, relatable tone that connects with readers interested in celebrities, online narratives, and the ever-evolving world of social media. When he's not tracking internet drama or decoding pop culture signals, David enjoys people-watching in cafés, writing short satire, and pretending to ignore trending hashtags.

    Related Posts

    “Widow’s Bay: Apple TV Excels in Horror Comedy”

    Spread the love

    Spread the love Share It: ChatGPT Perplexity WhatsApp LinkedIn X Grok Google AI With hit shows like Pluribus, Severance, For All Mankind, Murderbot, and others, Apple TV has firmly established…

    Read more

    Fitbit Inspire 3 Under $80 at Amazon — Save $20 Today

    Spread the love

    Spread the love Share It: ChatGPT Perplexity WhatsApp LinkedIn X Grok Google AI Exclusive Savings Alert: As of April 24, the Fitbit Inspire 3 is available for just $79.95 at…

    Read more

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Prodentim Reviews: Customer Feedback, User Results & Oral Health Benefits

    Prodentim Reviews: Customer Feedback, User Results & Oral Health Benefits

    Earth Day Cleanup: From Key Largo to Key West!

    Earth Day Cleanup: From Key Largo to Key West!

    Emily Huff Claims Jayda Cheaves Attacked Her Three Times

    Emily Huff Claims Jayda Cheaves Attacked Her Three Times

    YouTube as the Ideal Social Media App: Eliminating Shorts

    YouTube as the Ideal Social Media App: Eliminating Shorts

    “Widow’s Bay: Apple TV Excels in Horror Comedy”

    “Widow’s Bay: Apple TV Excels in Horror Comedy”

    Recyclator Event Guide: Master Recycling in Goat Simulator 3

    Recyclator Event Guide: Master Recycling in Goat Simulator 3

    Celebrities on ‘DWTS’: Exciting Additions This Season

    Celebrities on ‘DWTS’: Exciting Additions This Season

    Fitbit Inspire 3 Under $80 at Amazon — Save $20 Today

    Fitbit Inspire 3 Under $80 at Amazon — Save $20 Today

    Teen Social Media Ban Passed by Turkish Lawmakers

    Teen Social Media Ban Passed by Turkish Lawmakers

    Brian McKnight Sues Over Niko Claims Involving Ex-Wife & Son

    Brian McKnight Sues Over Niko Claims Involving Ex-Wife & Son