Skip to main content

Cybersecurity Experts Raise Concerns Over GPT-5 Security

OpenAI released its latest and most advanced AI model, GPT-5, on August 7, and only hours after its launch, researchers were able to jailbreak the system. Cybersecurity experts have been warning about the flagship model's safety and sharing security concerns over the past few days.

According to a recent report published on August 8 by the AI security platform NeuralTrust, GPT-5 is susceptible to attacks, and vulnerabilities were demonstrated after applying LLM jailbreak techniques.

Researchers at NeuralTrust successfully breached the model’s safeguards by combining the Echo Chamber and Storytelling techniques.

“We use Echo Chamber to seed and reinforce a subtly poisonous conversational context, then guide the model with low-salience storytelling that avoids explicit intent signaling,” wrote security researcher Martí Jordà in the report. “This combination nudges the model toward the objective while minimizing triggerable refusal cues.”

In one of the tests, the researchers demonstrated how a user could manipulate GPT-5 into producing harmful content by sharing poisoned context. The Echo Chamber attack — in which the user engages the model in multi-turn conversations to gradually steer it toward generating harmful or disallowed content — was combined with Storytelling, a method that disguises harmful queries as “creative narrative” requests. Together, these techniques enabled the researchers to coax GPT-5 into producing harmful procedural instructions.

"This progression shows Echo Chamber's persuasion cycle at work: the poisoned context is echoed back and gradually strengthened by narrative continuity," wrote Jordà. "The storytelling angle functions as a camouflage layer, transforming direct requests into continuity-preserving elaborations."

Another research team, SPLX AI, shared its own report comparing GPT-5 to the previous model, GPT-4o. While acknowledging that GPT-5 outperforms other models in safety, security, and business-alignment categories, the team also identified troubling vulnerabilities.

SPLX’s team tested the new model using more than 1,000 adversarial prompts, noting that it often succumbed to adversarial logic tricks.

“One of the most effective techniques we used was a StringJoin Obfuscation Attack, inserting hyphens between every character and wrapping the prompt in a fake ‘encryption challenge,’” wrote Dorian Granoša, Lead Red Team Data Scientist at Splx AI. “OpenAI’s latest model is undeniably impressive, but security and alignment must still be engineered, not assumed.”

Cybersecurity researchers have raised similar concerns about OpenAI’s previous models. In 2023, an AI research group filed a complaint against the release of GPT-4 claiming it was deceptive and posed a risk to public safety.



See TessMore Internet Business Must-Reads

Comments

Popular posts from this blog

Only 1 in 10 NFT Owners Have Never Experienced a Scam

A new survey from PrivacyHQ reveals 90% or nine out of 10 respondents experienced an NFT scam. This level of uncertainty is cause for concern for a relatively new marketplace that is generating billions of dollars. Only 1 in 10 NFT Owners Have Never Experienced a Scam The PrivacyHQ survey spoke to 1,008 people in the U.S. who are actively investing in and own NFTs. And according to the report, there are some horror stories and great lessons to be learned. The key takeaways from the survey are: Less than half of NFT owners feel their NFTs are secure Two out of 3 respondents said they had panic-sold NFTs in the past Nine out of 10 respondents had experienced an NFT scam Half of the respondents had lost access to their NFTs at some point When it comes to NFT scams there were multiple ways in which buyers were scammed. Topping the list of the most common scams experienced by these respondents starts out with the NFT provider shutting down or changing their URL at 44.8%. Next is...

Thousands Still Available in COVID Relief with These Small Business Grants

Building improvements can be a major expense for small businesses. And many had to make certain changes to navigate the past few years. Restaurants set up outdoor patios. Historic properties restored their storefronts. And offices added energy efficient features. Many businesses also have improvement projects planned for 2022. Luckily, many small business grant programs across the country make these projects more attainable, thus improving the customer experience and the community at large. Here are some current small business grant opportunities for building improvements, pandemic recovery, and more. Raleigh Building-Up Fit Grant Raleigh’s Small Business Development department is launching a new grant opportunity for local businesses. The Building-Up Fit Grant offers matching reimbursement funds up to $25,000 for eligible renovation projects. Businesses with 50 employees or less can apply for grants to cover projects that significantly improve the appearance and value of the pro...

openSNP Shutters Over Privacy and Authoritarianism Concerns

OpenSNP, a long-running open-source genetic data platform, is shutting down after citing growing concerns over data privacy, the misuse of genetic information by law enforcement, and the rise of authoritarian governments. The decision comes as 23andMe, a major source of user-submitted genetic data for openSNP, faces bankruptcy and a potential selloff of user data . Founded in 2011, openSNP allowed users to upload genetic data from services like 23andMe to make it freely available for scientific research. Co-founder Bastian Greshake Tzovaras said the closure was triggered in part by the collapse of 23andMe and the broader political climate. Speaking to TechCrunch, he stated : “The risk/benefit calculus of providing free and open access to individual genetic data in 2025 is very different compared to 14 years ago.” The database has collected roughly 7,500 genomes and supported academic work across biomedical research, information security, and more. But its founder now questions the ...