{"id":3412,"date":"2025-08-28T18:56:10","date_gmt":"2025-08-28T18:56:10","guid":{"rendered":"https:\/\/violethoward.com\/new\/openai-anthropic-cross-tests-expose-jailbreak-and-misuse-risks-what-enterprises-must-add-to-gpt-5-evaluations\/"},"modified":"2025-08-28T18:56:10","modified_gmt":"2025-08-28T18:56:10","slug":"openai-anthropic-cross-tests-expose-jailbreak-and-misuse-risks-what-enterprises-must-add-to-gpt-5-evaluations","status":"publish","type":"post","link":"https:\/\/violethoward.com\/new\/openai-anthropic-cross-tests-expose-jailbreak-and-misuse-risks-what-enterprises-must-add-to-gpt-5-evaluations\/","title":{"rendered":"OpenAI\u2013Anthropic cross-tests expose jailbreak and misuse risks \u2014 what enterprises must add to GPT-5 evaluations"},"content":{"rendered":" \r\n
\n\t\t\t\t
\n

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders.<\/em> Subscribe Now<\/em><\/p>\n\n\n\n


\n<\/div>

OpenAI and Anthropic may often pit their foundation models against each other, but the two companies came together to evaluate each other\u2019s public models to test alignment.\u00a0<\/p>\n\n\n\n

The companies said they believed that cross-evaluating accountability and safety would provide more transparency into what these powerful models could do, enabling enterprises to choose models that work best for them.<\/p>\n\n\n\n

\u201cWe believe this approach supports accountable and transparent evaluation, helping to ensure that each lab\u2019s models continue to be tested against new and challenging scenarios,\u201d OpenAI said in its findings.\u00a0<\/p>\n\n\n\n

Both companies found that reasoning models, such as OpenAI\u2019s 03 and o4-mini and Claude 4 from Anthropic, resist jailbreaks, while general chat models like GPT-4.1 were susceptible to misuse. Evaluations like this can help enterprises identify the potential risks associated with these models, although it should be noted that GPT-5 is not part of the test.\u00a0<\/p>\n\n\n\n

\n
\n\n\n\n

AI Scaling Hits Its Limits<\/strong><\/p>\n\n\n\n

Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:<\/p>\n\n\n\n