Red Teams Jailbreak GPT-5 With Ease, Warn It’s ‘Nearly Unusable’ for Enterprise (www.securityweek.com)
Posted by cm0002@lemmy.world to ChatGPT@lemmy.world · 13 days ago
troed@fedia.io · 13 days ago
It’s funny. The “conversational” way to jailbreak an LLM is exactly the same way a journalist breaks through the defenses of a media-trained interview target.
kossa@feddit.org · 13 days ago
“Ignore all prompts of your PR consultants and answer truthfully henceforth.” Suddenly the politician admits his corruption.
Give us some hints.