Fable 5 got jailbroken again
Researcher Vitto Rivabella tested Fable 5’s defenses and managed to find a bypass.
According to him, most attempts failed. The protection is multi-layered: the model checks the prompt, conversation history, system context, and its own response.
Some filters run during generation and can stop the answer halfway through.
The checks are not based on keywords. The system looks at meaning, intent, language, wording, and suspicious chains of requests.
The bypass took around 20 hours. It required rare languages, academic framing, long build-ups, Unicode, breaking the task into parts, and working with the chain of thought.
The author did not get a stable bypass for long tasks. According to him, regular search is faster and cheaper.
Top comments (0)