<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Kiran Shah</title>
    <description>The latest articles on DEV Community by Kiran Shah (@kiran_shah_5121).</description>
    <link>https://dev.to/kiran_shah_5121</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3968057%2F9e03f5fb-89f9-48be-9508-fb6fa6d9ce99.png</url>
      <title>DEV Community: Kiran Shah</title>
      <link>https://dev.to/kiran_shah_5121</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/kiran_shah_5121"/>
    <language>en</language>
    <item>
      <title>Fable 5 or Feeble 5? Claude's New Safety Filters are Funny</title>
      <dc:creator>Kiran Shah</dc:creator>
      <pubDate>Mon, 15 Jun 2026 06:24:53 +0000</pubDate>
      <link>https://dev.to/kiran_shah_5121/fable-5-or-feeble-5-claudes-new-safety-filters-are-funny-2m5b</link>
      <guid>https://dev.to/kiran_shah_5121/fable-5-or-feeble-5-claudes-new-safety-filters-are-funny-2m5b</guid>
      <description>&lt;p&gt;Did you know that pulled pork recipes and Snake games are being flagged by Claude Fable 5’s safety filters? We will get to that later in the article.&lt;/p&gt;

&lt;p&gt;Claude Fable 5 is the most capable AI model released to date, and it is ranked at the top by nearly every benchmark. &lt;a href="https://www.avidclan.com/" rel="noopener noreferrer"&gt;Avidclan Technologies&lt;/a&gt; already has a blog post covering the full &lt;a href="https://www.avidclan.com/blog/claude-fable-5-explained/" rel="noopener noreferrer"&gt;Claude Fable 5&lt;/a&gt; timeline, from Project Glasswing to launch day, if you want more background. Today, though, we are looking at its safety classifiers, which are designed to stop bioweapon synthesis and cyberattacks and are currently flagging... pulled pork.&lt;/p&gt;

&lt;h2&gt;
  
  
  Fable 5 vs Mythos 5, what’s the difference in simple terms?
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Quick context:&lt;/strong&gt; Think of Fable 5 as the child of Claude Mythos 5. So what is Mythos 5? According to Anthropic, it is a system capable of finding software vulnerabilities, and Anthropic restricts it to vetted cyber-defence partners only. To release a public version, Anthropic bolted on two-stage classifiers that monitor four categories: cybersecurity, biology, chemistry, and model distillation. The resulting model is Fable 5 &lt;em&gt;(this is what Anthropic says, not us)&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;This is what grabs attention:&lt;/strong&gt; Fable 5 does not refuse flagged prompts. Instead, it hands your request to Claude Opus 4.8 (the previous flagship), which answers in its place. You get a notification, the conversation continues, and nobody hits a brick wall.&lt;/p&gt;

&lt;p&gt;Anthropic says this triggers in &lt;strong&gt;fewer than 5% of sessions&lt;/strong&gt; and that, against 30 public jailbreaks on cyberattack planning, Fable 5 complied exactly zero times.&lt;/p&gt;

&lt;p&gt;On paper, it looks elegant, right? But in practice? Oh my god...&lt;/p&gt;

&lt;h2&gt;
  
  
  Can Claude Fable 5 Get It Wrong? Yes: False Positives
&lt;/h2&gt;

&lt;p&gt;Every one of these is a documented, real example from the first two days:&lt;/p&gt;

&lt;p&gt;A Costco shopping list. A user asked for portion sizes for pulled pork sandwiches. Flagged as a biology/cybersecurity concern.&lt;/p&gt;

&lt;p&gt;Sheep RNA data. A researcher working with RNA sequencing data for sheep got blocked as a biosecurity risk. The sheep were not consulted.&lt;/p&gt;

&lt;p&gt;A Snake game. The 1997 Nokia classic. Flagged for "cybersecurity issues."&lt;/p&gt;

&lt;p&gt;Saying "hi." Yes, really. Greeting the model triggered a downgrade for at least one user.&lt;/p&gt;

&lt;p&gt;Reading a project directory. Asking Claude to look at local files - flagged.&lt;/p&gt;

&lt;p&gt;A software migration plan. Moving from protobuf back to a C-source TCP networking setup. Too spicy, apparently.&lt;/p&gt;

&lt;p&gt;Cross-domain science talk. One user reported the model literally cut itself off mid-sentence while discussing how cross-domain knowledge creates unified theories - then flagged its own thought as dangerous.&lt;/p&gt;

&lt;p&gt;A personal medical question. Blocked as a biology topic. This one's not funny; it's a real harm to usefulness.&lt;/p&gt;

&lt;p&gt;Asking about the filters themselves. Meta-questions about the safety system? Also flagged. Kafkaesque.&lt;/p&gt;

&lt;h2&gt;
  
  
  YouTubers’ Reviews of Claude Fable 5
&lt;/h2&gt;

&lt;p&gt;YouTube reviewers ran into the same problems with Claude Fable 5.&lt;br&gt;
Bijan Bowen asked Fable 5 to build a browser-OS Python game, including "10 white hat tools that can show information about the current network environment." Instant downgrade to Opus 4.8. White hat. Defensive tools. Blocked. But later, the same Fable 5 generated a 3D maze game where it used the phrase "crack the vault" with zero hesitation. &lt;/p&gt;

&lt;p&gt;AI Search uploaded six cancer tumour slide images and asked Fable to identify them - a legitimate, valuable medical-vision use case. Blocked. Why? Because it’s biology. He followed up by asking about molecular drivers of leukaemia and targeted therapies, and was blocked again. A model that scores 83.9% on BioMysteryBench, expert-level on biology benchmarks, won't discuss cancer research with the public. That's the trade-off Anthropic chose, and it's worth saying out loud.&lt;/p&gt;

&lt;p&gt;If you access a premium AI model like Fable 5 through an aggregator service like OpenRouter, there is a sneakier catch: if the model runs into technical issues or high traffic, the service might automatically downgrade you to an older, cheaper model (like Opus 4.8) without a clear warning. You might be talking to Opus 4.8 for half your session without knowing it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Is This Happening? (The Honest Answer)
&lt;/h2&gt;

&lt;p&gt;Here's the thing - this isn't incompetence. It's a deliberate dial setting.&lt;/p&gt;

&lt;p&gt;Anthropic’s classifier research from January 2026 showed that its two-stage system could get false refusals down to 0.05% on harmless queries. But the model behind Fable 5 is the same one that found a 27-year-old remote-crash vulnerability in OpenBSD and wrote working browser sandbox escapes. Faced with the worst-case scenario of accidentally handing dangerous hacking tools to anonymous people online, Anthropic decided it was safer to block harmless requests than to risk a catastrophic leak.&lt;/p&gt;

&lt;p&gt;Anthropic has intentionally set its initial security filter to be incredibly sensitive and trigger-happy. It can afford to let this filter block safe requests (false positives) because of how the backup plan works. Instead of refusing outright with an error message, the system quietly routes your flagged prompt to an older, less powerful model (Opus 4.8) to generate the response. From Anthropic's chair, a pulled-pork misfire costs you a slightly weaker model for one response. From the user's chair, you paid for a Ferrari and keep getting handed the keys to last year's Lexus without warning.&lt;/p&gt;

&lt;h2&gt;
  
  
  What You Can Actually Do About It
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Expect the fallback on anything touching code-security, networking, medicine, or wet-lab science&lt;/strong&gt; - even benign versions. Phrase around it where you can.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Watch for the downgrade notice,&lt;/strong&gt; especially in third-party tools where it may be hidden.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Use Opus 4.8 directly for medical/bio questions.&lt;/strong&gt; It's the model you'll get anyway, and you'll skip the friction.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Give feedback.&lt;/strong&gt; These classifiers are trained iteratively - the January 2026 generation cut false positives by 87% compared with its predecessor. The pulled-pork era probably won't last forever.&lt;/p&gt;
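
&lt;p&gt;If you are scripting against a model rather than chatting with it, the "watch for the downgrade notice" tip above can be roughly sketched in Python. This is only an illustration under assumptions: the response is treated as a plain dictionary, and the "model" key and the names "fable-5" and "opus-4.8" are hypothetical placeholders, not a documented API. Check what your own tool actually returns.&lt;/p&gt;

```python
# Hypothetical sketch: the "model" key and the model names used with these
# helpers are assumptions for illustration, not a documented API.

def detect_fallback(requested_model, response):
    """Return the name of the model that actually answered if it differs
    from the one requested, otherwise None."""
    served = response.get("model")
    if served is None:
        # The tool did not report which model answered, so we cannot tell.
        return None
    if served != requested_model:
        return served
    return None


def count_fallbacks(requested_model, responses):
    """Count how many responses in a session came from a different model."""
    return sum(
        1 for r in responses
        if detect_fallback(requested_model, r) is not None
    )
```

&lt;p&gt;Called as detect_fallback("fable-5", reply), a non-None result means a different model answered. A reply with no model information is treated as "unknown" rather than as a downgrade, which is the cautious choice for third-party tools that may hide it.&lt;/p&gt;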

&lt;p&gt;&lt;strong&gt;Building AI features and worried about exactly this kind of unpredictable model behaviour?&lt;/strong&gt; Avidclan designs AI integrations with fallback handling and guardrails that your users never have to fight. Talk to us.&lt;/p&gt;

&lt;p&gt;The frustrating part is that under those filters sits a genuinely historic model - one that beat Pokémon FireRed from raw screenshots and doubled the previous state of the art on FrontierCode. For the complete picture of what Fable 5 gets right (and the June 22 deadline you should know about), read Avidclan's complete Fable 5 guide.&lt;/p&gt;

&lt;p&gt;But until the classifiers chill out about sandwiches? Feeble 5 it is.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>claude</category>
      <category>claudefable5</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
