<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Alexander Zuber-Jatzke</title>
    <description>The latest articles on DEV Community by Alexander Zuber-Jatzke (@alexander_zuberjatzke_7e).</description>
    <link>https://dev.to/alexander_zuberjatzke_7e</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F4014028%2F8b0c1d20-fbdd-4acd-8f69-90830d1a3fb7.jpg</url>
      <title>DEV Community: Alexander Zuber-Jatzke</title>
      <link>https://dev.to/alexander_zuberjatzke_7e</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/alexander_zuberjatzke_7e"/>
    <language>en</language>
    <item>
      <title>I built an AI that predicts its own collapse</title>
      <dc:creator>Alexander Zuber-Jatzke</dc:creator>
      <pubDate>Fri, 03 Jul 2026 19:15:28 +0000</pubDate>
      <link>https://dev.to/alexander_zuberjatzke_7e/i-built-an-ai-that-predicts-its-own-collapse-17h2</link>
      <guid>https://dev.to/alexander_zuberjatzke_7e/i-built-an-ai-that-predicts-its-own-collapse-17h2</guid>
      <description>&lt;h2&gt;
  
  
  1. It predicts its own collapse
&lt;/h2&gt;

&lt;p&gt;This is the part I'm proudest of and least certain about, so let's lead with&lt;br&gt;
it.&lt;/p&gt;

&lt;p&gt;NEOTH keeps an internal event stream (every tool call, model request, agent&lt;br&gt;
dispatch, fallback, retry). A background observer scores every rolling window&lt;br&gt;
of that stream with seven variables — coupling between tools and agents,&lt;br&gt;
semantic convergence pressure, resource/context pressure, agent density,&lt;br&gt;
throughput headroom, and two "buffer" variables for diversity and fallback&lt;br&gt;
redundancy. Amplifiers over buffers gives a collapse score.&lt;/p&gt;

&lt;p&gt;When the score crosses a threshold, you get a warning. The failure definitions&lt;br&gt;
(agent loop, retry storm, context-limit failure, semantic degradation, …) are&lt;br&gt;
deterministic functions of the event stream, and they're &lt;strong&gt;pre-registered&lt;/strong&gt; —&lt;br&gt;
frozen before data collection, so this isn't hindsight curve-fitting. The&lt;br&gt;
predictor even self-calibrates: it tunes its threshold against its own hits and&lt;br&gt;
misses and reports a Brier score, so the accuracy claim is &lt;em&gt;measurable&lt;/em&gt; instead&lt;br&gt;
of asserted.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;neoth babel status     &lt;span class="c"&gt;# threshold, calibration, latest scores&lt;/span&gt;
neoth babel windows    &lt;span class="c"&gt;# the actual measurements, window by window&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The model behind it comes from an open research framework called&lt;br&gt;
&lt;a href="https://github.com/The-Geek-Freaks/delta-kosmologie" rel="noopener noreferrer"&gt;delta-kosmologie&lt;/a&gt;, which&lt;br&gt;
asks a genuinely falsifiable question: can one scalar family predict collapse&lt;br&gt;
across very different complex systems? NEOTH is its first production instrument.&lt;br&gt;
If you opt in (it's off by default, consent- and autonomy-gated), your instance&lt;br&gt;
can federate anonymised, content-free, cryptographically-signed measurements&lt;br&gt;
into a shared pool that tests the theory. If the math doesn't hold, that's a&lt;br&gt;
result too — and I'd rather find out before 1.0.&lt;/p&gt;
&lt;h2&gt;
  
  
  2. Memory with receipts
&lt;/h2&gt;

&lt;p&gt;An assistant that remembers you is only trustworthy if you can see &lt;em&gt;what&lt;/em&gt; it&lt;br&gt;
remembered and prove it wasn't quietly changed.&lt;/p&gt;

&lt;p&gt;In NEOTH, every sensitive action — a profile write, a provider call, a channel&lt;br&gt;
send, a plugin capability use — lands in an append-only, HMAC-chained&lt;br&gt;
write-ahead log. The SQLite views you query are just rebuildable projections&lt;br&gt;
over that log; the log is the source of truth.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;neoth verify              &lt;span class="c"&gt;# recompute the whole chain — tamper and it fails&lt;/span&gt;
neoth wal show &lt;span class="nt"&gt;--last&lt;/span&gt; 20  &lt;span class="c"&gt;# every sensitive frame, in order&lt;/span&gt;
neoth profile pending     &lt;span class="c"&gt;# nothing enters your profile without approval&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The trust anchor is a key on your disk, not a sentence in a README. That's the&lt;br&gt;
difference between "we take privacy seriously" and something you can actually&lt;br&gt;
falsify.&lt;/p&gt;
&lt;h2&gt;
  
  
  3. Fail-closed by default
&lt;/h2&gt;

&lt;p&gt;The last idea is a posture, not a feature. Crossing a trust boundary — a cloud&lt;br&gt;
call, extracting your profile to a cloud model, sending to a channel, raising&lt;br&gt;
autonomy, a plugin using a capability — is &lt;strong&gt;denied by default&lt;/strong&gt; until you&lt;br&gt;
grant it once, on purpose. Both the grant and the refusal are logged.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;neoth preset activate fully-local
neoth privacy audit &lt;span class="nt"&gt;--last&lt;/span&gt; 30d   &lt;span class="c"&gt;# exactly what left the device — zero, locally&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;For plugins specifically, this is a real sandbox: NEOTH runs WASM plugins&lt;br&gt;
(wasmtime) with fuel and memory caps, and a plugin can only use the hostcalls&lt;br&gt;
its manifest declared and you approved. An over-level call is refused at&lt;br&gt;
runtime and written to the audit log as a &lt;code&gt;0xC7 PLUGIN_CAP_DENIED&lt;/code&gt; frame — never&lt;br&gt;
silent.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Rust, and who it's for
&lt;/h2&gt;

&lt;p&gt;NEOTH is a single Rust daemon. That bought me a few things that matter for this&lt;br&gt;
kind of project: a WASM host with hard resource caps, a sealed typestate&lt;br&gt;
&lt;code&gt;PermissionToken&amp;lt;T&amp;gt;&lt;/code&gt; that enforces plugin capabilities at compile time, and the&lt;br&gt;
general property that the audit-critical paths don't have a garbage collector or&lt;br&gt;
a runtime surprising me.&lt;/p&gt;

&lt;p&gt;It's deliberately built for two audiences at once, which is the hard bet:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Normal users&lt;/strong&gt; get a GUI wizard that asks plain questions. No YAML required.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Operators&lt;/strong&gt; get the CLI, local models, the WAL, policies, a plugin sandbox,
n8n automation, and a private mesh over Tailscale/Hysteria.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That "both at once" goal is the single thing I most want held accountable, and&lt;br&gt;
the &lt;a href="https://github.com/The-Geek-Freaks/NEOTH#comparison" rel="noopener noreferrer"&gt;comparison table&lt;/a&gt; is&lt;br&gt;
honest about it: unfinished things are marked &lt;em&gt;Partial&lt;/em&gt; or &lt;em&gt;Goal&lt;/em&gt;, not &lt;em&gt;Yes&lt;/em&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try it, then try to break it
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/The-Geek-Freaks/NEOTH
&lt;span class="nb"&gt;cd &lt;/span&gt;NEOTH/SRC &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; cargo &lt;span class="nb"&gt;install&lt;/span&gt; &lt;span class="nt"&gt;--path&lt;/span&gt; neothd
neoth doctor
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It's pre-1.0, dual-licensed MIT/Apache. The most valuable thing you can do is&lt;br&gt;
run the &lt;a href="https://github.com/The-Geek-Freaks/NEOTH/blob/main/docs/evaluation.md" rel="noopener noreferrer"&gt;verify-it-yourself path&lt;/a&gt;&lt;br&gt;
and file an issue for any claim that doesn't reproduce — especially on the&lt;br&gt;
collapse-prediction model, which is the part I'm least sure generalises.&lt;/p&gt;

&lt;p&gt;Repo: &lt;strong&gt;&lt;a href="https://github.com/The-Geek-Freaks/NEOTH" rel="noopener noreferrer"&gt;https://github.com/The-Geek-Freaks/NEOTH&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>openclaw</category>
      <category>hermes</category>
    </item>
    <item>
      <title>I built an AI that predicts its own collapse... 

## 1. It predicts its own collapse

This is the part I'm proudest of and least certain about, so let's lead with
it.

NEOTH keeps an internal event stream (every tool call, model request, agent
dispatch, fa</title>
      <dc:creator>Alexander Zuber-Jatzke</dc:creator>
      <pubDate>Fri, 03 Jul 2026 19:14:17 +0000</pubDate>
      <link>https://dev.to/alexander_zuberjatzke_7e/i-built-an-ai-that-predicts-its-own-collapse-1-it-predicts-its-own-collapse-this-is-40ea</link>
      <guid>https://dev.to/alexander_zuberjatzke_7e/i-built-an-ai-that-predicts-its-own-collapse-1-it-predicts-its-own-collapse-this-is-40ea</guid>
      <description></description>
    </item>
  </channel>
</rss>
