<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: StatusCake</title>
    <description>The latest articles on DEV Community by StatusCake (@statuscake).</description>
    <link>https://dev.to/statuscake</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3940187%2F5ab208f0-9a31-43c3-b2ea-e8db806ff43a.png</url>
      <title>DEV Community: StatusCake</title>
      <link>https://dev.to/statuscake</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/statuscake"/>
    <language>en</language>
    <item>
      <title>Turn StatusCake into a verified alerting flow with Hermes</title>
      <dc:creator>StatusCake</dc:creator>
      <pubDate>Tue, 19 May 2026 11:56:13 +0000</pubDate>
      <link>https://dev.to/statuscake/turn-statuscake-into-a-verified-alerting-flow-with-hermes-3nhg</link>
      <guid>https://dev.to/statuscake/turn-statuscake-into-a-verified-alerting-flow-with-hermes-3nhg</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/hermes-agent-2026-05-15"&gt;Hermes Agent Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;Most monitoring setups have the same weak spot.&lt;/p&gt;

&lt;p&gt;Detection is easy. Decision-making is not.&lt;/p&gt;

&lt;p&gt;StatusCake is good at telling you that something might be wrong. What happens next is where things usually get messy. One alert goes straight to a chat room. Another wakes the wrong person. A third turns out to be missed and never verified because the site had a brief wobble and recovered before anyone looked.&lt;/p&gt;

&lt;p&gt;Hermes is useful in that gap.&lt;/p&gt;

&lt;p&gt;Instead of treating StatusCake as the whole incident workflow, you can put Hermes behind the webhook and use it as a small verification and routing layer. The pattern is simple:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;StatusCake detects the issue&lt;/li&gt;
&lt;li&gt;Hermes receives the webhook&lt;/li&gt;
&lt;li&gt;Hermes logs the raw request&lt;/li&gt;
&lt;li&gt;Hermes checks the target from its own network perspective&lt;/li&gt;
&lt;li&gt;Hermes decides whether the alert is actually worth escalating&lt;/li&gt;
&lt;li&gt;Hermes sends the notification and keeps a local event history&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That gives you something better than "dump every alert into a channel" without dragging in a full incident-management platform.&lt;/p&gt;

&lt;h2&gt;
  
  
  The short version
&lt;/h2&gt;

&lt;p&gt;StatusCake detects. Hermes verifies, routes, emails, and remembers.&lt;/p&gt;

&lt;p&gt;That is the whole idea.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why bother adding Hermes at all?
&lt;/h2&gt;

&lt;p&gt;Because most teams do not want every uptime alert treated the same way.&lt;/p&gt;

&lt;p&gt;A &lt;code&gt;down&lt;/code&gt; alert at 2:14pm during business hours is not the same as a &lt;code&gt;down&lt;/code&gt; alert at 2:14am. A monitor flapping for ten seconds is not always the same as a real outage. And when something goes wrong, you usually want a record of what arrived, what Hermes saw, and who got notified.&lt;/p&gt;

&lt;p&gt;Hermes gives you a good place to add that logic.&lt;/p&gt;

&lt;p&gt;In this setup it does four jobs:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;It receives the webhook in a deterministic way.&lt;/li&gt;
&lt;li&gt;It verifies whether the site is actually down from where Hermes is running.&lt;/li&gt;
&lt;li&gt;It applies simple time-of-day routing.&lt;/li&gt;
&lt;li&gt;It keeps a JSONL incident history and can send a daily summary by email.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;That is enough to make a cheap monitoring stack feel much more operationally sane.&lt;/p&gt;

&lt;h2&gt;
  
  
  The architecture
&lt;/h2&gt;

&lt;p&gt;This is intentionally small.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;StatusCake
  -&amp;gt; webhook POST
  -&amp;gt; local receiver beside Hermes
  -&amp;gt; raw request log
  -&amp;gt; verification step
  -&amp;gt; escalation decision
  -&amp;gt; immediate email notification
  -&amp;gt; JSONL event history
  -&amp;gt; daily summary email
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;There is no requirement to package this as a formal Hermes skill on day one. If your goal is to get a useful workflow running quickly, an ad hoc setup with a few scripts and a config file is enough.&lt;/p&gt;

&lt;p&gt;That is the version I would recommend first.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why webhook-first is the right move
&lt;/h2&gt;

&lt;p&gt;You could try to build something around email parsing, but that is the wrong direction here.&lt;/p&gt;

&lt;p&gt;Webhook delivery is cleaner for three reasons:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;StatusCake already supports it natively&lt;/li&gt;
&lt;li&gt;you keep the alert payload structured instead of scraping email text&lt;/li&gt;
&lt;li&gt;debugging gets much easier because you can log the exact request body and headers&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In practice, that last bit matters more than people expect. When an alert does not behave the way you thought it would, the first question is usually: did StatusCake send what I think it sent?&lt;/p&gt;

&lt;p&gt;If you keep a raw inbound log, you can answer that immediately.&lt;/p&gt;

&lt;h2&gt;
  
  
  The receiver can stay tiny
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F03xz73ju62cm9a11bx4l.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F03xz73ju62cm9a11bx4l.png" alt=" " width="800" height="360"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;You do not need a huge service for this. A small local HTTP receiver is enough.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# scripts/statuscake_webhook_receiver.py
&lt;/span&gt;&lt;span class="n"&gt;python3&lt;/span&gt; &lt;span class="n"&gt;scripts&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;statuscake_webhook_receiver&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;py&lt;/span&gt; \
  &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt; &lt;span class="n"&gt;var&lt;/span&gt;&lt;span class="o"&gt;/&lt;/span&gt;&lt;span class="n"&gt;escalation&lt;/span&gt;&lt;span class="o"&gt;-&lt;/span&gt;&lt;span class="n"&gt;config&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;json&lt;/span&gt; \
  &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;host&lt;/span&gt; &lt;span class="mf"&gt;127.0&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="mf"&gt;0.1&lt;/span&gt; \
  &lt;span class="o"&gt;--&lt;/span&gt;&lt;span class="n"&gt;port&lt;/span&gt; &lt;span class="mi"&gt;8934&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Expose that locally running receiver with whatever public entrypoint you trust for machine-to-machine POSTs. For demos, a Cloudflare quick tunnel is usually fine. The important part is that StatusCake can reach:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;https://&amp;lt;public-host&amp;gt;/webhooks/statuscake-alerts
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;And before you test from StatusCake, check both health endpoints:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl http://127.0.0.1:8934/health
curl https://&amp;lt;public-host&amp;gt;/health
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;One small gotcha: loading &lt;code&gt;/webhooks/statuscake-alerts&lt;/code&gt; in a browser proves nothing. It is a POST route, not a page. Use &lt;code&gt;/health&lt;/code&gt; for GET checks and an actual POST when you test the webhook path.&lt;/p&gt;

&lt;h2&gt;
  
  
  Log first, validate second
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsmpl1v0xub2f75g1l26j.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fsmpl1v0xub2f75g1l26j.png" alt=" " width="800" height="611"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;One of the best decisions in this design is to write the inbound request to disk before doing anything clever with it.&lt;/p&gt;

&lt;p&gt;That gives you two useful artifacts:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;var/last-webhook-payload.json&lt;/code&gt; for quick inspection&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;var/incoming-webhooks.jsonl&lt;/code&gt; as an append-only inbound ledger&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That means you can answer the boring but important questions later:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Did the webhook arrive?&lt;/li&gt;
&lt;li&gt;Was the payload malformed?&lt;/li&gt;
&lt;li&gt;Did the sender include the token?&lt;/li&gt;
&lt;li&gt;Did Hermes reject it, or was it never delivered?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Without that log, webhook debugging turns into folklore.&lt;/p&gt;

&lt;h2&gt;
  
  
  Verification is where this gets interesting
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fegzgn11h6lece4hl3qjy.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fegzgn11h6lece4hl3qjy.png" alt=" " width="800" height="863"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This is the part that makes Hermes more than a relay.&lt;/p&gt;

&lt;p&gt;When a &lt;code&gt;down&lt;/code&gt; alert comes in, Hermes does not need to panic immediately. It can probe the target itself and decide whether the outage looks real.&lt;/p&gt;

&lt;p&gt;A very simple version is enough:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;hit the health endpoint if you have one&lt;/li&gt;
&lt;li&gt;hit the public URL&lt;/li&gt;
&lt;li&gt;count failures&lt;/li&gt;
&lt;li&gt;only escalate if the failure threshold is met&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The config supports that directly:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"timezone"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"Europe/London"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"probe_timeout_seconds"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;8&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"min_failed_probes"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"probe_urls"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[],&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"notifications"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"immediate"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"type"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"email"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"transport"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"sendmail"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"from"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"statuscake-hermes@localhost"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"to"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"myagent@agentmail.com"&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"events"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"DOWN_CONFIRMED"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"UP_CONFIRMED"&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"subject_prefix"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"[StatusCake]"&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;A detail worth calling out: leaving &lt;code&gt;probe_urls&lt;/code&gt; empty is not a bug. In this setup, that tells Hermes to verify against the &lt;code&gt;website_url&lt;/code&gt; coming from the StatusCake payload. That is a good default when you want the monitor to carry its own target.&lt;/p&gt;

&lt;p&gt;If you do have a better health endpoint than the public homepage, use it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Time-aware escalation without buying more software
&lt;/h2&gt;

&lt;p&gt;A lot of teams want basic routing rules but do not want a full paging product yet.&lt;/p&gt;

&lt;p&gt;That is fine. Hermes can do the simple version well.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"escalation_schedule"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="nl"&gt;"windows"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"name"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"business-hours"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"start_hour"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;8&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"end_hour"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;18&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"target"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"primary-on-call"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"channel"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"discord:#ops-business-hours"&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"name"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"out-of-hours"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"start_hour"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;18&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"end_hour"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;24&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"target"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"after-hours-escalation"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"channel"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"discord:#ops-after-hours"&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"name"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"night-shift"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"start_hour"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"end_hour"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;8&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"target"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"night-duty-engineer"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
        &lt;/span&gt;&lt;span class="nl"&gt;"channel"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"discord:#ops-night"&lt;/span&gt;&lt;span class="w"&gt;
      &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
    &lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That is enough to stop every alert from behaving like a fire alarm.&lt;/p&gt;

&lt;h2&gt;
  
  
  Immediate email is often enough
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fn6ie0sye6uhbu8bfp5je.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fn6ie0sye6uhbu8bfp5je.png" alt=" " width="800" height="759"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;You do not need to overcomplicate delivery either.&lt;/p&gt;

&lt;p&gt;For this build, immediate alert delivery is handled through &lt;code&gt;sendmail&lt;/code&gt;, which keeps the integration dead simple on a machine that already knows how to send mail.&lt;/p&gt;

&lt;p&gt;A confirmed &lt;code&gt;DOWN_CONFIRMED&lt;/code&gt; or &lt;code&gt;UP_CONFIRMED&lt;/code&gt; event triggers an email right away. In our case, that means messages go straight to &lt;code&gt;myagent@agentmail.com&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;The CLI can also send a daily summary on demand:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;python3 scripts/statuscake_alert_engine.py send-daily-summary-email &lt;span class="se"&gt;\&lt;/span&gt;
  &lt;span class="nt"&gt;--config&lt;/span&gt; var/escalation-config.json &lt;span class="se"&gt;\&lt;/span&gt;
  &lt;span class="nt"&gt;--print-result&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;And if you want the recap every morning, schedule the wrapper script:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;python3 scripts/statuscake_daily_summary_email.py
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;In this project, that job is scheduled for 09:00 Europe/London each day.&lt;/p&gt;

&lt;h2&gt;
  
  
  The event store is deliberately boring
&lt;/h2&gt;

&lt;p&gt;Every verified event gets appended to &lt;code&gt;var/alerts.jsonl&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;That sounds plain because it is plain. That is also why it is useful.&lt;/p&gt;

&lt;p&gt;A JSONL file gives you:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;a durable incident ledger&lt;/li&gt;
&lt;li&gt;easy grepping and ad hoc analysis&lt;/li&gt;
&lt;li&gt;simple daily summaries&lt;/li&gt;
&lt;li&gt;a way to compare raw StatusCake signals with Hermes-confirmed incidents&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You do not need a dashboard before you have a history.&lt;/p&gt;

&lt;p&gt;Start with the history.&lt;/p&gt;

&lt;h2&gt;
  
  
  A quick local test loop
&lt;/h2&gt;

&lt;p&gt;Once the receiver is up, you can test the full path locally with a form-encoded POST that looks like StatusCake.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-X&lt;/span&gt; POST http://127.0.0.1:8934/webhooks/statuscake-alerts &lt;span class="se"&gt;\&lt;/span&gt;
  &lt;span class="nt"&gt;-H&lt;/span&gt; &lt;span class="s1"&gt;'Content-Type: application/x-www-form-urlencoded'&lt;/span&gt; &lt;span class="se"&gt;\&lt;/span&gt;
  &lt;span class="nt"&gt;--data&lt;/span&gt; &lt;span class="s1"&gt;'website_name=Example+Storefront&amp;amp;website_url=https%3A%2F%2Fexample.com&amp;amp;status=down&amp;amp;check_rate=300&amp;amp;test_id=123456&amp;amp;alert_type=uptime'&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The response tells you a lot in one go:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;whether Hermes accepted the payload&lt;/li&gt;
&lt;li&gt;whether verification passed&lt;/li&gt;
&lt;li&gt;which escalation window matched&lt;/li&gt;
&lt;li&gt;whether the event was stored&lt;/li&gt;
&lt;li&gt;whether an email notification succeeded&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is the kind of tight feedback loop you want when you are building monitoring workflows.&lt;/p&gt;

&lt;h2&gt;
  
  
  What this setup is good for
&lt;/h2&gt;

&lt;p&gt;This pattern is a good fit if you want:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;fewer false positive escalations&lt;/li&gt;
&lt;li&gt;lightweight time-based routing&lt;/li&gt;
&lt;li&gt;a raw audit trail of webhook delivery&lt;/li&gt;
&lt;li&gt;immediate email notifications without a big notification stack&lt;/li&gt;
&lt;li&gt;simple daily incident recaps&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Final thought
&lt;/h2&gt;

&lt;p&gt;The useful part of this setup is not that Hermes can receive a webhook. Lots of tools can receive a webhook.&lt;/p&gt;

&lt;p&gt;The useful part is that Hermes adds judgment after detection.&lt;/p&gt;

&lt;p&gt;StatusCake tells you that something might be broken. Hermes can check whether it really looks broken, decide who should hear about it, send the email, and keep the record for tomorrow morning's summary.&lt;/p&gt;

&lt;p&gt;That is a much better workflow than forwarding every alert and hoping the humans sort it out.&lt;/p&gt;

</description>
      <category>hermesagentchallenge</category>
      <category>devchallenge</category>
      <category>agents</category>
      <category>monitoring</category>
    </item>
  </channel>
</rss>
