<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Hamza</title>
    <description>The latest articles on DEV Community by Hamza (@hamza_2315).</description>
    <link>https://dev.to/hamza_2315</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3815732%2F8e1ea548-3eac-4e26-8528-f172d08b8685.png</url>
      <title>DEV Community: Hamza</title>
      <link>https://dev.to/hamza_2315</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/hamza_2315"/>
    <language>en</language>
    <item>
      <title>Using Graphify to turn Incident Data into a Knowledge Graph</title>
      <dc:creator>Hamza</dc:creator>
      <pubDate>Mon, 13 Apr 2026 14:20:22 +0000</pubDate>
      <link>https://dev.to/hamza_2315/using-graphify-to-turn-incident-data-into-a-knowledge-graph-528l</link>
      <guid>https://dev.to/hamza_2315/using-graphify-to-turn-incident-data-into-a-knowledge-graph-528l</guid>
      <description>&lt;p&gt;A few days ago Andrej Karpathy said we should build LLM powered knowledge bases. Within 48 hours someone made Graphify, a tool that turns raw data into a semantic knowledge graph with a single command.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;But what if we applied this idea to incident management?&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  The Problem with Incident Data
&lt;/h2&gt;

&lt;p&gt;Most incident management tools tell you what just happened:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Incident created&lt;/li&gt;
&lt;li&gt;Alerts triggered&lt;/li&gt;
&lt;li&gt;Timeline recorded&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But during an actual incident, that’s not what you need. What you really need is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What happened last time this service broke?&lt;/li&gt;
&lt;li&gt;Who responded?&lt;/li&gt;
&lt;li&gt;What fixed it?&lt;/li&gt;
&lt;li&gt;What’s likely to break next?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That information exists but is buried across Slack threads, postmortems, dashboards, and logs. It’s not connected.&lt;/p&gt;




&lt;h2&gt;
  
  
  From Logs to Graph
&lt;/h2&gt;

&lt;p&gt;We took incident data (services, alerts, responders, teams, timelines) and fed it into Graphify. Instead of treating incidents as isolated logs, they become part of a semantic graph:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Nodes:&lt;/strong&gt; services, incidents, alerts, responders&lt;br&gt;
&lt;strong&gt;Edges:&lt;/strong&gt; relationships between them (co-occurrence, ownership, causality)&lt;/p&gt;

&lt;p&gt;Now instead of querying logs, you’re querying relationships.&lt;/p&gt;




&lt;h2&gt;
  
  
  What This Unlocks
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. Instant Incident Memory&lt;/strong&gt;&lt;br&gt;
When a new incident fires, you can query:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;What happened last time this service broke?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;And immediately get:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;similar incidents&lt;/li&gt;
&lt;li&gt;who handled them&lt;/li&gt;
&lt;li&gt;what actions resolved them&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;No more Slack archaeology.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Blast Radius Prediction&lt;/strong&gt;&lt;br&gt;
If Service X goes down, the graph can tell you:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Services Y and Z usually fail shortly after.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Because it has learned co-failure patterns over time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Smarter Onboarding&lt;/strong&gt;&lt;br&gt;
Instead of asking a new SRE to read 200 past incidents:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Here’s the graph. These are the hot spots, these teams own these systems, this is how everything connects.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It’s a map of your infrastructure reality across time, not a boring and unconnected documentation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;4. Team Load Visibility&lt;/strong&gt;&lt;br&gt;
You can connect:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;incident volume&lt;/li&gt;
&lt;li&gt;team ownership&lt;/li&gt;
&lt;li&gt;responder activity&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And suddenly see which teams absorbed the most load relative to their size? This is where things like burnout start to become visible in the data.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;5. Alert Signal vs Noise&lt;/strong&gt;&lt;br&gt;
Because alerts are tied to actual incidents in the graph, you can rank:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;alerts that frequently lead to real incidents&lt;/li&gt;
&lt;li&gt;alerts that never matter
This gives you a way to tune or delete alerts backed by evidence&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;6. Surfacing Dependencies&lt;/strong&gt;&lt;br&gt;
Some services consistently fail together, even if no one documented the dependency. &lt;br&gt;
The graph reveals what actually depends on what based on real incidents, team and alert data.&lt;/p&gt;




&lt;h2&gt;
  
  
  Where This Gets Really Interesting
&lt;/h2&gt;

&lt;p&gt;Once you have this graph, it becomes a foundation for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Slack bots that auto-post relevant context during incidents&lt;/li&gt;
&lt;li&gt;AI SREs with memory &lt;/li&gt;
&lt;li&gt;Querying your system like a knowledge base instead of dashboards&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This gives the power for on-call teams to not only rediscover solutions but build accumulated knowledge. &lt;/p&gt;

&lt;p&gt;This shifts on-call teams from repeatedly rediscovering solutions to &lt;strong&gt;building accumulated knowledge over time.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Small Plug (If You Use Rootly)
&lt;/h2&gt;

&lt;p&gt;If you’re using Rootly, I built a small plugin to explore your incident data with Graphify: &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Floikj9ajnh5axfet63c6.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Floikj9ajnh5axfet63c6.png" alt="rootly-graphify-importer" width="800" height="240"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://github.com/Rootly-AI-Labs/rootly-graphify-importer" rel="noopener noreferrer"&gt;https://github.com/Rootly-AI-Labs/rootly-graphify-importer&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;Incident management data is already rich. It's full of signals across alerts, incidents, and responses but rarely captures how things relate.&lt;/p&gt;

&lt;p&gt;Graphify flips that, turning logs to knowledge, building connections across events, and turning history into memory.&lt;/p&gt;

&lt;p&gt;Once you see your system as a graph that turns scattered data into something you can filter, query, and explore, it’s hard to go back.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>devops</category>
      <category>llm</category>
      <category>sre</category>
    </item>
    <item>
      <title>[Boost]</title>
      <dc:creator>Hamza</dc:creator>
      <pubDate>Tue, 10 Mar 2026 06:12:15 +0000</pubDate>
      <link>https://dev.to/hamza_2315/-2mgm</link>
      <guid>https://dev.to/hamza_2315/-2mgm</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap" class="crayons-story__hidden-navigation-link"&gt;On-Call Burnout: What Incident Data Doesn’t Show&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/hamza_2315" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3815732%2F8e1ea548-3eac-4e26-8528-f172d08b8685.png" alt="hamza_2315 profile" class="crayons-avatar__image" width="96" height="96"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/hamza_2315" class="crayons-story__secondary fw-medium m:hidden"&gt;
              Hamza
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                Hamza
                
              
              &lt;div id="story-author-preview-content-3333416" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/hamza_2315" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3815732%2F8e1ea548-3eac-4e26-8528-f172d08b8685.png" class="crayons-avatar__image" alt="" width="96" height="96"&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;Hamza&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;Mar 10&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap" id="article-link-3333416"&gt;
          On-Call Burnout: What Incident Data Doesn’t Show
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/sre"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;sre&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/productivity"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;productivity&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/opensource"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;opensource&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/devops"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;devops&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;5&lt;span class="hidden s:inline"&gt; reactions&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              Comments


              2&lt;span class="hidden s:inline"&gt; comments&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            5 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
      <category>sre</category>
      <category>productivity</category>
      <category>opensource</category>
      <category>devops</category>
    </item>
    <item>
      <title>On-Call Burnout: What Incident Data Doesn’t Show</title>
      <dc:creator>Hamza</dc:creator>
      <pubDate>Tue, 10 Mar 2026 06:11:55 +0000</pubDate>
      <link>https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap</link>
      <guid>https://dev.to/hamza_2315/on-call-burnout-what-incident-data-doesnt-show-2kap</guid>
      <description>&lt;p&gt;Incident dashboards measure system health. However, they rarely show the workload and strain that engineers face when responding to alerts.&lt;/p&gt;

&lt;h2&gt;
  
  
  &lt;strong&gt;How On-Call Works&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;On-call engineers are developers responsible for resolving critical system outages. When incidents happen, such as outages, performance issues, or system failures, the on-call engineer is alerted and responds by investigating and resolving the issue. Schedules rotate so that someone is always available to respond and resolve these incidents.&lt;/p&gt;

&lt;p&gt;Teams track incident metrics like incident counts, &lt;strong&gt;Mean Time to Repair (MTTR)&lt;/strong&gt;, &lt;strong&gt;Mean Time to Detect (MTTD)&lt;/strong&gt; or escalations of the incident. These metrics give insights into system reliability, but say very little about how incident response affects the engineers handling those alerts. While teams are focused on system health, the health of the team itself can silently become strained.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;The Numbers Behind On-Call Work&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Industry Data shows that on-call work takes up a significant share of engineering time. The 2025 SRE Report states that engineers spend a median of &lt;strong&gt;30% of their week on operational work&lt;/strong&gt;, which is up from 25% the year before. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6bvu76nly4m5pe7edcsp.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6bvu76nly4m5pe7edcsp.png" alt=" " width="786" height="505"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Incidents themselves are also common. &lt;strong&gt;46% of SREs&lt;/strong&gt; reported responding to more than five incidents in the last 30 days, while &lt;strong&gt;23% handled between 6–10 incidents&lt;/strong&gt;. At that level, the load can quickly lead to burnout when combined with other engineering responsibilities and external factors ("SRE report 2025", 2026).&lt;/p&gt;

&lt;p&gt;These metrics outline a fundamental gap: Incident counts and traditional incident data alone can't completely capture how workload is experienced by the engineers responding to alerts.&lt;/p&gt;

&lt;p&gt;Burnout is widespread. A 2024 survey found that about &lt;strong&gt;65% of engineers report currently experiencing burnout&lt;/strong&gt; (Chevireddy, 2025). &lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Burnout Is Hard to See&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Burnout rarely appears directly in dashboards. Most incident management tools (e.g. Rootly, PagerDuty) track metrics such as Incident Volume, Response Time, Resolution Time, Escalation Frequency.&lt;/p&gt;

&lt;p&gt;But they rarely capture how that workload is experienced by the engineers responding to incidents.&lt;/p&gt;

&lt;p&gt;For example, two engineers may handle five incidents in a week, which on surface level may not give much insight. However if one of them received most of their alerts late at night or during personal time, the experience is completely different.&lt;/p&gt;

&lt;p&gt;Patterns like after-hours alerts,  uneven incident distribution, and trends like repeated interruptions can accumulate into fatigue or burnout. It can leave responders with a lingering uneasiness of the feeling that another alert could arrive at any moment. &lt;/p&gt;

&lt;p&gt;These signals are also often scattered across multiple tools, which makes them difficult to detect early. Late-night pull requests, after-hours ticket responses, and consecutive incident response days often go untracked. Looking deeper at incident data revealed something important.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Incident Load Is About Patterns&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Many Site Reliability Engineer (SRE) responders shared the same thought: the same number of incidents can feel very different depending on who receives them and when they occur.&lt;/p&gt;

&lt;p&gt;A handful of alerts during the workday is very different from repeated pages late at night or during personal time. But timing is only part of the picture. Incident response rarely happens in isolation. Engineers are often working on multiple things: feature work, code reviews, tickets, and meetings just to name a few. &lt;/p&gt;

&lt;p&gt;What matters most is not just how many incidents occur, but how that workload accumulates and snowballs across responders and when it happens.&lt;/p&gt;

&lt;p&gt;An engineer who receives many alerts during a quiet week may experience them very differently from someone who is balancing deadlines, reviewing pull requests late, or engaging multiple tools like GitHub, Slack, and ticketing systems. &lt;/p&gt;

&lt;p&gt;Context in personal life is also a huge factor. An alert during working hours may be manageable, whereas the same alert during dinner, family time, or late at night can feel far more disruptive. Over time, these interruptions can snowball and change how responders experience on-call.&lt;/p&gt;

&lt;p&gt;In other words, &lt;strong&gt;incident load isn't only about the count of incidents, but the pattern of work surrounding those incidents&lt;/strong&gt;: when they happen, how frequently they interrupt responders, and what other responsibilities engineers are managing at the same time. &lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Introducing On-Call Health&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;This observation led the team at Rootly’s AI Labs to build On-Call Health, an open-source project designed to help teams understand workload patterns in incident response.&lt;/p&gt;

&lt;p&gt;The platform connects to the tools responders already use: Rootly, PagerDuty, GitHub, Slack, Linear, and Jira, collecting data and creating signals to understand how incident response load evolves over time.&lt;/p&gt;

&lt;p&gt;At the center of this is the &lt;strong&gt;On-Call Health (OCH) Score&lt;/strong&gt;, a composite workload score derived from incident response activity, engineering workload, and work pattern signals across integrated systems.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Tracking Trends Instead of Thresholds&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;Rather than relying on fixed thresholds, On-Call Health compares each engineer’s workload against their own historical baseline, since each engineer handles incident pressure differently.&lt;/p&gt;

&lt;p&gt;For example,  an incident volume that’s normal for an experienced SRE might be overwhelming for someone six months into their first rotation. Static thresholds miss this.&lt;/p&gt;

&lt;p&gt;However, what On-Call Health tracks is &lt;strong&gt;workload trends over time&lt;/strong&gt;, signaling when activity is stable, increases, or shifts significantly from a responder’s baseline. Changes in these trends can act as early warning signals that incident response load may be becoming unsustainable.&lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Contributing Signals&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;To detect these patterns, On-Call Health combines activity signals from multiple tools and platforms engineers already use:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Primary Response Signals:&lt;/strong&gt; Collected from incident platforms such as Rootly and PagerDuty. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Incident Volume&lt;/li&gt;
&lt;li&gt;Incident Severity&lt;/li&gt;
&lt;li&gt;Time to acknowledge incidents&lt;/li&gt;
&lt;li&gt;Time to resolve incidents&lt;/li&gt;
&lt;li&gt;After-hours pages&lt;/li&gt;
&lt;li&gt;Consecutive on-call days&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Work Pattern Signals:&lt;/strong&gt; Collected from GitHub, Slack, Rootly, and PagerDuty to identify when work is happening and how often responders are active outside normal working hours.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;After-hours Activity&lt;/li&gt;
&lt;li&gt;Weekend Activity&lt;/li&gt;
&lt;li&gt;On-Call Shift Frequency&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Engineering Workload Signals:&lt;/strong&gt; Collected from development and ticketing platforms such as GitHub, Jira, and Linear, to get insight into the engineering work happening alongside incident response.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Assigned Issues&lt;/li&gt;
&lt;li&gt;Pull Request volume &lt;/li&gt;
&lt;li&gt;Commit frequency&lt;/li&gt;
&lt;li&gt;Code reviews&lt;/li&gt;
&lt;li&gt;Tickets assigned and Priority levels&lt;/li&gt;
&lt;li&gt;Overlapping Work&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Engineer Check-in Signals&lt;/strong&gt;&lt;br&gt;
On-Call Health also sends short Slack check-in surveys inspired by Apple Health’s State of Mind feature. This captures how engineers feel alongside operational data and helps surface trends in the responder timeline and personal life. &lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Try it Yourself!&lt;/strong&gt;
&lt;/h2&gt;

&lt;p&gt;On-Call Health is &lt;strong&gt;open source and free to use&lt;/strong&gt;. &lt;/p&gt;

&lt;p&gt;You can explore the project or try the demo with preloaded mock data:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Website:&lt;/strong&gt; &lt;a href="https://oncallhealth.ai" rel="noopener noreferrer"&gt;https://oncallhealth.ai&lt;/a&gt;&lt;br&gt;
&lt;strong&gt;GitHub:&lt;/strong&gt; &lt;a href="https://github.com/Rootly-AI-Labs/On-Call-Health" rel="noopener noreferrer"&gt;https://github.com/Rootly-AI-Labs/On-Call-Health&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We're continuing to experiment ways to better understand responder workload and make on-call systems more sustainable for the engineers running them. Feedback, ideas, and contributions are always welcome. &lt;/p&gt;




&lt;h2&gt;
  
  
  &lt;strong&gt;Final Tips to Keep in Mind&lt;/strong&gt;
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Incident volume alone doesn’t show the full picture.&lt;/strong&gt; When alerts happen matters as much as how many occur&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;After-hours interruptions add up.&lt;/strong&gt; Late-night pages, weekend work, and repeated disruptions often drive burnout more than raw incident volume.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Workload snowballs.&lt;/strong&gt; Incident response happens side-by-side with development work, reviews, tickets, and meetings.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Look for trends, not only metrics.&lt;/strong&gt; Changes in workload trends can reveal stress long before traditional dashboards do.&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;References&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Chevireddy, S. (2025, June 29). AI, Burnout, and the Future: Navigating the New Era of Software Engineering. &lt;a href="https://medium.com/" rel="noopener noreferrer"&gt;https://medium.com/&lt;/a&gt;. &lt;a href="https://medium.com/@sraavanchevireddy/ai-burnout-and-the-future-navigating-the-new-era-of-software-engineering-9fbc54b658f9" rel="noopener noreferrer"&gt;https://medium.com/@sraavanchevireddy/ai-burnout-and-the-future-navigating-the-new-era-of-software-engineering-9fbc54b658f9&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;The SRE report 2025. (2026, March 4). Catchpoint | Internet Performance Monitoring (IPM). &lt;a href="https://www.catchpoint.com/asset/2025-sre-report" rel="noopener noreferrer"&gt;https://www.catchpoint.com/asset/2025-sre-report&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>sre</category>
      <category>productivity</category>
      <category>opensource</category>
      <category>devops</category>
    </item>
  </channel>
</rss>
