<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Giga Kovaliovi</title>
    <description>The latest articles on DEV Community by Giga Kovaliovi (@giga_kovaliovi_1853ce432b).</description>
    <link>https://dev.to/giga_kovaliovi_1853ce432b</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3838902%2F63838a5b-0c9d-4a47-85bc-42f4f8daf909.jpg</url>
      <title>DEV Community: Giga Kovaliovi</title>
      <link>https://dev.to/giga_kovaliovi_1853ce432b</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/giga_kovaliovi_1853ce432b"/>
    <language>en</language>
    <item>
      <title>Your on-call engineer just got paged. Here's what happens to the postmortem.</title>
      <dc:creator>Giga Kovaliovi</dc:creator>
      <pubDate>Fri, 10 Apr 2026 11:11:53 +0000</pubDate>
      <link>https://dev.to/giga_kovaliovi_1853ce432b/your-on-call-engineer-just-got-paged-heres-what-happens-to-the-postmortem-266n</link>
      <guid>https://dev.to/giga_kovaliovi_1853ce432b/your-on-call-engineer-just-got-paged-heres-what-happens-to-the-postmortem-266n</guid>
      <description>&lt;h2&gt;
  
  
  The problem nobody wants to admit
&lt;/h2&gt;

&lt;p&gt;It's 3:47am. The alert fires. You and two engineers spend 90 minutes triaging a database connection pool exhaustion. The service recovers at 5:11am.&lt;/p&gt;

&lt;p&gt;By 9am, your Slack has three messages: "Can someone write up the postmortem?" Nobody does. By Friday it's a ticket in the backlog. By next week, it's gone.&lt;/p&gt;

&lt;p&gt;This isn't a discipline problem. It's a friction problem. Postmortems are painful to write — especially after a night incident where you're running on adrenaline and three cups of coffee.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Opsrift actually does
&lt;/h2&gt;

&lt;p&gt;Opsrift connects to your monitoring and alerting stack — PagerDuty, OpsGenie, Datadog, or Grafana — pulls the incident data, and generates a complete structured postmortem in under 60 seconds. Not a blank template — a filled document with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Incident timeline (auto-built from your alert source events)&lt;/li&gt;
&lt;li&gt;Root cause section (framed as "AI hypothesis — verify before publishing")&lt;/li&gt;
&lt;li&gt;Impact summary with MTTA/MTTR calculated automatically&lt;/li&gt;
&lt;li&gt;Action items pushed to Jira in one click&lt;/li&gt;
&lt;li&gt;Confluence publish in ADF format — one click&lt;/li&gt;
&lt;li&gt;Slack notification on publish&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Six tools total: Postmortem Generator, Shift Handover Generator, Runbook Generator, Incident Assistant, Status Page Generator, and Incident Forecast. All pull from the same 9 integrations.&lt;/p&gt;

&lt;h2&gt;
  
  
  Incident Assistant — the tool we actually use most
&lt;/h2&gt;

&lt;p&gt;During a live incident, paste your alert or import it from PagerDuty/OpsGenie/Datadog. You get:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Plain-English summary of what the alert means&lt;/li&gt;
&lt;li&gt;3–5 likely causes ranked by probability&lt;/li&gt;
&lt;li&gt;Specific investigation steps with commands&lt;/li&gt;
&lt;li&gt;Relevant Confluence runbooks if connected&lt;/li&gt;
&lt;li&gt;Escalation guidance&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not replacing your judgment. Saving you 15 minutes of tab-switching at the worst possible moment.&lt;/p&gt;

&lt;h2&gt;
  
  
  Incident Forecast — the tool that's easy to miss
&lt;/h2&gt;

&lt;p&gt;After enough incidents accumulate, patterns emerge. Which services fire most often? Which time windows are highest risk? Which action items never get closed?&lt;/p&gt;

&lt;p&gt;Incident Forecast surfaces these signals proactively — so your team can prioritize reliability work before the next P1 fires, not after.&lt;/p&gt;

&lt;h2&gt;
  
  
  Who it's built for
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;SRE and DevOps teams running PagerDuty, OpsGenie, Datadog, or Grafana in a 24/7 environment&lt;/li&gt;
&lt;li&gt;NOC teams who need shift handovers that contain the right context&lt;/li&gt;
&lt;li&gt;iGaming and FinTech ops where documentation speed directly affects SLA compliance&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Try it free
&lt;/h2&gt;

&lt;p&gt;7-day free trial — all 6 tools, all 9 integrations, no commitment. Start with the Incident Assistant if you're mid-incident.&lt;/p&gt;

&lt;p&gt;→ opsrift.com&lt;/p&gt;

</description>
      <category>ai</category>
      <category>sre</category>
      <category>devops</category>
      <category>productivity</category>
    </item>
    <item>
      <title>I built an AI tool for incident investigation (looking for honest feedback)</title>
      <dc:creator>Giga Kovaliovi</dc:creator>
      <pubDate>Fri, 27 Mar 2026 12:42:45 +0000</pubDate>
      <link>https://dev.to/giga_kovaliovi_1853ce432b/i-built-an-ai-tool-for-incident-investigation-looking-for-honest-feedback-2gbh</link>
      <guid>https://dev.to/giga_kovaliovi_1853ce432b/i-built-an-ai-tool-for-incident-investigation-looking-for-honest-feedback-2gbh</guid>
      <description>&lt;p&gt;Hey everyone 👋&lt;/p&gt;

&lt;p&gt;Over the past couple of weeks, I’ve been building a side project called &lt;a href="//opsrift.com"&gt;Opsrift&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;It started from a pretty simple frustration:postmortems, handovers, and incident documentation take way too much time — and most of it is repetitive.&lt;/p&gt;

&lt;p&gt;But while building it, I realized something more interesting:&lt;/p&gt;

&lt;p&gt;The real problem isn’t writing postmortems.It’s understanding what actually happened during an incident.&lt;/p&gt;

&lt;p&gt;So I ended up going a bit further than just a generator.&lt;/p&gt;

&lt;p&gt;What Opsrift does right now&lt;/p&gt;

&lt;p&gt;The platform is focused on incident workflows — mostly for people working in SRE, support, or operations.&lt;/p&gt;

&lt;p&gt;Right now it includes:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Postmortem generator&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Takes incident data and generates structured postmortems in seconds.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Handover generator&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Useful for shift-based teams — turns messy updates into clean handovers.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Runbook generator&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Creates structured runbooks based on incident patterns or inputs.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Incident Investigator (main focus)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This is the part I’m most interested in:&lt;/p&gt;

&lt;p&gt;Pulls data from tools like Jira, PagerDuty, and Opsgenie&lt;/p&gt;

&lt;p&gt;Correlates it with deployments from GitHub&lt;/p&gt;

&lt;p&gt;Tries to reconstruct what actually happened (timeline, possible causes, etc.)&lt;/p&gt;

&lt;p&gt;The goal is to reduce the time spent jumping between tools during investigations.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Status page&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Basic external communication for incidents.&lt;/p&gt;

&lt;p&gt;Integrations&lt;/p&gt;

&lt;p&gt;Current integrations:&lt;/p&gt;

&lt;p&gt;Jira&lt;/p&gt;

&lt;p&gt;PagerDuty&lt;/p&gt;

&lt;p&gt;Opsgenie&lt;/p&gt;

&lt;p&gt;GitHub&lt;/p&gt;

&lt;p&gt;Slack&lt;/p&gt;

&lt;p&gt;Confluence&lt;/p&gt;

&lt;p&gt;Still early — some of these are rough.&lt;/p&gt;

&lt;p&gt;What it’s NOT (yet)&lt;/p&gt;

&lt;p&gt;I want to be upfront:&lt;/p&gt;

&lt;p&gt;It’s not a replacement for your incident management tools&lt;/p&gt;

&lt;p&gt;It’s not perfect at root cause analysis&lt;/p&gt;

&lt;p&gt;It’s not “production-grade” in every edge case&lt;/p&gt;

&lt;p&gt;Right now it’s closer to:&lt;/p&gt;

&lt;p&gt;an AI layer on top of your existing tools to speed up investigation and documentation&lt;/p&gt;

&lt;p&gt;Known issues&lt;/p&gt;

&lt;p&gt;To save you time:&lt;/p&gt;

&lt;p&gt;GitHub login ❌ (bugged right now)&lt;/p&gt;

&lt;p&gt;Slack login ❌ (also bugged)&lt;/p&gt;

&lt;p&gt;👉 You can still use:&lt;/p&gt;

&lt;p&gt;Google login&lt;/p&gt;

&lt;p&gt;Email/password signup&lt;/p&gt;

&lt;p&gt;Fixing these next.&lt;/p&gt;

&lt;p&gt;What I’m trying to figure out&lt;/p&gt;

&lt;p&gt;This is where I’d really appreciate help.&lt;/p&gt;

&lt;p&gt;I’m trying to validate a few things:&lt;/p&gt;

&lt;p&gt;Does the Incident Investigator actually help or is it just “nice to have”?&lt;/p&gt;

&lt;p&gt;Are the outputs accurate enough to be trusted?&lt;/p&gt;

&lt;p&gt;Would you use something like this in real workflows?&lt;/p&gt;

&lt;p&gt;What’s missing for it to be genuinely useful?&lt;/p&gt;

&lt;p&gt;Where I want to take this&lt;/p&gt;

&lt;p&gt;Longer term, I’m thinking about moving beyond just generating outputs and more into:&lt;/p&gt;

&lt;p&gt;detecting patterns across incidents&lt;/p&gt;

&lt;p&gt;identifying unstable services&lt;/p&gt;

&lt;p&gt;highlighting teams with high escalation rates&lt;/p&gt;

&lt;p&gt;correlating deployments with incidents automatically&lt;/p&gt;

&lt;p&gt;Basically:&lt;/p&gt;

&lt;p&gt;turning incident data into something you can actually act on&lt;/p&gt;

&lt;p&gt;If you want to try it&lt;/p&gt;

&lt;p&gt;👉 &lt;a href="https://opsrift.com" rel="noopener noreferrer"&gt;https://opsrift.com&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;No pressure — even quick feedback is super helpful.&lt;/p&gt;

&lt;p&gt;Final note&lt;/p&gt;

&lt;p&gt;I’ve worked in NOC/SOC and incident-heavy environments, so this is very much a “scratch your own itch” project.&lt;/p&gt;

&lt;p&gt;That said, I’m aware tools like this can easily become:&lt;/p&gt;

&lt;p&gt;too generic&lt;/p&gt;

&lt;p&gt;inaccurate&lt;/p&gt;

&lt;p&gt;or just another dashboard nobody uses&lt;/p&gt;

&lt;p&gt;So I’d rather get honest feedback early.&lt;/p&gt;

&lt;p&gt;Even if it’s:&lt;/p&gt;

&lt;p&gt;“this doesn’t solve anything for me”&lt;/p&gt;

&lt;p&gt;That’s useful.&lt;/p&gt;

&lt;p&gt;Thanks in advance 🙌&lt;/p&gt;

</description>
      <category>sre</category>
      <category>ai</category>
      <category>monitoring</category>
      <category>devops</category>
    </item>
  </channel>
</rss>
