DEV Community

Cover image for Camunda Health Monitor v2.0 Open Source: AI-Powered Intelligence
Dmitriy Champa
Dmitriy Champa

Posted on

Camunda Health Monitor v2.0 Open Source: AI-Powered Intelligence

Remember when I shared the story of building a lightweight Camunda monitoring tool because enterprise APM solutions were overkill?

Version 2.0 just landed. And it's not just an update—it's an evolution.

🎥 See It In Action

🚀 What's New in v2.0

1. Incident Intelligence Center

The headline feature. A comprehensive command center that adapts to your Camunda configuration—whether you're running with full history logging or runtime-only mode.

What it gives you:

  • Active Incidents Dashboard - Real-time tracking with severity scoring and root cause classification
  • Recurring Issues Detection - Flags patterns that keep coming back with historical occurrence rates
  • Historical Pattern Analysis - Groups incidents by root cause across all processes
  • AI Recommendations - Prioritized action items with specific remediation steps

The game-changer? Every incident is clickable, revealing comprehensive diagnostic modals with error messages, business keys, instance IDs, and timeline data. Everything you need to move from detection to resolution in seconds.

Incident Intelligence Center

2. Weekly Load Patterns

Capacity planning transformed from guesswork into data-driven strategy.

  • Daily Distribution Analysis - See exactly where capacity is needed (e.g., Friday = 69% of weekly traffic)
  • Hourly Heatmap - Color-coded visualization of every hour of every day
  • Peak Hours Identification - System automatically finds your busiest time slots
  • Scheduling Recommendations - AI suggests optimal windows for batch jobs and maintenance

This is the feature I wish I had when planning our infrastructure scaling last year.

3. Enhanced ML Analytics

Building on the existing anomaly detection, v2.0 adds:

  • Stuck Activities Smart Detection - Statistical analysis finds tasks that are technically running but practically frozen
  • Capacity Forecasting - ML predictions for the next 30 days
  • Activity Bottleneck ROI - Calculates exact time savings from optimizing specific service tasks
  • Version Performance Tracking - Instantly detect regressions between deployments

4. Process Categorization

New automatic categorization system groups your processes by execution speed:

  • Ultra Fast (<5s)
  • Very Fast (5-30s)
  • Fast Background (<6min)
  • Standard (6-30min)
  • Extended (30min-4h)
  • Long Running (4-24h)
  • Batch/Manual (>24h)

Click into any category to see exactly which processes belong there. Perfect for architectural planning.

🎯 Why This Matters

In v1.0, I focused on solving the "what's happening right now" problem. You could see cluster health, JVM metrics, and basic process execution data.

v2.0 answers the harder questions:

  • ❓ Why are incidents recurring?
  • ❓ Which optimizations will save the most time?
  • ❓ When should we schedule maintenance?
  • ❓ Is our latest deployment performing worse?
  • ❓ Which processes are at risk of failing?

🏗️ Still Lightweight, Still Simple

Despite adding AI/ML capabilities, the core philosophy remains:

  • No build steps - Clone and run
  • <500KB footprint - Excluding Python deps
  • <100MB memory - Runs on a t2.micro
  • 5-minute setup - No configuration hell

The "heavyweight monitoring for lightweight needs" problem hasn't changed. My solution stays lean.

🌍 Ecosystem Compatibility

v2.0 maintains zero-configuration compatibility (and was successfully tested) with:

  • Camunda 7 (all versions)
  • Operaton
  • CIB Seven
  • Eximee BPMS

Your platform evolves, your monitoring layer stays stable.

🎓 What I Learned Building This

1. Start with Real Pain

The best features come from solving actual production problems.

2. AI Doesn't Mean Complex

You don't need TensorFlow to add intelligence. Statistical analysis + smart categorization + clear recommendations often beats black-box heavy ML.

3. Simple Architecture Enables Community

Keeping the codebase straightforward (Flask + Alpine.js) has led to more contributions than expected. Lower barrier = more engagement.

4. Open Source First, Enterprise Second

Building features in the open-source version first ensures they solve real problems before commercializing advanced versions.

💬 Let's Talk, don't be shy!

What monitoring challenges are you facing with Camunda?

I'm particularly curious about:

  • What metrics do you wish you could see?
  • How do you currently handle incident analysis?
  • What would make this indispensable for your team?

Drop a comment below—I read and respond to every one.


🔗 Links

GitHub logo bibacrm / camunda-health-monitor

Lightweight monitoring dashboard for Camunda 7 clusters with real-time metrics, JVM health tracking, and database analytics. Built with Flask, Alpine.js, and Tailwind CSS.

Camunda Health Monitor & Advanced AI/ML Analytics

A lightweight, real-time monitoring dashboard for Camunda 7 based BPM Platform clusters. Monitor your process engines, track performance metrics, and gain insights into your workflow automation with a modern, responsive interface.

License Docker Hub Docker Pulls Python Flask

✅ Verified Compatible With:

Camunda 7 Operaton CIB Seven eximeeBPMS

🎥 Video Demo

Watch the feature demonstration on YouTube:

Camunda Health Monitor Demo

▶️ Watch Demo: Champa Camunda 7 Health Monitor

📸 Screenshots

Full Dashboard View

Dashboard Overview Real-time cluster monitoring with comprehensive metrics

AI/ML Analytics Overview AI/ML Analysis Health & Performance Tab

AI/ML Analytics Issues AI/ML Analysis Issues & Alerts Tab

AI/ML Analytics Modal AI/ML Analysis Modal Example

Light Theme

Light Theme Clean, professional light mode interface

Node Status Monitoring

Node Down Example Immediate visual feedback when nodes become unavailable


Quick Start with Docker

https://hub.docker.com/r/champabpmn/camunda-health-monitor

  1. Create a .env file with your configuration (example for Camunda 2 nodes cluster):
# Database Configuration (PostgreSQL)
DB_NAME=PUT_YOUR_CAMUNDA_DB_NAME_HERE
DB_USER=PUT_YOUR_CAMUNDA_DB_USERNAME_HERE
DB_PASSWORD=PUT_YOUR_CAMUNDA_DB_PASS_HERE
DB_HOST=PUT_YOUR_CAMUNDA_DB_HOSTNAME_OR_IP_ADDRESS_HERE
DB_PORT=5432
# Camunda Node 1 (Required)
CAMUNDA_NODE_1_NAME=node1
CAMUNDA_NODE_1_URL=http://PUT_YOUR_CAMUNDA_BPM_1_NODE_HOST_HERE/engine-rest

# Camunda Node 2 (Optional)
CAMUNDA_NODE_2_NAME=node2
CAMUNDA_NODE_2_URL=http://PUT_YOUR_CAMUNDA_BPM_2_NODE_HOST_HERE/engine-rest

# Camunda Node 3 (Optional)
#
Enter fullscreen mode Exit fullscreen mode

Star the repo if you find it useful!

🔔 Follow Champa Intelligence for updates

💬 Share your monitoring war stories in the comments

Building better monitoring tools, one incident at a time.

Top comments (0)