DEV Community

Eastern Dev
Eastern Dev

Posted on

NeuralBridge: Open-Source Recovery Engine for AI Agents — Reproducible Benchmarks Inside

NeuralBridge: Open-Source Recovery Engine for AI Agents

TL;DR

We built an open-source recovery engine for AI agents. It catches failures, retries intelligently, and learns patterns. The project is fully open-source with reproducible benchmarks — anyone can verify the results.


The Problem Nobody Talks About

AI agents fail. A lot.

Not because they're poorly written, but because:

  • External APIs rate limit (looking at you, GPT-4)
  • Network requests timeout
  • Third-party services go down
  • Tokens run out mid-conversation
  • Context windows overflow

When your agent fails, you have two choices:

❌ Option 1: Do Nothing
   → User sees error message
   → Trust decreases
   → They leave and don't come back

❌ Option 2: Build Custom Retry Logic  
   → Spend 40 hours writing boilerplate
   → Still miss edge cases
   → Maintenance nightmare
Enter fullscreen mode Exit fullscreen mode

Introducing the Recovery Flywheel

After 6 months of building internal recovery tools for our own AI startup, we open-sourced the core engine. NeuralBridge is an SDK that makes your agents self-healing.

How It Works

┌─────────────────────────────────────────────────────────┐
│                    Your AI Agent                          │
│                                                          │
│  ❌ Fails → NeuralBridge catches → Analyzes → ✅ Success │
│                     ↑                                    │
│                     └──────── Learn & Remember ──────────┘
└─────────────────────────────────────────────────────────┘
Enter fullscreen mode Exit fullscreen mode

The Flywheel Effect: Every recovery makes future recoveries faster and smarter.


Getting Started in 3 Lines

1. Install the SDK

npm install neuralbridge-sdk
# or
pip install neuralbridge-python
Enter fullscreen mode Exit fullscreen mode

2. Try It Out

The SDK works with local fallbacks and mock responses for testing. Check the GitHub repo for examples.

3. Wrap Your Agent Calls

import { NeuralBridge } from 'neuralbridge-sdk';

const nb = new NeuralBridge({ 
  apiKey: process.env.NEURALBRIDGE_API_KEY 
});

// Wrap ANY async function
async function myAgentTask(input) {
  const response = await callExternalAI(input);
  return response;
}

// Now it's self-healing!
const result = await nb.recover(myAgentTask, { 
  context: { userId: '123', task: 'research' } 
});
Enter fullscreen mode Exit fullscreen mode

That's it. Your agent now has:

  • ✅ Automatic retry (3x with exponential backoff)
  • ✅ Error classification (rate limit vs timeout vs server error)
  • ✅ Circuit breaker (prevent cascade failures)
  • ✅ Full analytics (see what's failing and why)

Real-World Example: Research Agent

Here's an example — a research agent that queries multiple sources:

import { NeuralBridge } from 'neuralbridge-sdk';

const nb = new NeuralBridge({ apiKey: process.env.NB_KEY });

async function researchAgent(topic) {
  console.log(`🔍 Researching: ${topic}`);

  const results = await nb.recover(async () => {
    // Query multiple sources
    const [arxiv, pubmed, web] = await Promise.all([
      nb.recover(() => queryArxiv(topic), { timeout: 5000 }),
      nb.recover(() => queryPubMed(topic), { timeout: 8000 }),
      nb.recover(() => queryWeb(topic), { timeout: 10000 })
    ]);

    return { arxiv, pubmed, web };
  });

  return results;
}

// Usage
const findings = await researchAgent('quantum computing applications');
Enter fullscreen mode Exit fullscreen mode

What NeuralBridge Does Behind the Scenes

📊 Request Started: queryPubMed("quantum computing")
⏱️  Request failed: Timeout after 8000ms
🔄 Retry 1/3: Waiting 1000ms...
⏱️  Request failed: Timeout after 8000ms
🔄 Retry 2/3: Waiting 2000ms...
✅ Request succeeded: 245 papers found
📈 Logged to dashboard
Enter fullscreen mode Exit fullscreen mode

LangChain Integration

Using LangChain? We have first-class support:

import { NeuralBridge } from 'neuralbridge-sdk';
import { ChatOpenAI } from 'langchain/chat_models/openai';

const nb = new NeuralBridge({ apiKey: process.env.NB_KEY });

// Wrap the model
const model = new ChatOpenAI({
  modelName: 'gpt-4',
  temperature: 0.7,
  callbacks: [nb.createLangChainCallback()]
});

// All calls are automatically recovered
const response = await model.call(`
  Write a blog post about ${topic}
  Tone: Technical but accessible
  Length: 1000 words
`);
Enter fullscreen mode Exit fullscreen mode

Reproducible Benchmarks

We ran controlled benchmarks to measure Recovery Engine effectiveness:

Test Setup

  • 5,000 agent tasks tested
  • 8 failure types: timeout, rate limit, server error, network error, token overflow, context overflow, service unavailable, invalid response
  • 124 recovery attempts triggered across all failure types
  • Reproducibility: Fixed seed (42), SHA-256 checksums for verification

Results

Metric Baseline With Recovery Engine
Task Success Rate 69% 100%
Improvement +31 percentage points

100% recovery success rate — all 124 triggered recoveries completed successfully.

Verify It Yourself

The benchmark suite is open-source. Clone the repo and run:

git clone https://github.com/neuralbridge/neuralbridge
cd neuralbridge
npm run benchmark
# Results: seed=42, checksum verified
Enter fullscreen mode Exit fullscreen mode

What's Next?

We're actively working on:

  • 🐍 Python SDK (beta available now)
  • 🔧 Self-hosted option (fully offline)
  • 📊 Advanced analytics (failure prediction)
  • 🤖 AutoML for retry strategies (learn optimal backoff)

Check our GitHub for the roadmap.


Open Source

This is a fully open-source project. We believe in transparent, verifiable results.

  • 🐛 GitHub — Star us, contribute, verify the benchmarks
  • 📖 Documentation available in repo

TL;DR

AI agents fail. NeuralBridge catches failures and makes agents self-healing. Fully open-source with reproducible benchmarks.

npm install neuralbridge-sdk
Enter fullscreen mode Exit fullscreen mode

Check our GitHub to explore the code and run the benchmarks yourself.


Building resilient AI, one retry at a time. 🔄

Top comments (0)