CLI-Based AI Agent: Tool Calling with CLI

A Technical Deep Dive into CLI Chaining, Pipelines, and Workflow Patterns

📋 Stop Over-Engineering Your AI Agents

The industry is rushing to wrap every simple tool in a heavyweight service protocol, a sidecar container, or a JSON-RPC handshake. Meanwhile, the most performant orchestration layer for local workflows sits ignored: direct process execution.

The code for this article is available at https://github.com/vishalmysore/cli-vs-mcp

This article explores CLI orchestration as a complement to distributed protocols like MCP. Not as a replacement, but as a boundary-aware decision: use CLI for local, stateless operations; use protocols for remote, stateful services.

When CLI Makes Sense:

  • All tools run on the same machine
  • Stateless data transformations
  • Latency-sensitive workflows (<50ms)
  • Existing command-line tools need integration

When It Doesn't:

  • Tools span multiple machines
  • Stateful connections (databases, long-lived sessions)
  • Dynamic service discovery required
  • Network authentication necessary

The problem isn't MCP. It's protocol over-application: wrapping trivial file operations in HTTP calls because that's what we know.


🎯 Motivation: The Tool Orchestration Spectrum

The Three Paradigms

| Approach | Best For | Trade-offs |
| --- | --- | --- |
| Direct CLI Execution | Single-purpose tasks, legacy integration | Limited composability |
| CLI Chaining & Pipelines | Multi-stage data workflows, batch processing | Requires careful output formatting |
| Service Protocols (MCP/REST) | Distributed systems, long-running services | Network overhead, complexity |

This project explores the middle ground of CLI composition patterns, demonstrating that significant architectural sophistication can be achieved without crossing the network boundary.


πŸ—οΈ Architecture Overview

System Components

┌──────────────────────────────────────────────────────────────┐
│                  AI AGENT (Tools4AI)                         │
│  • Intent Recognition  • Workflow Orchestration              │
│  • Error Handling      • Parallel Execution                  │
└────────────┬─────────────────────────────────────────────────┘
             │
             ├─── [Data Source CLIs]
             │    • fetch_customers.cmd
             │    • fetch_transactions.cmd
             │    • fetch_metrics.cmd
             │
             ├─── [Processor CLIs]
             │    • filter_by.cmd (filtering)
             │    • transform_data.cmd (enrich)
             │
             ├─── [Aggregator CLIs]
             │    • count_by.cmd (grouping)
             │    • calculate_stats.cmd (stats)
             │
             └─── [Workflow CLIs]
                  • workflow_customer_analysis
                  • workflow_transaction_analytics

Design Principles

  1. Single Responsibility: Each CLI does one thing exceptionally well
  2. Structured Output: Consistent data formats (pipe-delimited) for chainability
  3. Composition Over Monoliths: Complex workflows emerge from simple building blocks
  4. Fail Fast: Clear error messages, non-zero exit codes for failures
  5. Observability: Each stage logs its operations for debugging
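To make principle 2 (pipe-delimited structured output) concrete, here is a minimal parsing sketch in Java. The `ID|NAME|TIER` field layout is a hypothetical illustration, not the repo's actual schema:

```java
public class PipeRecord {
    public static void main(String[] args) {
        // A hypothetical pipe-delimited line, as a data-source CLI might emit it
        String line = "C001|Alice|GOLD";

        // Split on the literal pipe character (escaped, since | means alternation in regex)
        String[] fields = line.split("\\|");

        System.out.println("id=" + fields[0] + " name=" + fields[1] + " tier=" + fields[2]);
        // prints id=C001 name=Alice tier=GOLD
    }
}
```

Because every stage emits the same flat format, any processor or aggregator can consume any data source's output without a schema negotiation step.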

🔗 Pattern: Sequential Pipeline

Concept

Chain CLIs by connecting stdout to stdin. Data flows through transformation stages without intermediate network calls.

Example Workflow:

fetch_customers → filter(TIER=GOLD) → count_by(TIER)

Linux/macOS:

fetch_customers | filter_by TIER EQUALS GOLD | count_by TIER

Windows (Tools4AI ProcessBuilder):

ProcessBuilder stage1 = new ProcessBuilder("fetch_customers");
String data = captureOutput(stage1.start());

ProcessBuilder stage2 = new ProcessBuilder("filter_by", "TIER", "EQUALS", "GOLD");
stage2.redirectInput(Redirect.from(createTempFile(data)));
String filtered = captureOutput(stage2.start());

Reality Check: This uses temp files, not true piping. On Windows, batch scripts don't support stdin redirection cleanly. It works, but it's not elegant.
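The snippets in this article lean on two helpers, `captureOutput` and `createTempFile`, that are never shown. A minimal sketch of what they might look like (the names match the article's usage; error handling is kept deliberately thin):

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

public class CliHelpers {

    // Read a process's stdout fully, then wait for it to exit (fail fast on non-zero exit)
    static String captureOutput(Process p) throws IOException, InterruptedException {
        String out = new String(p.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        int exit = p.waitFor();
        if (exit != 0) {
            throw new IOException("CLI exited with code " + exit);
        }
        return out;
    }

    // Write intermediate pipeline data to a temp file so the next stage can read it as stdin
    static File createTempFile(String data) throws IOException {
        File f = File.createTempFile("cli-stage-", ".txt");
        f.deleteOnExit(); // clean up when the orchestrator JVM exits
        Files.writeString(f.toPath(), data, StandardCharsets.UTF_8);
        return f;
    }

    public static void main(String[] args) throws Exception {
        File f = createTempFile("TIER|COUNT\nGOLD|42\n");
        System.out.println(Files.readString(f.toPath()));
    }
}
```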

When This Makes Sense

Good fit:

  • Local data transformations (parsing logs, filtering CSVs)
  • Existing CLI tools you can't modify
  • Latency matters (no network round-trip)

Bad fit:

  • Need distributed execution
  • Stateful operations (database connections)
  • Requires sophisticated error recovery

🌊 Pattern: Conditional Workflows

Concept

Dynamic workflow branching based on CLI output analysis: implementing decision trees at the orchestration layer.

Implementation: DevOps Log Monitoring

// Stage 1: Fetch recent application logs
ProcessBuilder logFetch = new ProcessBuilder("fetch_logs", "--last=1h");
String logs = captureOutput(logFetch.start());

// Stage 2: Extract error patterns
ProcessBuilder errorExtract = new ProcessBuilder("extract_errors");
errorExtract.redirectInput(Redirect.from(createTempFile(logs)));
String errors = captureOutput(errorExtract.start());

// Stage 3: Count critical vs warning errors
int criticalCount = countOccurrences(errors, "CRITICAL");
int warningCount = countOccurrences(errors, "WARNING");

// Stage 4: Branch on severity thresholds
if (criticalCount > 10 || warningCount > 100) {
    // Alert path: Page on-call engineer
    executeCli("alert_oncall", "--severity=HIGH");

    // Deep dive: Root cause analysis
    ProcessBuilder analysis = new ProcessBuilder("analyze_error_patterns");
    analysis.redirectInput(Redirect.from(createTempFile(errors)));
    String rootCause = captureOutput(analysis.start());

    // Escalate with context
    // WARNING: In production, sanitize rootCause before passing to CLI
    // (shell injection risk). Use temp files or proper escaping.
    executeCli("create_incident", "--details=" + rootCause);
} else {
    // Normal path: Generate routine report
    executeCli("generate_health_report", "--status=healthy");
}
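The conditional workflow above also calls `countOccurrences` and `executeCli`, which aren't defined in the article. One plausible sketch (treat the signatures as assumptions):

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;

public class WorkflowHelpers {

    // Count lines containing a severity token, e.g. "CRITICAL" or "WARNING"
    static int countOccurrences(String text, String token) {
        int count = 0;
        for (String line : text.split("\\R")) { // \R matches any line break
            if (line.contains(token)) {
                count++;
            }
        }
        return count;
    }

    // Run a CLI with arguments and return its stdout; fails fast on non-zero exit
    static String executeCli(String... command) throws IOException, InterruptedException {
        Process p = new ProcessBuilder(command).start();
        String out = new String(p.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        if (p.waitFor() != 0) {
            throw new IOException("CLI failed: " + String.join(" ", command));
        }
        return out;
    }

    public static void main(String[] args) {
        String logs = "CRITICAL db down\nWARNING slow query\nCRITICAL oom\n";
        System.out.println(countOccurrences(logs, "CRITICAL")); // prints 2
    }
}
```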

Key Insight

The orchestrator acts as a decision engine, analyzing intermediate outputs and routing execution flow without LLM intervention for every branch. This keeps latency low while preserving intelligent behavior.


⚖️ The Layered Intelligence Model: CLI + MCP Hybrid

The Nuanced Position

The goal isn't to kill MCP; it's to use the right tool for the right boundary.

Most agent architectures suffer from protocol over-application: wrapping trivial file operations in HTTP calls, or forcing stateless transforms through stateful service layers. This creates artificial bottlenecks.

Proposed Architecture: Boundary-Aware Orchestration

┌─────────────────────────────────────────────────────┐
│  AI AGENT (Tools4AI LLM Orchestrator)               │
├─────────────────────────────────────────────────────┤
│                                                     │
│  🔹 THE CORE (CLI Layer)                            │
│     • High-speed stateless transformations          │
│     • Local file I/O and system calls               │
│     • Data parsing, filtering, aggregation          │
│     • Process orchestration (parallel execution)    │
│     ⚡ Latency: 1-50ms per operation                │
│                                                     │
│  🔸 THE EDGE (MCP/Protocol Layer)                   │
│     • Stateful database connections (pooling)       │
│     • Authenticated third-party APIs (OAuth)        │
│     • Long-running distributed services             │
│     • WebSocket streams and pub/sub                 │
│     ⚡ Latency: 50-500ms per operation              │
│                                                     │
└─────────────────────────────────────────────────────┘

Decision Framework

If it runs on the same machine and needs no persistent state, CLI. Otherwise, MCP.
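The decision framework above can be sketched as a simple routing predicate. Everything here (the `ToolCall` record, the field names) is a hypothetical illustration, not part of Tools4AI:

```java
public class BoundaryRouter {

    // Minimal descriptor for a candidate tool invocation (hypothetical)
    record ToolCall(String name, boolean runsLocally, boolean needsPersistentState) {}

    enum Transport { CLI, MCP }

    // The article's rule: same machine + no persistent state => CLI; otherwise MCP
    static Transport route(ToolCall call) {
        if (call.runsLocally() && !call.needsPersistentState()) {
            return Transport.CLI;
        }
        return Transport.MCP;
    }

    public static void main(String[] args) {
        System.out.println(route(new ToolCall("extract_errors", true, false)));  // prints CLI
        System.out.println(route(new ToolCall("query_database", false, true))); // prints MCP
    }
}
```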

Real-World Hybrid Workflow

Example: Log Analysis Agent

1. CLI: fetch_logs → extract_errors (local, <10ms)
2. MCP: query_database → check_known_issues (remote, 150ms)
3. CLI: generate_report → write_to_disk (local, <5ms)
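A skeleton of how these three steps might interleave in the orchestrator. Both helper methods here are placeholders: `queryKnownIssues` stands in for whatever MCP client call the agent makes, and `runLocalCli` stands in for a ProcessBuilder invocation; neither is a real Tools4AI or MCP API:

```java
public class HybridWorkflow {

    // Placeholder for a local CLI step (real code would spawn the CLI via ProcessBuilder)
    static String runLocalCli(String name, String input) {
        return "[" + name + " output for " + input.length() + " bytes]";
    }

    // Placeholder for the remote MCP step (real code would use an MCP client here)
    static String queryKnownIssues(String errorSignature) {
        return "no known issue for: " + errorSignature; // stubbed response
    }

    public static void main(String[] args) {
        // 1. CLI (local): distill raw logs without shipping them over the network
        String errors = runLocalCli("extract_errors", "raw log data...");

        // 2. MCP (remote): stateful lookup against the known-issues database
        String knownIssues = queryKnownIssues(errors);

        // 3. CLI (local): render the final report; only this summary reaches the LLM
        String report = runLocalCli("generate_report", knownIssues);
        System.out.println(report);
    }
}
```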

Why This Works:

  • Heavy lifting stays local: Parsing 100MB of logs via CLI avoids network transfer
  • Stateful queries stay remote: Database connection pooling requires long-lived MCP server
  • LLM context preserved: Only final analysis sent to LLM, not raw logs

The Strategic Insight

By offloading the "fast" operations to CLI chains, you:

  1. Preserve LLM context window (less data serialization)
  2. Reduce failure surface area (fewer network hops)
  3. Achieve lower tail latencies (no protocol handshake tax)

MCP isn't the enemy; protocol over-application is.


🛠️ Implementation Example

Project Structure

cli-vs-mcp/
├── pom.xml                          # Maven configuration
├── cli/
│   ├── datasource/
│   │   ├── fetch_customers          # Customer data source
│   │   ├── fetch_transactions       # Transaction data source
│   │   └── fetch_metrics            # Metrics data source
│   ├── processors/
│   │   ├── filter_by                # Generic filter processor
│   │   └── transform_data           # Data enrichment processor
│   ├── aggregators/
│   │   ├── count_by                 # Counting aggregator
│   │   └── calculate_stats          # Statistical aggregator
│   └── workflows/
│       ├── workflow_customer_analysis
│       └── workflow_transaction_analytics
├── src/main/
│   ├── java/.../AdvancedCliOrchestrator.java
│   └── resources/
│       ├── shell_actions.yaml       # CLI action registry
│       ├── skills.json              # LLM skill definitions
│       └── tools4ai.properties      # Framework configuration

Core Technologies

Framework: Tools4AI 1.1.9.9

  • Provides LLM-driven CLI selection
  • Manages process lifecycle
  • Handles parameter extraction

Language: Java 18 (orchestrator), Batch scripts (CLIs)

LLM Integration: OpenAI GPT-4 (configurable)

Execution Model: Process-per-CLI with output capture

Sample Code: Executing CLI Chain

// Execute a 3-stage pipeline (cross-platform)
ProcessBuilder stage1 = new ProcessBuilder("fetch_customers");
Process p1 = stage1.start();
String data = captureOutput(p1);

// Stage 2: Filter
ProcessBuilder stage2 = new ProcessBuilder("filter_by", "TIER", "EQUALS", "GOLD");
stage2.redirectInput(ProcessBuilder.Redirect.from(createTempFile(data)));
Process p2 = stage2.start();
String filtered = captureOutput(p2);

// Stage 3: Aggregate
ProcessBuilder stage3 = new ProcessBuilder("count_by", "TIER");
stage3.redirectInput(ProcessBuilder.Redirect.from(createTempFile(filtered)));
Process p3 = stage3.start();
String result = captureOutput(p3);

System.out.println(result);
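On Java 9+, `ProcessBuilder.startPipeline` can connect the stages through real OS pipes, avoiding the temp-file workaround entirely. A sketch (the CLI names are the article's hypothetical examples and must exist on the PATH):

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

public class PipelineDemo {

    // Run a chain of stages connected by OS pipes; return the last stage's stdout
    static String runPipeline(List<ProcessBuilder> stages) throws Exception {
        // startPipeline wires each stage's stdout to the next stage's stdin (Java 9+)
        List<Process> procs = ProcessBuilder.startPipeline(stages);
        Process last = procs.get(procs.size() - 1);
        String out = new String(last.getInputStream().readAllBytes(), StandardCharsets.UTF_8);
        last.waitFor();
        return out;
    }

    public static void main(String[] args) throws Exception {
        // The article's 3-stage example, expressed as one pipeline
        String result = runPipeline(List.of(
                new ProcessBuilder("fetch_customers"),
                new ProcessBuilder("filter_by", "TIER", "EQUALS", "GOLD"),
                new ProcessBuilder("count_by", "TIER")));
        System.out.println(result);
    }
}
```

This removes the intermediate disk I/O, though on Windows the individual stages must still be real executables or `cmd /c` wrappers rather than bare batch file names.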

🧪 Running the Demo

Prerequisites

# Java 18+
java -version

# Maven 3.8+
mvn -version

# OpenAI API Key (or use Ollama for local LLM)
export OPENAI_API_KEY="your-key-here"

Build & Run

# Clone and navigate
cd cli-vs-mcp

# Build project
mvn clean compile

# Run agent (interactive mode)
mvn exec:java

# Or run demo showcase
cmd /c cli\demo_showcase.cmd

📝 Summary

CLI orchestration isn't a replacement for distributed protocols. It's a boundary-aware choice.

Use it for: Local, stateless operations where latency matters and existing tools work.

Don't use it for: Distributed systems, stateful services, or when you're forcing it.

The fastest code is often the simplest code. But only when it actually solves your problem.


🎯 The Question

Before you reach for a service protocol, ask:

  • Does this need to run on a different machine?
  • Does it require stateful connections?
  • Is the overhead justified?

If "no" to all three, consider CLI.


Built with Tools4AI by Vishal Mysore

March 2026

