DEV Community

Cover image for PeeperFrog Create
Heather Scott (PeeperFrog)
Heather Scott (PeeperFrog)

Posted on

PeeperFrog Create

AI Integrated Image Generation for Content Creators

An open-source MCP server with Claude Skills that automates AI image generation across Gemini, OpenAI, and Together AI - from prompt to WordPress

Robots playing poker

Content creation has a bottleneck problem. You need images. Lots of them. Each one requires choosing a provider, crafting prompts, managing costs, converting formats, and uploading to your CMS. What if your AI assistant could handle all of that, right on your desktop?

I built PeeperFrog Create to solve this - an open-source MCP (Model Context Protocol) server that gives Claude (by Anthropic) access to three major AI image providers with intelligent routing, batch workflows, and direct WordPress publishing. Here's how it works and why it matters.

The Content Creator's Dilemma

When I started running newsletters, I faced a recurring problem:

  1. Write article ✅
  2. Research image requirements
  3. Choose provider (Gemini? DALL-E? FLUX?)
  4. Craft a prompt for that provider's strengths
  5. Generate image
  6. Download, convert to WebP
  7. Upload to WordPress
  8. Add to the article

Steps 2-8 took longer than writing the actual content. Multiply this across multiple articles per week, and you're spending hours on image pipeline management instead of creating.

Enter PeeperFrog Create

PeeperFrog Create is an MCP server that connects Claude directly to AI image generation services. But unlike single-provider solutions, it offers:

  • Multi-provider support: Gemini Pro, OpenAI DALL-E, Together AI (FLUX models)
  • Auto mode: The MCP server picks the best provider based on your budget and needs
  • Full control mode: Claude picks the best provider and controls all the parameters
  • Batch workflows: Queue multiple images, review, and generate them in one run (cost-effective)
  • Cost estimation and tracking: Know the approximate price before generating, and log it afterward
  • Skills system: Teach Claude best practices for each provider
  • WebP conversion: Automatic optimization for web delivery
  • WordPress integration: Direct upload to media library

Here's what that looks like in practice:

MCP Flow

The Architecture: Why MCP Changes Everything

Model Context Protocol is Anthropic's open standard for connecting AI models to external tools. Instead of context switching between applications, MCP lets Claude call tools directly within the conversation.

Traditional workflow:

You → Claude → Copy prompt → Open DALL-E → Generate → Download → Covert → Upload
Enter fullscreen mode Exit fullscreen mode

With MCP:

You → Claude → [MCP handles generation, conversion, upload] → Done
Enter fullscreen mode Exit fullscreen mode

The server acts as a bridge between Claude and image generation APIs:

Auto Mode: The Secret Weapon

Here's where PeeperFrog Create gets interesting. Each provider has different strengths:

  • Gemini Pro: Best for complex compositions, reference images, search grounding
  • OpenAI DALL-E: Excellent photorealism, reliable text rendering
  • Together FLUX: Cost-effective, fast iterations, artistic styles

Instead of manually choosing, auto mode analyzes your request:

Auto Image Mode

Example conversation:

You: "Create a professional infographic showing AI cost trends using peeperfrog-create."

Claude: "I'll create a professional infographic showing AI cost trends for you. Let me generate this using the image generation system."

Generation details:

Provider: Gemini Pro
Resolution: 2K (16:9 aspect ratio)
Cost: $0.14 USD

You: "Convert it to WebP."

Conversion results:

✓ Original PNG: 3.1 MB
✓ WebP version: 319 KB
✓ 89.7% file size reduction
✓ Quality: 85 (high quality retained)

The actual image this prompt created:

Example Output

Five auto modes cover every scenario:

  • cheapest (max $0.003/MP): Minimize cost — dreamshaper, flux1-schnell
  • budget (max $0.01/MP): Decent quality, low cost — hidream-fast, juggernaut-pro
  • balanced (max $0.04/MP): Production use, good quality/cost — seedream3, flux2-dev, flux2-pro, imagen4
  • quality (max $0.08/MP): Premium quality — ideogram3, imagen4-ultra, flux1-kontext-max
  • best (no limit): Maximum quality — Gemini Pro, OpenAI Pro

Real-World Example: Robot Poker Scene

Let me show you what reference images can do. I wanted to create a promotional image for this article showing different robot designs playing poker. Here are the reference robots:

These are real commercial humanoid robots: Boston Dynamics' Atlas, various research platforms, Unitree's H1, and others. Each was an individual photo. I wanted an image that maintained their distinct designs while composing them into a coherent scene.

Using Gemini Pro with reference images:

# In the conversation with Claude:
"Create an image of these five robots playing poker."
Enter fullscreen mode Exit fullscreen mode

Result:

5 robots playing poke

Claude wrote the prompt and selected the model based on the need for reference images. Gemini Pro analyzed all five robots, understood their proportions and aesthetics, and composed them into a coherent scene with proper lighting, atmosphere, and context. This is the power of reference images - you get consistency across generated content while maintaining specific design requirements.

Try that with prompt-only generation, and you'll spend hours iterating to get five consistent robot designs that feel like they belong in the same universe.

The Skills System: Teaching Claude Best Practices

MCP provides the tools. Skills teach Claude how to use them effectively.

Each skill is a markdown file that guides Claude through specific workflows:

Available Skills

Core Image Generation:

  • image-generation: Overview of all tools and workflows
  • image-auto-mode: When to use auto mode vs manual control
  • image-manual-control: Advanced provider-specific options
  • image-queue-management: Batch workflow best practices
  • cost-estimation: Budget planning and provider comparison

Publishing Pipeline:

  • webp-conversion: Web optimization strategies
  • wordpress-upload: CMS integration patterns

Creative Guidance:

  • graphic-prompt-types: Reference guide for visual styles
  • example-brand-image-guidelines: Template for brand consistency

Skills work in both Claude Desktop (GUI) and Claude Code (CLI). Once installed, Claude automatically applies relevant knowledge without you needing to remember command syntax or provider limitations.

Example: You ask for "five images for a newsletter about quantum computing." When using the skills together, Claude can:

  1. Check image-queue-management skill for best practices
  2. Add five prompts to the batch queue
  3. Use cost-estimation to show the total cost before generating
  4. Wait for your approval
  5. Generate all images in one batch run (reducing cost)
  6. Convert to WebP when you request optimization
  7. Upload to WordPress when you request publishing

Batch Workflows: Production at Scale

The batch system transforms how you handle multiple images:

Image Batch System

The Problem:
Generate five images individually = five separate API calls, five interruptions, five manual downloads, five uploads. About 20 minutes of context switching. All at full price.

The Solution:

1. Queue all five images with prompts
2. Review queue and estimated costs
3. Generate all in one batch (Gemini cost cut in half)
4. Convert all to WebP
5. Upload all to WordPress in bulk
Enter fullscreen mode Exit fullscreen mode

Time: Under 5 minutes. Cost: Optimized through provider selection and batch processing. Mental overhead: Minimal.

Real example from my workflow:

Newsletter: "The $0.50 Intelligence Revolution."
Images needed:
- Hero image: Cost decline chart (Gemini, text/infographic)
- Figure 1: Timeline diagram (Gemini, complex layout)
- Figure 2: Comparison table (OpenAI, clear text)
- Social: Square version (Together FLUX, artistic)
- Thumbnail: Simplified hero (Together FLUX, fast)

Total cost: ~$0.40-0.45
Generation time: ~3 minutes
Manual time saved: ~25 minutes
Enter fullscreen mode Exit fullscreen mode

The WordPress Pipeline: From Prompt to Published

The final piece - direct publishing:

WordPress Image Publishing

Process:

  1. Generate image (PNG format from provider)
  2. Convert to WebP (90-94% size reduction for web optimization)
  3. Upload to WordPress via REST API
  4. Title set from filename; alt text/caption added manually in WordPress
  5. Return media ID for insertion in posts

Configuration:

{
  "wordpress": {
    "https://yourblog.com": {
      "username": "your-username",
      "password": "your-app-password"
    }
  }
}
Enter fullscreen mode Exit fullscreen mode

One API call uploads your optimized images directly to your media library, ready to insert into posts.

Cost Comparison: Why Multi-Provider Matters

The price difference between providers is massive - up to 400x difference between the cheapest and most expensive options, and batch processing can save you 50% on the most expensive Gemini Pro images:

Model Cost Comparison

Gemini Pro Image ($0.134-0.24/image):

  • Up to 14 reference images (unique capability)
  • Search grounding for factually accurate images
  • Thinking levels for quality control
  • Up to 4K resolution (4096×4096)
  • Pricing: $0.134 for 2K, $0.24 for 4K
  • Batch API available: 50% discount ($0.067 for 2K, $0.12 for 4K)

OpenAI DALL-E / gpt-image-1 ($0.01-0.17/image):

  • Three quality tiers: Low ($0.01), Medium ($0.04), High ($0.17)
  • Superior photorealism at medium/high tiers
  • Excellent text rendering
  • Consistent style generation
  • Industry-standard quality

Together AI FLUX ($0.0027-0.08/MP):

  • FLUX.1 Schnell: $0.0027/MP (fastest, most cost-effective)
  • FLUX.1 Dev: $0.025/MP (balanced quality/cost)
  • FLUX.1.1 Pro: $0.04/MP (premium quality)
  • FLUX.1 Kontext Max: $0.08/MP (highest quality with editing)
  • Fast iterations for creative exploration
  • Artistic and illustration strengths

Auto mode considers all these factors when routing your request. Budget constraints? Style requirements? Resolution needs? It handles the decision automatically.

Real Production Metrics

Time Savings (from my workflow):

  • Old workflow: ~30 minutes per article (5 images avg)
  • New workflow: ~5 minutes per article
  • Savings: ~83% reduction in image production time
  • Your results may vary based on complexity and iteration needs

Cost Efficiency:
Newsletter with 5 diverse images using auto mode balanced:

  • Hero (OpenAI medium): $0.04
  • Infographic (ideogram3): $0.06
  • Photo (seedream3): $0.02
  • Social (flux1-schnell): $0.003
  • Thumbnail (dreamshaper): $0.001
  • Total: ~$0.173 via smart routing

Compare to uniform provider costs:

  • Same 5 images all Gemini Pro 2K immediate: $0.67 (5 × $0.134)
  • Same 5 images all Gemini Pro 2K batched: $0.335 (5 × $0.067)
  • Same 5 images all OpenAI medium: $0.20 (5 × $0.04)
  • Savings: 60-75% through multi-provider optimization and batching

Quality Improvements:

  • Reference images maintain brand consistency
  • Batch review prevents mistakes before generation
  • Cost visibility prevents budget overruns
  • Skills reduce prompt iteration cycles

Installation: 5 Minutes to Running

Prerequisites:

- Python 3.8+
- Claude Desktop or Claude Code
- API keys (one or more): Gemini, OpenAI, Together AI
Enter fullscreen mode Exit fullscreen mode

Setup:

# Clone repository
git clone https://github.com/PeeperFrog/peeperfrog-create.git
cd peeperfrog-create/peeperfrog-create-mcp

# Configuration
cp config.json.example config.json
cp .env.example .env

# Install dependencies
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install requests
Enter fullscreen mode Exit fullscreen mode

Configure MCP Client:

Find your settings file:

  • Claude Code: ~/.claude/settings.json
  • Claude Desktop (Linux): ~/.config/Claude/claude_desktop_config.json
  • Claude Desktop (macOS): ~/Library/Application Support/Claude/claude_desktop_config.json
  • Claude Desktop (Windows): %APPDATA%\Claude\claude_desktop_config.json

Add the server:

{
  "mcpServers": {
    "peeperfrog-create": {
      "command": "/path/to/peeperfrog-create/peeperfrog-create-mcp/venv/bin/python3",
      "args": ["/path/to/peeperfrog-create/peeperfrog-create-mcp/src/image_server.py"],
      "env": {
        "GEMINI_API_KEY": "your-key",
        "OPENAI_API_KEY": "your-key",
        "TOGETHER_API_KEY": "your-key"
      }
    }
  }
}
Enter fullscreen mode Exit fullscreen mode

Restart Claude. Done.

Install Skills:

For Claude Desktop: Settings > Capabilities > Skills > Upload each SKILL.md from the skills/ folder.

For Claude Code:

cp -r skills/* ~/.claude/skills/
Enter fullscreen mode Exit fullscreen mode

Use Cases Beyond Newsletters

While I built this for newsletter production, the system works for:

Content Marketing:

  • Blog hero images
  • Social media graphics
  • Email campaign visuals
  • Landing page assets

Documentation:

  • Technical diagrams
  • Architecture visualizations
  • Process flowcharts
  • Tutorial illustrations

E-commerce:

  • Product mockups
  • Lifestyle photography
  • Promotional graphics
  • Brand assets

Creative Projects:

  • Concept art
  • Storyboarding
  • Character design
  • World-building

The key: any workflow where you need multiple AI-generated images with consistent quality, controlled costs, and efficient delivery.

The Open Source Advantage

PeeperFrog Create is Apache 2.0 licensed. This means:

For individuals:

  • Use it for free forever
  • Modify for your needs
  • No vendor lock-in

For teams:

  • Deploy on your infrastructure
  • Customize provider routing
  • Add your own workflows
  • Integrate with existing tools

For developers:

  • Extend with new providers
  • Build custom Skills
  • Contribute improvements
  • Fork for specialized needs

The codebase is Python. The skills are markdown files. No complex dependencies. Easy to understand, easier to modify.

What's Next

Current development roadmap:

Near-term:

  • Image editing capabilities (inpainting, outpainting)
  • More Claude skills
  • Social Media Connections
  • Template system for common layouts
  • Additional provider support (Replicate, Stability AI)

Medium-term:

  • Video generation integration
  • Animation and motion graphics
  • Multi-image composition tools
  • Advanced cost management

Long-term:

  • Local model support
  • Fine-tuning integration
  • Custom model hosting
  • Enterprise collaboration features

Want to contribute? Issues and PRs welcome at github.com/PeeperFrog/peeperfrog-create.

The Bigger Picture: AI-Assisted Workflows

This project represents a larger shift in how we work with AI tools. The traditional model - AI as chatbot - is giving way to AI as workflow participant.

Old model:

Human → Think of task → Do task → AI assists with parts
Enter fullscreen mode Exit fullscreen mode

New model:

Human → Describe outcome → AI orchestrates entire workflow → Human reviews
Enter fullscreen mode Exit fullscreen mode

MCP enables this transition. Instead of Claude generating text that you copy-paste into tools, Claude directly operates tools based on conversation context.

Image generation is just one domain. The same pattern applies to:

  • Data analysis and visualization
  • Code generation and testing
  • Research and summarization
  • Content publishing and distribution
  • Project management and tracking

PeeperFrog Create proves the concept works. Your AI assistant can manage multi-provider services, handle complex workflows, optimize costs, and deliver production-ready results - all from a conversation.

Try It Today

  1. Install the MCP server (5 minutes)
  2. Add one API key (any provider)
  3. Install the Skills (3 minutes)
  4. Ask Claude to generate an image

That's it. No tutorials needed. The Skills teach Claude how to use the tools effectively. Auto mode handles the complex decisions. Batch workflows scale your production.

Within 10 minutes, you'll have an AI-powered image pipeline that would take days to build manually.

Repository: github.com/PeeperFrog/peeperfrog-create

Documentation: Full docs in the repo README and individual Skills

Community: Issues, discussions, and PRs welcome


Running newsletters? Content marketing? Building a CMS? Try **PeeperFrog Create* and cut your image costs in half and production time by 80%. Apache 2.0 licensed, free forever.*

Top comments (0)