ImaginePro

Posted on Aug 6 • Edited on Aug 8

Complete Guide to Use Midjourney API for Text-to-Image Generation

Building a Text-to-Image Tool with Midjourney API

Introduction

Midjourney has revolutionized the world of AI-generated imagery with its powerful text-to-image capabilities. While Midjourney is primarily known for its Discord bot interface, developers can now integrate its image generation capabilities directly into their applications using APIs. This guide will walk you through everything you need to know to build a text-to-image tool using Midjourney's API.

Prerequisites

Before diving into the implementation, ensure you have:

Basic knowledge of JavaScript/TypeScript
Node.js installed on your system
An API key
Understanding of REST APIs and asynchronous programming

Getting Started

1. Installation

First, install the necessary SDK. For this guide, we'll use the ImaginePro SDK which provides a clean interface for Midjourney API interactions:

npm install imaginepro

2. Basic Setup

Create a new project and set up the basic configuration:

import ImagineProSDK from 'imaginepro';

const midjourneyClient = new ImagineProSDK({
    apiKey: 'your-imaginepro-api-key',
});

Core Functionality

Text-to-Image Generation

The primary feature is generating images from text prompts:

async function generateImage(prompt) {
    try {
        // Initiate image generation
        const response = await midjourneyClient.imagine({
            prompt: prompt,
        });

        console.log('Generation started:', response.messageId);

        // Wait for completion and get result
        const result = await midjourneyClient.fetchMessage(response.messageId);

        return result;
    } catch (error) {
        console.error('Generation failed:', error);
        throw error;
    }
}

// Usage example
const imageResult = await generateImage('a majestic dragon flying over a medieval castle at sunset');
console.log('Generated image URL:', imageResult.uri);

Advanced Prompt Engineering

Midjourney responds well to detailed, descriptive prompts. Here are some best practices:

// Good prompt structure
const goodPrompt = 'a photorealistic portrait of a wise old wizard, dramatic lighting, intricate details, 8k resolution, cinematic composition';

// Include style modifiers
const styledPrompt = 'a futuristic cityscape, cyberpunk aesthetic, neon lights, rain-slicked streets, cinematic lighting';

// Specify aspect ratios and quality
const detailedPrompt = 'a serene mountain landscape at golden hour, 16:9 aspect ratio, high quality, detailed textures';

Image Manipulation Features

Upscaling Images

Enhance the resolution of generated images:

async function upscaleImage(messageId, index = 1) {
    try {
        const result = await midjourneyClient.upscale({
            messageId: messageId,
            index: index // Corresponds to U1, U2, U3, U4 buttons
        });

        return result;
    } catch (error) {
        console.error('Upscaling failed:', error);
        throw error;
    }
}

Generating Variants

Create alternative versions of existing images:

async function createVariant(messageId, index = 1) {
    try {
        const result = await midjourneyClient.variant({
            messageId: messageId,
            index: index // Corresponds to V1, V2, V3, V4 buttons
        });

        return result;
    } catch (error) {
        console.error('Variant generation failed:', error);
        throw error;
    }
}

Rerolling Images

Regenerate images with the same prompt:

async function rerollImage(messageId) {
    try {
        const result = await midjourneyClient.reroll({
            messageId: messageId
        });

        return result;
    } catch (error) {
        console.error('Reroll failed:', error);
        throw error;
    }
}

Building a Complete Text-to-Image Tool

1. Create the Main Application

class TextToImageTool {
    constructor(apiKey) {
        this.client = new ImagineProSDK({
            apiKey: apiKey,
            timeout: 300000,
        });
        this.generationHistory = [];
    }

    async generateImage(prompt, options = {}) {
        const generationId = Date.now().toString();

        try {
            // Start generation
            const response = await this.client.imagine({
                prompt: prompt,
                ref: generationId,
                webhookOverride: options.webhookUrl
            });

            // Track generation
            this.generationHistory.push({
                id: generationId,
                messageId: response.messageId,
                prompt: prompt,
                status: 'processing',
                startTime: new Date()
            });

            // Wait for completion
            const result = await this.client.fetchMessage(response.messageId);

            // Update history
            const historyItem = this.generationHistory.find(item => item.id === generationId);
            if (historyItem) {
                historyItem.status = result.status;
                historyItem.result = result;
                historyItem.completionTime = new Date();
            }

            return result;
        } catch (error) {
            console.error('Image generation failed:', error);
            throw error;
        }
    }

    async enhanceImage(messageId, enhancementType, index = 1) {
        switch (enhancementType) {
            case 'upscale':
                return await this.client.upscale({ messageId, index });
            case 'variant':
                return await this.client.variant({ messageId, index });
            case 'reroll':
                return await this.client.reroll({ messageId });
            default:
                throw new Error('Unknown enhancement type');
        }
    }

    getGenerationHistory() {
        return this.generationHistory;
    }
}

2. Webhook Integration

For production applications, use webhooks to handle generation results asynchronously:

// Set up webhook endpoint
app.post('/webhook/midjourney', (req, res) => {
    const { messageId, status, uri, prompt, ref } = req.body;

    if (status === 'DONE') {
        // Handle successful generation
        console.log(`Image generated for prompt: ${prompt}`);
        console.log(`Image URL: ${uri}`);

        // Update your database, send notifications, etc.
        updateUserGallery(ref, uri);
    } else if (status === 'FAIL') {
        // Handle failed generation
        console.error(`Generation failed for prompt: ${prompt}`);
        notifyUserOfFailure(ref);
    }

    res.status(200).send('OK');
});

// Use webhook in generation
const result = await tool.generateImage('a beautiful sunset', {
    webhookUrl: 'https://your-app.com/webhook/midjourney'
});

3. Error Handling and Retry Logic

async function generateWithRetry(prompt, maxRetries = 3) {
    for (let attempt = 1; attempt <= maxRetries; attempt++) {
        try {
            return await tool.generateImage(prompt);
        } catch (error) {
            console.error(`Attempt ${attempt} failed:`, error.message);

            if (attempt === maxRetries) {
                throw new Error(`Failed after ${maxRetries} attempts: ${error.message}`);
            }

            // Wait before retrying
            await new Promise(resolve => setTimeout(resolve, 2000 * attempt));
        }
    }
}

Best Practices

1. Prompt Optimization

Be specific: Instead of "a cat", use "a majestic Persian cat with golden fur sitting in a sunlit garden"
Include style keywords: "photorealistic", "cinematic", "artistic", "minimalist"
Specify lighting and mood: "dramatic lighting", "soft natural light", "moody atmosphere"
Add quality modifiers: "high resolution", "detailed", "8k", "professional photography"

2. Performance Optimization

// Batch processing for multiple images
async function generateBatch(prompts) {
    const promises = prompts.map(prompt => 
        tool.generateImage(prompt).catch(error => ({
            error: error.message,
            prompt: prompt
        }))
    );

    return await Promise.allSettled(prompts);
}

// Rate limiting
class RateLimitedGenerator {
    constructor(tool, maxRequestsPerMinute = 10) {
        this.tool = tool;
        this.maxRequests = maxRequestsPerMinute;
        this.requestQueue = [];
        this.lastRequestTime = 0;
    }

    async generateImage(prompt) {
        const now = Date.now();
        const timeSinceLastRequest = now - this.lastRequestTime;
        const minInterval = 60000 / this.maxRequests; // 60 seconds / max requests

        if (timeSinceLastRequest < minInterval) {
            await new Promise(resolve => 
                setTimeout(resolve, minInterval - timeSinceLastRequest)
            );
        }

        this.lastRequestTime = Date.now();
        return await this.tool.generateImage(prompt);
    }
}

3. User Experience Considerations

// Progress tracking
async function generateWithProgress(prompt, onProgress) {
    const response = await tool.client.imagine({ prompt });

    // Poll for progress
    const checkProgress = async () => {
        const result = await tool.client.fetchMessage(response.messageId);

        onProgress({
            status: result.status,
            progress: result.progress,
            messageId: response.messageId
        });

        if (result.status === 'PROCESSING' || result.status === 'QUEUED') {
            setTimeout(checkProgress, 2000);
        }
    };

    checkProgress();
    return await tool.client.fetchMessage(response.messageId);
}

// Usage
generateWithProgress('a magical forest', (progress) => {
    console.log(`Status: ${progress.status}, Progress: ${progress.progress}%`);
});

Advanced Features

Inpainting (Selective Editing)

async function inpaintImage(messageId, mask) {
    try {
        const result = await midjourneyClient.inpainting({
            messageId: messageId,
            mask: mask // Base64 encoded mask image
        });

        return result;
    } catch (error) {
        console.error('Inpainting failed:', error);
        throw error;
    }
}

Custom Webhook Handling

class WebhookHandler {
    constructor() {
        this.pendingGenerations = new Map();
    }

    registerGeneration(generationId, callback) {
        this.pendingGenerations.set(generationId, callback);
    }

    handleWebhook(payload) {
        const { ref, status, uri, error } = payload;
        const callback = this.pendingGenerations.get(ref);

        if (callback) {
            callback({ status, uri, error });
            this.pendingGenerations.delete(ref);
        }
    }
}

Troubleshooting Common Issues

1. API Rate Limits

// Implement exponential backoff
async function generateWithBackoff(prompt, maxRetries = 5) {
    for (let attempt = 1; attempt <= maxRetries; attempt++) {
        try {
            return await tool.generateImage(prompt);
        } catch (error) {
            if (error.message.includes('rate limit')) {
                const delay = Math.pow(2, attempt) * 1000; // Exponential backoff
                console.log(`Rate limited, waiting ${delay}ms before retry`);
                await new Promise(resolve => setTimeout(resolve, delay));
            } else {
                throw error;
            }
        }
    }
}

2. Network Issues

// Add timeout and retry for network issues
const clientWithRetry = new ImagineProSDK({
    apiKey: 'your-api-key',
    timeout: 60000, // 1 minute timeout
    retryAttempts: 3,
    retryDelay: 1000
});

Conclusion

Building a text-to-image tool with Midjourney API opens up incredible possibilities for creative applications. By following this guide, you'll have a solid foundation for creating robust, user-friendly image generation tools.

Remember to:

Always handle errors gracefully
Implement proper rate limiting
Use webhooks for production applications
Optimize prompts for better results
Monitor API usage and costs

The ImaginePro SDK provides a clean, professional interface for Midjourney API integration, making it easier to build enterprise-grade applications with reliable image generation capabilities.

DEV Community