DEV Community

Manognya Lokesh Reddy
Manognya Lokesh Reddy

Posted on

🧬 Enhancing Historical Images with GANs: My AEGAN + SRGAN Project

Hey everyone! 👋

I’m Manognya Lokesh Reddy, currently pursuing my Master’s in Artificial Intelligence at the University of Michigan-Dearborn. In this post, I’ll take you behind the scenes of one of my most technically demanding and visually exciting projects: high-resolution image synthesis using GANs.

I built a system that takes low-resolution or black-and-white historical images and transforms them into sharp, colored, high-quality outputs using a combination of AEGAN (Auto-Embedding GAN) and SRGAN (Super-Resolution GAN).

🖼️ The Problem
Low-resolution or faded historical images limit both aesthetic appeal and analytical value. Traditional upscaling doesn’t preserve fine detail, and manual enhancement is time-consuming.

The goal? Use Generative Adversarial Networks (GANs) to:

Upscale image resolution

Colorize black-and-white photos

Retain edge quality and texture

Do it fast and accurately

⚙️ Tech Stack
Python

PyTorch – for building AEGAN and SRGAN models

Docker – for reproducible deployment

JWT (JSON Web Tokens) – for secure REST API endpoints

Mixed Precision Training – to optimize GPU usage

OpenCV – for image pre/post-processing

🧠 How It Works
🔗 AEGAN Architecture
Combines auto-encoders and GANs to embed image features in a compressed space

Stabilized training with gradient penalty and Wasserstein loss

Mixed precision training helped accelerate training by ~60% on GPU

🔎 SRGAN Module
Generates 4x super-resolved outputs

Preserves perceptual detail using VGG-based content loss

Fine-tuned using perceptual + adversarial losses for photorealism

🔐 Scalable API Access
Deployed model using a Docker container

Secured API with JWT authentication to restrict access and support production scalability

📊 Results
✅ 37% improvement in FID score (Frechet Inception Distance) over baseline GANs

⚡ 60% faster inference time with GPU optimization

🖼️ Output quality good enough for archival, artistic, and medical use cases

🧪 Deployment time reduced by 50% using Dockerized workflows

💡 What I Learned
GAN training is unstable—techniques like gradient penalty and mixed precision training are critical

Embedding-based architectures like AEGAN offer superior feature reconstruction

Docker and JWT are must-have tools for secure, scalable ML deployment

Working with image data brings unique challenges (e.g., lighting variance, compression noise)

📸 Use Cases
Historical image restoration

Medical image enhancement

Satellite image upscaling

Artistic rendering for museums or documentaries

Top comments (0)