DEV Community

Biricik Biricik
Biricik Biricik

Posted on • Originally published at zsky.ai

The Real Reason AI Image Tools Are Bad for Realtors (And How to Fix It)

I've spent the last six months talking to about 40 realtors about AI image tools. Most of them tried Midjourney once, got a kitchen with a floating refrigerator, and never came back. A few stuck with virtual staging SaaS that charges $25 per room. The rest are still paying photographers $400 a listing because nothing else works.

This bothers me. Not because I want to sell realtors anything (I run an indie creative platform, our typical user is a music producer, not a Re/Max agent). It bothers me because the tools are so close to being good for this use case, and the gap between "useless" and "actually useful" is mostly prompt engineering and a bit of workflow design.

So here's what I figured out, written as if I were sitting next to a realtor friend trying to fix this for them. Includes the prompt formula, the workflow, the cost math, and a curl example for anyone who wants to wire it into their CRM.

The real failure mode

When a realtor types "modern living room with sunlight" into a generic AI image tool, the model gives them a beautiful living room. The problem: it's not their living room. It's a hallucinated room that doesn't exist in any MLS listing.

Realtors don't need image generation. They need image transformation. Specifically:

  1. Virtual staging — empty room becomes a furnished room
  2. Day-to-twilight — daytime exterior becomes the magic-hour shot the seller paid extra for
  3. Decluttering — that recliner from 1994 disappears
  4. Sky replacement — gray Tuesday becomes blue Sunday
  5. Hero shot generation — the social-media-ready closeup that gets clicks

Generic text-to-image tools do none of this well because they don't anchor to the source photo. You need image-to-image, not text-to-image. That's the mistake that breaks 90% of "I tried AI for my listings and it was bad" stories.

The 5-element prompt formula (real estate version)

I wrote about this formula in another post for photographers, but the real estate version has its own quirks. Here's the structure:

[ROOM TYPE] + [STYLE] + [STAGING DETAIL] + [LIGHT/CAMERA] + [QUALITY]
Enter fullscreen mode Exit fullscreen mode

Worked example for a virtual staging job on an empty 12x14 living room:

Living room, mid-century modern style, walnut credenza and low-profile gray sofa with rust throw pillows, soft morning light from left window, shot on 24mm wide lens at f/8, MLS-quality interior photography

Notice what's missing: no "masterpiece, 8k, hyperrealistic, ultra-detailed" filler. That's pure noise. Modern image models are good enough that the magic words don't help. What helps is specificity about objects, materials, light direction, and lens.

Common mistakes I see realtors make in their prompts:

  • Vague style words — "elegant" means nothing. "1960s Eichler with terrazzo floors" means everything.
  • No light direction — "soft morning light from left window" produces a totally different image than "harsh overhead noon light." Pick one.
  • Conflicting time of day — "sunset light" plus "bright airy" = the model picks one and ignores the other.
  • Overstuffing — adding 14 pieces of furniture to a single prompt. Stage one corner at a time, then composite if needed.

Cost breakdown — actually doing the math

I'm going to compare realistic cost-per-listing across the options. Numbers are from talking to realtors, not pulled from thin air.

Traditional photographer — $300 to $500 for a single-family home with 25 photos. Includes a few hero shots, maybe twilight retouching as an upsell. Repeatable, but slow (3-day turnaround) and doesn't scale.

Premium virtual staging SaaS — $25 to $40 per room. A typical 4-bedroom listing needs 6-8 rooms staged. That's $150 to $320 per listing, 24-hour turnaround. Quality is high but inflexible — you can't iterate cheaply.

Generic AI image tools (Midjourney, etc.) — about $0.04 per image if you use them aggressively in the $30/month tier. Problem: no image-to-image staging, so you get new rooms instead of staged versions of your room. Useless for MLS.

Indie image platforms with image-to-image — credit-based, typically $0.02 to $0.05 per generation depending on the plan. Generate 10-20 variations per room, pick the best, ship it. Total cost per listing: under $5 if you know what you're doing.

The economics are absurd. The tooling has improved 100x and the per-image cost has dropped 1000x. Realtors who don't pick this up in 2026 are going to lose listings to ones who do.

A workflow that actually works

This is the workflow I would use if I was a listing agent with a fresh empty house. It's tactical, not theoretical.

Step 1 — shoot the source photos right. This matters more than the AI part. Use a tripod, shoot at 24mm equivalent, expose for the windows, bracket if you need to. Garbage in, garbage out — no AI tool will rescue a phone photo with a tilted horizon and yellow tungsten cast.

Step 2 — pick your three hero rooms. Living room, primary bedroom, kitchen. These are the rooms that decide whether someone clicks. The other 22 photos can be plain.

Step 3 — stage one corner at a time. Don't ask the AI to fill an entire room. Mask the corner you care about, prompt for one specific furniture vignette, generate 8 variations. Pick the best. Move to the next corner.

Step 4 — twilight conversion last. Take your best exterior daytime shot, run a day-to-twilight transform. Dial in warm interior light glow through windows — that's the detail that sells the shot.

Step 5 — strip metadata before upload. This one is overlooked. AI-generated images sometimes carry workflow metadata that exposes which tool you used. Run them through exiftool -all= before sending to MLS. Your seller doesn't need to know.

exiftool -all= -overwrite_original /path/to/listing-photos/*.jpg
Enter fullscreen mode Exit fullscreen mode

Step 6 — A/B test the hero. Most MLS systems let you swap the lead photo. Track click-through. The "good staging" version usually beats the "great photographer" version because it shows the life in the room, not the empty box.

The curl example I promised

A lot of indie tools (including ours) expose an HTTP endpoint for generation. Here's a generic shape that works for most of them — adapt the URL and key for your provider:

curl -X POST "https://api.example.com/v1/edit" \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "source_image_url": "https://your-mls-host.com/listing/12345/raw/living_room.jpg",
    "instruction": "Stage as mid-century modern living room with walnut credenza, gray sofa, rust throw pillows, soft morning light from left window. MLS interior photography quality.",
    "num_variations": 4,
    "preserve_geometry": true
  }'
Enter fullscreen mode Exit fullscreen mode

preserve_geometry: true is the magic flag. It tells the model to keep walls, windows, and floors locked to the source while only generating the furniture. Without this, you get the floating-refrigerator problem.

If your CRM lets you call webhooks on listing creation, you can wire this so every new listing automatically gets a staged variant queued up for review. Most realtors I talk to don't realize this is possible. It's not even hard — it's a four-line Zapier zap or a single Supabase edge function.

The honest disclaimer

A few realtors I know got burned by MLS rules requiring disclosure of virtually staged photos. Different MLS systems have different policies. Florida requires "virtually staged" labels, California is looser, Texas varies by board. Check your local rules before you ship. The technology is ready; the legal framework lags behind.

Also: if you're doing twilight conversion or sky replacement, you're not "staging" — you're enhancing. Most MLS rules treat that the same as a photographer's color correction, but again, check.

Why I care about this

I built our platform because I'm an artist who lost my visual imagination after a brain injury and rebuilt it with photography. The mission has always been: anyone should be able to make beautiful images, regardless of whether they were trained as an artist. Realtors are exactly the kind of people that mission is for. They're not visual professionals, but they need visual professionalism every single week.

The tools are finally good enough. The prompt formula works. The cost math is laughable in our favor. The only thing missing is realtors knowing this is now possible.

If you're a realtor reading this: you don't need a $400 photographer for every listing. You need a tripod, the 5-element prompt formula, and 30 minutes of image-to-image work per listing. The savings will pay for your MLS dues for a decade.

If you're a developer reading this and you build CRM integrations: there's a real product hiding here. "Auto-stage every new listing on creation" is a feature nobody has shipped well yet. The realtors I know would pay for it.

I'm at zsky.ai if you want to see how we approach the artist-for-artists side of this. Happy to answer questions in the comments — drop your worst listing photo and I'll show you exactly which corner to stage first.

Top comments (0)