Matt

Posted on Jun 4 • Edited on Jun 30 • Originally published at fortem.dev

How to Cut AWS Costs Without Reserved Instances

#aws #cost #optimization #fargate

How to Cut AWS Costs Without Reserved Instances

Originally published at https://fortem.dev/blog/reduce-aws-costs-without-ri
RIs change how you pay, not what runs. 5 ways to cut AWS consumption: scheduling, right-sizing, Spot, auto-stop, killing orphans.

Guide

You've already set up Reserved Instances and Savings Plans. You checked the boxes the FinOps team sent over. Your AWS bill is still too high — and it keeps climbing. That's because RIs and Savings Plans change how you pay for compute. They don't change how much compute you consume. If your dev and staging environments run 24/7 while your team works 40 hours a week, no pricing model optimization will fix that. Five methods address that directly.

TL;DR

RIs and Savings Plans change your pricing model — not your consumption. They're table stakes. Get them first, then keep reading.
Scheduling non-prod environments to business hours alone cuts compute spend by 60–70% — 3× the impact of a typical RI on non-prod workloads.
Right-sizing overprovisioned services costs $0 to implement and saves 10–30% immediately. Check p95 CloudWatch metrics before changing a single line of Terraform.
Fargate Spot drops compute costs ~70% for fault-tolerant workloads. Combined with scheduling, dev environments cost near-zero.
Most teams have 5–15% of environments that nobody owns. Finding and deleting 3 orphaned environments recovers $500–2,000/month.

Reserved Instances are table stakes — what's next?

“Reserved Instances give you up to 66% off list price for committed usage — but they don't change total consumption. You're still paying for 168 hours per week per environment.”

— AWS Savings Plans documentation

RIs give up to 66% off — but they change the price per unit, not units consumed. Scheduling those same environments saves 1.75× more than RIs on non-prod. Go to the AWS Savings Plans console and commit to a 1-year plan for your production workloads. It's a up to 66% discount on list price for zero engineering effort. This is the lowest-hanging fruit in AWS cost optimization. Do it first.

The problem RIs don't solve: they change the price per unit, but not the number of units you consume. Your dev environments still run 168 hours a week. Your staging environment still sits idle at 3am on Sunday. Your three orphaned environments from last year's migration still bill by the second.

KEY INSIGHT: On a $10,000/month AWS bill where 70% is non-production compute: a 40% RI discount saves $2,800/month. Scheduling those same non-production environments to business hours saves $4,900/month. RI addresses the pricing model. Scheduling addresses the consumption.

$10,000/mo bill breakdown:

Non-production compute (70%): $7,000/mo

RI savings on non-prod (40%): −$2,800/mo

Scheduling savings (70% of compute hrs): −$4,900/mo

Scheduling captures 1.75× more savings than RIs on non-prod — and you can do both

The three levers that cut consumption (not how you pay)

Scheduling, right-sizing, and Fargate Spot are the three levers that reduce what you run — they save more than any RI, and an RI can't touch them. Here's the one-line case for each; the full ECS Fargate cost optimization guide has the math, the bar charts, and the implementation for all three.

1

Scheduling (60–70%). Non-prod runs 168 hrs/week; your team works ~50. Scheduling those environments off nights and weekends bills Fargate $0 the rest of the time — the highest-ROI, zero-Terraform change most teams can make.
2

Right-sizing (10–30%). Most tasks run at 15–30% of allocated CPU/memory. Set size to p95 × 3 (with autoscaling as the safety net) and a 1 vCPU → 0.25 vCPU cut drops that task ~75%.
3

Fargate Spot (~68%). Spare-capacity pricing with a 2-minute interruption notice — right for CI/CD, batch, and dev; wrong for production. Full Spot breakdown →

KEY INSIGHT: Why these beat an RI: an RI changes the rate on compute you keep running 24/7. These three cut the hours and sizeof what runs at all. A Reserved Instance on an idle 3am dev environment still bills you for 3am. Scheduling that environment to $0 doesn't.

Auto-stop idle environments (the lever the optimization guide skips)

Environments with no deployments and no log activity for 6+ days are automatically stopped via a CloudTrail + Lambda rule, with a Slack alert for one-click restart. Implement with CloudTrail event monitoring + Lambda. The hard part: defining 'idle'. No deployments? No API calls? No console logins? Pick one and enforce it.

This is different from scheduling. Scheduling is predictable — environments stop and start on a fixed calendar. Auto-stop targets environments that _should_be in use but aren't. An environment that hasn't seen a deployment in 10 days, has zero active connections, and generates no application logs — it's probably abandoned, even if someone forgot to tell you.

Implementation:monitor CloudTrail for ECS service updates (deployments) and CloudWatch Logs for application activity. If an environment has zero deploy events and zero log activity for a configurable threshold — say 6 consecutive days — automatically set its ECS service desired counts to 0. Send a Slack notification: “use1-dev-experiment stopped — idle 6 days. One-click restart here.”

KEY INSIGHT: The organizational question is harder than the technical implementation: who decides what “idle” means? 3 days? 7 days? 14 days? Define the policy with your team leads, document it, and give developers a 24-hour warning before auto-stop kicks in. The technical part is a Lambda function. The organizational part is a Slack thread.

Best practice: start conservative. 14-day idle threshold, 48-hour warning. Measure how many environments get auto-stopped and how many get immediately restarted. Tighten the threshold over time as the team builds trust in the process.

Kill orphaned environments

Tag every resource with an owner at provisioning, run a monthly audit against the team directory, and delete any environment with no owner and no deploy in 30+ days. Monthly audit: cross-reference with team directory. Ownerless resources → platform team review. Most teams find 1-3 orphaned environments at $200-400/mo each. Recovery: $500-1,200/mo immediately.

While auto-stop handles the recently-idle, this method handles the permanently-abandoned. Every team that's been running ECS for more than a year has environments that nobody claims. They were spun up for a migration, a hackathon, a departed engineer's experiment. Nobody deploys to them. Nobody knows who owns them. They bill — quietly, every month — and because of the real per-environment cost of Fargate (ALB, NAT Gateway, and CloudWatch overhead before any compute), an idle environment is rarely as cheap as people assume.

“Most teams we work with find 5–15% of their environments are completely abandoned — no deploys in 6+ months, no identifiable owner, no access logs. Three orphaned environments at $170/month each = $6,120/year of compute serving zero requests.”

— Fortem fleet audit of 100+ ECS environments across 12 teams, 2026

Audit approach: pull the last deployment timestamp per environment. Cross-reference with the team directory (who owns what?). Environments with no deploy in 30+ days and no active team owner go on a review list. The platform team reviews the list, confirms abandonment, and deletes the infrastructure.

KEY INSIGHT: Finding orphaned environments is a one-time audit that costs $0 and takes an afternoon. The savings compound every month. For a team with 50+ environments, the most common outcome is 2–5 orphans worth $500–$2,000/month. That's $6,000–$24,000/year — from a one-time afternoon of work.

Comparing the methods (the full stack)

Stack them by impact-to-effort: scheduling first, then right-sizing, then Spot, then auto-stop, then orphan deletion — one per week, not all at once. That's how cost optimization projects die in committee. Do the first this week, the next one next week, and watch the savings compound.

Method	Impact	Effort	Risk	What it means
1. Scheduling	60–70%	Low	None	Dev/staging envs stop outside business hours (50 hrs/wk instead of 168). Zero Terraform changes.
2. Right-sizing	10–30%	Medium	Low	Drop task CPU/memory to p95 + 50% headroom. One-time TF change per service.
3. Fargate Spot	Up to 70%	Low	Medium	Switch capacity provider to FARGATE_SPOT. 2-min interruption notice from AWS.
4. Auto-stop idle	Variable	Medium	Low	Stop any env not deployed to or accessed in 6+ days. CloudTrail + Lambda.
5. Kill orphans	$500–2,000/mo	Low	None	Find envs with no owner and no deploys in 30+ days. Delete them.

$5,765/mo· $69,180/yr

Combined impact on a $10,000/mo fleet: RI (−$2,800) + Scheduling (−$4,900) + Right-sizing (−$1,050 on remaining) + Spot on eligible dev envs (−$815). Total: $10,000 → $4,235/mo. 57% reduction without touching a single Reserved Instance.

The specific numbers depend on your fleet composition. A team with 80% non-prod compute will see scheduling dominate. A team where everything runs at steady utilization will see right-sizing and Spot carry the weight. The framework is the same regardless: reduce consumption first, then optimize the pricing model on what remains.

Fortem automates scheduling, idle detection, and orphan identification across every ECS Fargate environment in your fleet — no Terraform changes, no Lambda functions to maintain. Connect your AWS account and the savings surface in under 20 minutes.

Book a 20-min call →

Common questions

How do you reduce AWS costs without Reserved Instances?

Five methods ranked by impact: (1) schedule non-prod environments off outside business hours — 60-70% savings; (2) right-size overprovisioned services — free, immediate; (3) Fargate Spot or EC2 Spot — up to 70% discount; (4) auto-stop idle environments after 6+ days unused; (5) kill orphaned environments nobody claims.

How do you optimize costs in AWS?

RIs and Savings Plans change how you pay — not what runs. The bigger lever is reducing what runs: scheduling, right-sizing, Spot, and killing orphaned resources. Check CloudWatch Container Insights for actual vs allocated CPU/memory. Most environments are overprovisioned by 2-3×.

What is the best AWS cost optimization strategy for 2026?

Stop paying for idle compute. The average non-prod environment runs 168 hrs/week and is used ~55 hrs. Scheduling alone saves 60-70% of non-prod compute — no RI commitment needed. Fargate Spot adds another 70% on top for fault-tolerant workloads. The combination is the highest-ROI strategy for any ECS team.

How can I find and kill orphaned AWS resources?

Tag every resource with an owner tag at provisioning. Run a monthly audit: list all resources, cross-reference with team directory. Resources without current owners go to the platform team for review. At 10+ environments, teams typically find 1-3 orphaned environments costing $200-400/mo each.

The numbers in this post are estimates. Run the Fleet Audit against your actual ECS fleet and get your real figure in 15 minutes.

If you read this, you might also want to know

How do I convince finance to approve scheduling instead of RIs?

RIs require 1-3 year commitment. Scheduling costs $0 to implement. Present the math: 10 dev environments at $200/mo each = $2,000/mo. Scheduling at 60% savings = $1,200/mo saved immediately, zero commitment. Finance understands zero-risk savings better than long-term commitments.

Does scheduling affect my production environments?

No — scheduling applies to non-production environments only. Production always runs 24/7. Tag everything tagged environment=dev,staging,qa,demo for schedules. Leave production tags untouched. Most teams realise 60-70% of their ECS bill is non-production.

What's the difference between Savings Plans and Reserved Instances for ECS?

Savings Plans apply to Fargate usage directly — no instance selection needed. RIs apply to specific EC2 instance types for ECS on EC2. Compute Savings Plans give the most flexibility: 1-3 year commit, applies to Fargate, EC2, and Lambda.