AWS Lambda Durable Functions allow you to build long-running, stateful workflows that can run for up to one year while paying zero compute cost during waits.
AWS Lambda Durable Functions: Complete Learning Roadmap
AWS Lambda Durable Functions introduce a new way to build long-running, stateful workflows directly in code.
Instead of stitching together SQS queues, DynamoDB state tables, custom retry mechanisms, and complex Step Functions definitions, you can write workflows using the Lambda programming model you already know.
With Durable Functions, workflows can:
- Run for up to one year
- Pause while waiting for approvals, callbacks, or external events
- Automatically recover from failures
- Resume execution without losing state
- Pay zero compute cost while suspended
To help developers learn this new execution model, I put together a complete learning path that starts with the fundamentals and gradually progresses to production-grade patterns, testing, observability, and real-world projects.
Section 1: Course Prerequisites
Before diving into Durable Functions, we'll prepare the development environment and supporting resources.
AWS CLI Installation
Set up and configure the AWS CLI.
Documentation and Code
Access the GitHub repository, course resources, and official documentation.
Cost Considerations
Understand the expected AWS costs for following along.
Section 2: Getting Started with AWS Lambda Durable Functions
Build a strong foundation before exploring advanced patterns.
Why AWS Lambda Durable Functions?
Understand the evolution from:
- Stateless Lambda
- AWS Step Functions
- AWS Lambda Durable Functions
What is a Durable Function?
Learn durable execution, checkpointing, replay, and workflow persistence.
Create Your First Durable Function
Build and execute your first workflow.
Built-in Deduplication
Implement idempotency using durable execution names.
Synchronous vs Asynchronous Invocation
Learn when to use each execution model.
Section 3: Durable Steps
Learn how Durable Functions execute work safely and reliably.
Avoid Non-Deterministic Code
Prevent replay failures caused by timestamps, randomness, and external side effects.
Avoid Variable Mutation from Steps
Understand safe state management patterns.
Workflow Branching
Implement conditional workflow execution.
Error Handling in Steps
Explore workflow failures and recovery behavior.
More Durable Operations
Overview of advanced workflow capabilities.
Section 4: Mastering Wait Operations
One of the most powerful features of Durable Functions.
Different Types of Wait Operations
Explore:
- wait
- waitForCondition
- waitForCallback
- invoke
Wait for Condition
Implement polling-based workflows.
AWS Polly Integration Demo
Build a real-world asynchronous workflow.
Callback Operations
Pause execution until an external event resumes the workflow.
Resolving Callbacks
Resume workflows using AWS CLI and CloudShell.
Project: Human-in-the-Loop Workflow
Build an approval workflow requiring human interaction.
Invoke Operation
Coordinate multiple Lambda functions inside a durable workflow.
Section 5: Concurrent Operations
Learn how to execute work in parallel.
Understanding Concurrency
Explore Durable Functions concurrency fundamentals.
context.parallel
Run independent operations simultaneously.
Parallel Execution Configuration
Control completion and failure handling.
context.map
Process collections concurrently using isolated execution contexts.
Section 6: Project - Employee Reimbursement Manager
Build a production-style approval workflow.
Workflow Architecture
Design a document approval system.
Full Implementation
Create a complete reimbursement approval workflow using Durable Functions.
Key concepts:
- Human approvals
- Callback workflows
- Long-running business processes
- Workflow state management
Section 7: Workflows with Child Contexts
Child Contexts and Sub-Workflows
Learn how to compose reusable workflows and solve replay-ordering challenges.
Section 8: Error Handling, Retries & Execution Semantics
Learn how Durable Functions behave under failure conditions.
Error Fundamentals
Understand workflow failure behavior.
Retry Strategies
Implement exponential backoff and retry policies.
Custom Retry Strategies
Create advanced retry mechanisms.
Retry Presets
Use built-in retry configurations.
Step Semantics
Understand:
- At-Least-Once execution
- At-Most-Once execution
Step Semantics Demo
Observe execution guarantees in practice.
Section 9: Serialization & Deserialization (SerDes)
Durable execution relies heavily on serialization.
Understanding Serialization
Learn how checkpoints persist workflow state.
Custom SerDes Strategies
Create custom serialization logic.
createClassSerdes Helper
Restore class instances automatically.
createClassSerdesWithDates
Handle JavaScript Date objects correctly.
Compression & Cost Optimization
Reduce storage costs using compression techniques.
Section 10: Implementing the Saga Pattern
Saga Fundamentals
Manage distributed transactions safely.
Saga Implementation Demo
Build compensation-based rollback workflows.
Section 11: Project - QuantaSneaks E-Commerce Workflow
A complete production-style distributed workflow.
Architecture Overview
Understand the overall workflow design.
Sequence Diagram Walkthrough
Visualize the complete execution flow.
Code Deep Dive
Implement the durable workflow.
Infrastructure Deployment
Provision supporting AWS resources.
Key concepts:
- Human approvals
- AI risk scoring
- Payment workflows
- Inventory validation
- Distributed transactions
- Saga compensation patterns
Section 12: Observability & Monitoring
Learn how to operate Durable Functions in production.
Durable Execution History
Inspect workflow state transitions.
Logs & Debugging
Troubleshoot workflows using CloudWatch Logs.
Durable Metrics
Monitor workflow health with CloudWatch metrics and alarms.
Section 13: Testing Durable Functions
Testing is often overlooked in workflow orchestration.
Durable Testing SDK
Introduction to testing Durable Functions.
Test Environment Setup
Configure a testing framework.
Testing Wait Operations
Validate delayed execution behavior.
Testing Durable Operations
Test workflow logic in isolation.
Testing Callback Workflows
Test approval and callback patterns.
Cloud Runner
Execute tests against the deployed infrastructure.
End-to-End QuantaSneaks Testing
Build a complete workflow test suite.
Section 14: Production Readiness with AWS CDK
Deploy Durable Functions using Infrastructure as Code.
Infrastructure as Code Fundamentals
Why IaC matters in production environments.
AWS CDK Crash Course
Learn the CDK essentials.
Deploy Your First Durable Function
Provision workflows using AWS CDK.
Bundling with Esbuild
Optimize deployments using NodejsFunction.
TypeScript Support
Build workflows using TypeScript.
Durable Execution ESLint Plugin
Catch replay and determinism bugs before deployment.
Section 15: Real-World Durable Functions Case Studies
Explore how Durable Functions are being used in real-world systems.
Understand execution and storage costs.
Conclusion
Durable Functions introduce a different way of thinking about long-running workflows on AWS. Understanding concepts such as durable execution, replay, callbacks, retries, and workflow composition can help developers build more resilient serverless applications.
Top comments (0)