1. Introduction
Over the last six months, AI-assisted coding has become a core part of how I build software.
I’ve been using it almost daily across my AWS projects. One of them is my personal project [playingpadel.es](https://playingpadel.es){:target="_blank"}, a web app for padel players that keeps evolving.
During this time I’ve tried different tools (Amazon Kiro, Copilot, Antigravity, Cursor, Windsurf). They all helped me iterate faster. And with all of them, I eventually made mistakes for the same underlying reasons.
That’s when I realized something important: the tool mattered much less than how I was using it. The same mistakes appeared across all of them whenever I skipped validation or gave them too much freedom.
For something simple, or even for a proof of concept (PoC), you can use these assistants almost any way you like and everything seems to work. But once you apply them to production code, the story changes.
You need more discipline and more control: tighter scope, clear constraints, tests, and verification. Otherwise, speed becomes misleading: you move fast today, but you pay for it later with technical debt, regressions, and production surprises.
This article is not about hype or tools. It’s a practical summary of what didn’t work for me, what did, and the workflow that has kept me safe while building and evolving real systems.
2. My default working loop
Before getting into what didn’t work, this is the loop I try to follow every day when I’m coding with AI assistance:
- Plan a small change
- Implement
- Test
- Review the changes
- Commit
- Repeat
3. Mapping: from my mistakes to my fixes
This is a quick map to help you navigate. The value is in the details: examples, nuances, and how the pieces connect.
| What didn’t work (section 4) | What actually helped (section 5) |
|---|---|
| Asking for too many things at once | Keep changes small and scoped (5.1) |
| Blindly trusting the output | Review the plan and the diff first (5.2) |
| Not asking for validation | Ask for verification, not just code (5.5) |
| Losing control of the code | Refactor after correctness (5.7) |
| Not setting clear constraints | Define intent and limits clearly (5.3) |
| Taking shortcuts without thinking long-term | Checklist and continuous validation (5.8) |
| Not committing frequently | Commit small and often (5.6) |
| Too many tests without strategy | Treat tests as the contract (5.4) |
| Using weak models for complex tasks | Use the right model for the job (5.9) |
| Constantly switching tools | Master one tool and configure it well (5.9) |
4. What didn’t work for me
This section is intentionally long.
Not because things went particularly wrong, but because most of the problems I ran into kept repeating themselves in different forms.
The important part is that many of them share the same root cause: once you lose control of the loop (validation, review, and feedback), everything becomes fragile very quickly.
4.1. Asking for too many things at once
The most common mistake I made was asking the AI to implement too many changes in a single request.
Even when the result looks “correct”, the changes become hard to review and validation slows down. That’s where bugs start to sneak in.
What I learned
- Fast generation only helps if the validation loop is also fast.
- If I can’t validate something easily, I shouldn’t generate it in bulk.
- Small changes reduce stress and errors.
4.2. Blindly trusting generated code
Sometimes the code looks clean and “well done”. That doesn’t mean it’s correct, safe, or consistent with your system.
The problem is that many issues aren’t obvious. They don’t break the build, they don’t fail locally, and yet they introduce behavior changes: missing validations, runaway retries, wrong timeouts, or unhandled edge cases.
In cloud environments this becomes even more visible, because those changes can translate into latency, unexpected costs, or degraded behavior under real load.
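To make this concrete, here is a minimal TypeScript sketch of the kind of code I mean (the endpoint and function names are invented). It compiles and even looks like it handles errors, but it retries forever with no backoff and no timeout, exactly the kind of behavior change that only shows up under real load.

```typescript
// Looks clean and "handles errors", but notice what's missing:
// no retry limit, no backoff, no request timeout.
async function fetchPlayerProfile(playerId: string): Promise<unknown> {
  while (true) {
    try {
      const response = await fetch(`https://api.example.com/players/${playerId}`);
      if (response.ok) {
        return await response.json();
      }
      // Non-OK responses fall through and retry immediately, forever.
    } catch {
      // Network errors are swallowed and retried immediately:
      // during a real outage this hammers the API and burns compute.
    }
  }
}
```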
What I learned
- Never assume that “it compiles” means “it’s fine”.
- Review generated code like you would review a teammate’s PR.
- If I can’t explain a piece of code, I don’t want it in production.
4.3. Not asking the AI to validate its own work
Many assistants make changes and stop. They won’t validate anything unless you explicitly ask.
If you assume the output works, you’ll often end up testing it yourself, watching it fail, and wasting time rebuilding context.
What I learned
- Always ask for implementation plus validation.
- “Implement + validate with tests + explain why it works” is a solid default.
- Verification is not an extra phase, it’s part of the work.
4.4. Losing control of the code
This happens faster than you’d expect. You keep asking for changes and suddenly you’re no longer sure what you have or how it works.
The AI can generate working code, but you are still the owner and responsible for maintaining it.
What I learned
- If something isn’t clear, ask for explanations before moving on.
- Prefer simpler code, even if it’s slightly longer.
- Don’t move forward without understanding the current state.
4.5. Not setting clear constraints
Without constraints, the assistant will make decisions for you: changing structure, adding dependencies, or “improving” things you didn’t ask for.
The problem is not that it does this, but that it does it without context.
What I learned
- Constraints are not optional.
- Being explicit about what I do NOT want prevents surprises.
- Rules like “minimal diff”, “no new services”, or “don’t touch IAM” save a lot of trouble.
4.6. Choosing the fast path without thinking long-term
I accepted shortcuts because I wanted the solution quickly. Later, undoing them was slow and expensive.
What felt like a pragmatic decision turned into technical debt that was expensive to pay down.
What I learned
- Think about trade-offs before accepting shortcuts.
- Document temporary decisions with an exit plan.
- Be especially careful when shortcuts affect security, data, or cost.
4.7. Not committing frequently
Accumulating working changes without committing is accumulating risk.
Without commits you lose traceability, rollback points, and confidence to keep iterating.
What I learned
- Small commits reduce fear.
- Checkpoints make experimentation cheaper.
- Rolling back should always be easy.
4.8. Having too many tests without a clear strategy
It’s very easy to generate tests today. It’s also very easy to end up with a slow and redundant test suite.
In my padel project, I ended up with hundreds of E2E tests, many of them duplicated, and I didn't want to run them because they took too long.
What I learned
- Tests need a strategy, not just volume.
- Slow tests are deferred cost.
- Not every change needs the same level of testing.
4.9. Using weaker models for complex tasks
I tried to “save tokens” by using smaller models for tasks that weren’t simple. The result was more time spent fixing and reworking.
What I learned
- If your assistant can choose the model automatically, let it. That’s usually the best default unless you know exactly what you need.
- If you choose, use the right model for the complexity of the task.
- Small models for boilerplate.
- Strong models for design, debugging, and risky refactors.
4.10. Switching tools too often
Jumping between IDEs, chats, and assistants made me lose context and consistency.
Each tool came with different assumptions and rules.
What I learned
- Do not change tools in the middle of a task.
- If I switch tools, explicitly copy the plan and constraints.
- Close my working loop before moving on.
5. What worked for me
If the previous section lists the mistakes, this is the practical part: the practices that, together, helped me use coding assistants without losing control.
They’re not tricks or shortcuts. They’re repeatable habits that let me keep momentum without sacrificing quality.
Most of these practices are boring.
That’s exactly why they work.
5.1. Keep control: request a change, validate, continue
If I had to pick one change that made everything else easier, this would be it.
I used to ask for too much in a single request. Now I go one change at a time: request a small change, validate it, move on.
Instead of asking for “refactor the whole module”, I ask for one concrete improvement.
Small changes are easier to review, validation stays fast, and rollbacks stop being scary.
5.2. Review the plan before implementing
When a change is complex or high-impact, I review what will be done before any code is written.
My pattern is simple:
- First, ask for the plan (options, risks, steps).
- Once I’m happy, ask to implement only what was agreed.
That avoids surprises and keeps scope under control.
I treat the assistant like a teammate.
If I can’t review the changes quickly, the problem isn’t the assistant. It’s the size of the change.
Quick rule: if changes are big, split them. I prefer three small changes over one huge one.
5.3. Define what I want, in as much detail as needed
When the prompt is vague, the output is vague.
If you’re not clear, the assistant will fill the gaps for you.
I don’t follow a rigid template, but I usually include:
- what I want
- what I don’t want (clear limits)
- how I’ll know it’s done (acceptance criteria)
- relevant context for this change
It may feel excessive, but it reduces misunderstandings and improves output quality.
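As an illustration, a request following that structure might look like this (the feature and file names are invented, and I adapt the level of detail to the risk of the change):

```text
What I want:
- An endpoint to cancel a court booking, reusing the existing booking service.

What I don't want:
- No new dependencies, no data model changes, don't touch IAM or infrastructure.

Done when:
- Cancelling an existing booking returns 200 and frees the slot.
- Cancelling a non-existent booking returns 404.
- Existing booking tests still pass.

Context:
- Bookings live in bookings/service.ts; follow the error handling already used there.
```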
Amazon Kiro has a spec-driven development mode that forces you to define intent first, validate how it will be built, and only then generate code. It’s uncomfortable at first, but it reduces errors.
5.4. Have a clear testing strategy
Tests are not just there to “make the pipeline green”. They’re a control mechanism: confidence to change things and fast feedback when regressions appear.
What works for me is making three decisions explicitly:
- Which test levels I have (and what each level is for).
- When I add new tests (new feature, new bug, new edge case).
- When I run them (what runs on every PR vs what I run at specific times).
My levels, ordered by criticality, are:
- Level 1: Smoke tests
- Level 2: Unit tests
- Level 3: Integration tests
- Level 4: Contract / API tests
- Level 5: E2E tests
Be careful with E2E tests that trigger real side effects (emails, data writes, AWS events, etc.). If you add E2E tests, also add cleanup and isolation.
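As a minimal sketch of what isolation and cleanup can look like (assuming Vitest and a hypothetical test environment; the endpoints are invented, adapt them to your stack):

```typescript
import { afterEach, expect, it } from "vitest";

// Hypothetical test environment; the point is the cleanup pattern, not the client.
const BASE_URL = process.env.E2E_BASE_URL ?? "http://localhost:3000";

let bookingId: string | undefined;

afterEach(async () => {
  // Always remove the data this test created, even if the assertion failed.
  if (bookingId) {
    await fetch(`${BASE_URL}/bookings/${bookingId}`, { method: "DELETE" });
    bookingId = undefined;
  }
});

it("cancels a booking and frees the slot", async () => {
  // Each test creates its own isolated data instead of relying on shared fixtures.
  const created = await fetch(`${BASE_URL}/bookings`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ court: "test-court-1", slot: "2030-01-01T10:00" }),
  });
  bookingId = ((await created.json()) as { id: string }).id;

  const cancelled = await fetch(`${BASE_URL}/bookings/${bookingId}/cancel`, {
    method: "POST",
  });
  expect(cancelled.status).toBe(200);
});
```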
When a generated test fails, the assistant may “fix” it by weakening the test itself. That can keep the pipeline green while the bug is still there.
{: .prompt-warning }
A prompt line that helps me:
Do not change tests just to make them pass. Fix production code first.
If you change a test, explain why the original expectation was wrong.
5.5. Ask for verification, not just code
Many times I asked for changes and got code that simply didn’t work.
The good news is that this usually has a simple fix: explicitly ask for verification.
By default, I ask for:
- confirmation that the code works
- a short verification plan
- tests to add or update
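In practice, this often turns into a short block I append to the request, roughly like this (the wording varies by task):

```text
After implementing:
- Run the relevant tests and show me the result.
- List what you verified and what you could not verify.
- Call out any assumption I should double-check.
```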
If something fails, my rule is simple: stop, review, and iterate again with smaller scope and more context.
For delicate changes, I add a quick risk pass (security, reliability, cost, maintainability). In a minute, obvious issues usually surface.
5.6. Commit frequently
Small commits give me confidence.
I commit after every working step, before risky changes, and whenever I finish a refactor or test improvement.
If the assistant goes in a weird direction, rolling back is trivial.
Without frequent commits, the speed you gain coding turns into fear of breaking things.
5.7. Refactor with clean code, after it works
The AI is great at rewriting code, but I want correctness first.
The workflow that works for me:
- Get correct behavior
- Automate checks
- Refactor with clean code principles
When the output looks messy, I explicitly ask:
"apply clean code principles and justify any refactor you propose."
Clean Code principles aim for readable, maintainable, and efficient code: clear, descriptive names; small functions that do one thing; DRY (Don’t Repeat Yourself); consistent formatting; proper error handling; and solid unit tests. The Boy Scout Rule (“leave the code cleaner than you found it”) matters a lot, together with the SOLID principles for class design and good dependency management.
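A tiny TypeScript example of the kind of refactor I mean (the domain and names are invented; the behavior stays the same, only the intent becomes explicit):

```typescript
interface Player {
  level: number;
  hasPaidMembership: boolean;
}

interface Match {
  minLevel: number;
}

// Before: correct, but the intent is buried in one dense condition.
function check(p: Player, m: Match): boolean {
  return p.level >= m.minLevel && p.hasPaidMembership === true;
}

// After: same behavior, but each rule has a name and can evolve independently.
const hasRequiredLevel = (player: Player, match: Match): boolean =>
  player.level >= match.minLevel;

const hasActiveMembership = (player: Player): boolean => player.hasPaidMembership;

function canJoinMatch(player: Player, match: Match): boolean {
  return hasRequiredLevel(player, match) && hasActiveMembership(player);
}
```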
5.8. A lightweight end-of-loop checklist
Before merging (or even before stopping for the day), I go through this checklist:
- Did I review the changes?
- Does my code compile?
- Do tests cover critical paths?
- Do my tests work?
- Does my code follow the lint rules I have defined?
- Did I keep changes scoped?
- Did I commit?
- Did I verify assumptions?
It sounds basic, but it prevents most of tomorrow’s problems.
5.9. Master one tool and configure it well
Choosing one main tool and configuring it properly completely changed my experience.
Constantly switching tools made me lose context and, more importantly, my rules: style, limits, and how I ask for changes.
Once I had one tool properly configured (persistent rules, templates, shortcuts), quality jumped and friction dropped: less re-explaining context, fewer surprises.
I’ll write a dedicated post about Amazon Kiro and its SPEC mode later, because that’s where this really shines.
If your IDE supports persistent rules (project rules, custom instructions, workspace guidelines), put your defaults there so every request starts from the same baseline.
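For example, these are the kinds of defaults I keep as persistent rules (adapt the wording and the file to whatever your tool expects; this is a sketch, not an exact copy of my setup):

```text
- Keep diffs minimal; don't refactor code outside the requested scope.
- Don't add new dependencies or services without asking first.
- Don't touch IAM, infrastructure, or configuration unless explicitly requested.
- Propose tests for any new behavior and explain how to run them.
- Never change existing tests just to make them pass.
```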
5.10. Keep the right mental state when working with AI
When things don’t go as expected, it’s easy to get frustrated and respond impulsively. That almost always makes things worse.
I’ve learned that output quality depends heavily on my mental state.
When I’m tired, frustrated, or in a rush, it shows immediately in my prompts and results.
What I do now
- Stop when I notice frustration.
- Rewrite the prompt calmly with more context.
- Treat the assistant as a predictable tool, not something that “understands” me.
It sounds minor, but it breaks many bad loops before they escalate.
6. Final thoughts
AI-assisted coding changes speed dramatically.
But speed without control only amplifies existing problems.
When you work with real systems, especially in the cloud, small mistakes can quickly turn into reliability, security, or cost issues.
Much of what I share here was refined as I iterated on my personal project https://playingpadel.es.
The most important lesson I’ve learned over these months is simple: the tool matters less than the method.
If I had to summarize the whole article in one idea, it would be this: keep the loop healthy. Small changes, constant validation, conscious decisions.
Developers still build systems. AI changes the speed.
Used well, it doesn’t replace good engineering. It amplifies it.




