A field note on pairing with Codex inside Cursor: where it helps, where it
surprises you, and why the experience feels different from "just chatting
with AI in a sidebar."
There are a lot of AI coding demos that make the whole thing look effortless.
Open editor. Type one sentence. Watch a perfect feature appear. Real work is
not like that.
Using Codex inside Cursor feels less like summoning a magic trick and more
like working with a very fast, very tireless junior-to-mid engineer who can
read an absurd amount of code in a short time, propose reasonable changes, and
keep momentum when you'd normally get stuck context-switching.
That is the good version of the experience. The important part is that it is
still a working relationship, not autopilot.
This post is about what that relationship actually feels like in practice.
1. The biggest change is not code generation. It is friction removal.
The obvious pitch is "it writes code."
That is true, but it misses the real value.
What changed most for me was the amount of friction removed from the small,
boring, mentally expensive parts of software work:
- tracing where a component is used
- finding the right file before changing anything
- understanding the shape of a feature before touching it
- carrying a refactor across multiple files without losing the thread
- translating "I know what I want, but I don't want to type all of it" into a first draft
That matters because most engineering time is not spent inventing brand new
algorithms. It is spent navigating a codebase, preserving intent, and avoiding
mistakes while making changes under time pressure.
Codex is unusually helpful when the task is more annoying than novel.
2. Cursor makes the experience feel native
The reason it works well inside Cursor is that the model is not floating off in
some separate chat universe. It is close to the code, the files, the diffs, and
the actual project structure.
That sounds like a small UX detail, but it changes behavior:
- you ask for changes in the context of the code you are already reading
- you review edits where you would review any other edit
- you keep moving instead of copy-pasting between tools
- you can tighten the loop from "idea" to "diff" to "sanity check"
The result feels less like consulting an assistant and more like pairing inside
the editor you are already using to think.
That tight loop is the part I underestimated.
3. It is best when you already know the direction
Codex shines brightest when the destination is clear, even if the path is not.
Good prompts look like:
- "Make this admin dashboard responsive without changing the visual style."
- "Refactor this form builder so the mobile layout stops overflowing."
- "Trace why this route is crashing and patch the smallest safe fix."
In other words: clear outcome, real constraints, actual codebase.
When I know what "done" looks like, Codex is excellent at helping me get there
faster. It can search, summarize, patch, and verify without needing me to carry
every detail in my head at once.
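To make "smallest safe fix" concrete, here is a hypothetical sketch of the kind of patch that prompt tends to produce: a route handler that crashed on missing data, repaired with a narrow fallback instead of a broader refactor. The type, function, and paths are all invented for illustration.

```typescript
// Hypothetical "smallest safe fix": the handler crashed whenever
// `user.profile` was undefined. The scoped patch is optional chaining
// plus a fallback, not a rewrite of the user model.
type User = { name: string; profile?: { avatarUrl: string } };

function renderAvatar(user: User): string {
  // Before: `user.profile.avatarUrl` threw for users without a profile.
  // After: shrink to a default instead of crashing the route.
  return user.profile?.avatarUrl ?? "/img/default-avatar.png";
}
```

The point is the shape of the change: one line, no new abstractions, easy to review in a diff.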
When I do not know what I want, the output gets fuzzier. That is not really
an AI problem. It is a specification problem wearing an AI costume.
If your instruction is vague, you usually get one of three things:
- a plausible but generic answer
- a technically correct change that misses the product intent
- too much change in the wrong places
So the workflow that works best is:
- decide the outcome
- state the constraints
- let Codex do the heavy lifting
- review like an engineer, not a spectator
4. The best sessions feel like pairing, not prompting
This was the most surprising part.
The strongest results came when I treated Codex like a collaborator:
- give it context
- tell it what matters
- let it inspect the codebase
- ask it to explain what it found
- refine the direction
- then let it implement
That is different from throwing a one-line command at the wall and hoping.
The quality jump between "fix this" and "make this responsive across admin
routes, preserve shadcn usage, avoid backend changes, and keep the current
visual language" is enormous.
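To sketch what "make it responsive, keep the visual language" looks like in a diff, here is a hypothetical helper in the spirit of that prompt: existing desktop Tailwind classes are kept behind an `md:` prefix while mobile-first overrides are layered in front. The function and every class name are invented for illustration; a real change would just edit class strings in place.

```typescript
// Hypothetical sketch: preserve the desktop look at the breakpoint,
// add a mobile variant in front of it. Nothing visual is removed.
function addMobileVariant(
  desktopClasses: string,
  mobileOverrides: Record<string, string>,
): string {
  return desktopClasses
    .split(/\s+/)
    .map((cls) =>
      cls in mobileOverrides ? `${mobileOverrides[cls]} md:${cls}` : cls,
    )
    .join(" ");
}

// "grid-cols-4 p-8" becomes "grid-cols-1 md:grid-cols-4 p-4 md:p-8":
// stacked and tighter on phones, identical to before on desktop.
const patched = addMobileVariant("grid-cols-4 p-8", {
  "grid-cols-4": "grid-cols-1",
  "p-8": "p-4",
});
```

A reviewer can verify at a glance that every original class survives, which is exactly what "keep the current visual language" was asking for.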
The model does better when the collaboration is better.
That turns out to be true for humans too.
5. It is especially strong at keeping momentum
There is a kind of fatigue that comes from switching between debugging,
searching, writing, verifying, and remembering what you were doing before the
interruption.
Codex helps most when momentum is at risk.
A few examples:
- You hit an unfamiliar part of the repo. It can map the surrounding files fast.
- You notice a UI bug across several routes. It can carry the pattern through consistently.
- You fix one issue and uncover another. It can keep the thread alive across follow-up changes.
- You need a first draft of copy, docs, or cleanup after the code is done. It can finish the loop.
This is where the experience feels less like "AI wrote code for me" and more
like "I stayed in flow longer than I normally would have."
That is a meaningful difference.
6. It still needs taste, judgment, and boundaries
This is the part people either undersell or overstate.
Codex is not a substitute for engineering judgment.
It can absolutely produce changes that are:
- too broad
- too clever
- stylistically off for the repo
- locally correct but globally awkward
- missing an edge case you care about
That means you still need to do the adult parts of the job:
- define the task well
- review diffs
- run checks
- reject unnecessary complexity
- keep the architecture coherent
The win is not that judgment disappears.
The win is that your judgment gets spent on higher-value decisions instead of
mechanical effort.
7. The emotional experience is underrated
There is also a more human side to this.
Sometimes you are tired. Sometimes the bug is small but irritating. Sometimes
you know the fix is straightforward, but you do not want to spend twenty minutes
walking through files just to confirm where the change belongs.
Having Codex in Cursor makes those moments lighter.
You can ask:
- "Find where this state is coming from."
- "Why is this layout overflowing on mobile?"
- "Patch this, but keep the current patterns."
- "Summarize this file before we touch it."
That reduces the activation energy required to start.
And once you start, progress tends to create more progress.
That may be the most practical value of all.
8. Where it feels weak
To be fair, there are still rough edges.
The experience degrades when:
- the request is under-specified
- the codebase is inconsistent
- the change requires product decisions no one has made
- you stop reviewing because the first draft looked convincing
It also has the classic assistant problem of confidence. A wrong answer can
still sound smooth. That means "looks professional" is not a valid acceptance
criterion.
You still need to verify behavior.
In other words: it is a force multiplier, not a waiver form.
9. So what is it actually like?
The shortest honest answer is:
Using Codex inside Cursor feels like pair programming with a teammate who is
extremely fast at codebase traversal, never gets tired of first drafts, and is
most useful when you give it clear goals and solid constraints.
It is not magic.
It is not full autonomy.
It is not the end of engineering.
It is a very good acceleration layer for real software work.
And once you get used to having that kind of help directly inside the editor,
going back to a slower, more fragmented workflow starts to feel unnecessarily
expensive.
Final thought
The real promise of tools like this is not that they replace the engineer.
It is that they reduce wasted effort.
Less time hunting.
Less time scaffolding.
Less time losing momentum.
More time deciding what should exist, what should change, and what "good" means
for the product in front of you.
That is what using Codex inside Cursor has felt like for me:
not automated genius, just better leverage.