Most local models impress me for about 10 minutes.
Then the context starts breaking, responses become repetitive, and debugging turns into prompt wrestling.
Gemma 4 was the first open model where I didn’t hit that wall immediately.
I tested it against a real Rails codebase instead of toy examples, and it was surprisingly good at:
tracing Sidekiq flows
finding duplicated logic
explaining legacy code
spotting missing indexes
The reasoning mode especially made the responses feel less like autocomplete and more like actual step-by-step analysis.
Not perfect.
Still weaker than larger cloud models.
But honestly, much more practical than I expected from a local setup.
Top comments (0)