AppRecode

Posted on May 29

Is Vibe Coding Bad? What Engineering Teams Actually Need to Know

#vibecoding

The question is not new anymore, but it is being asked more seriously. A developer shares a tweet about shipping a working app in two hours using nothing but prompts. A CTO reads about a startup that replaced half its engineering sprint with an AI agent. And then, a few weeks later, someone on the same team finds a hardcoded API key in the production codebase - generated, merged, and deployed without a single human reading the line.

Is vibe coding bad? The honest answer is: it depends on what your team has built around it. And for most teams, the answer to that follow-up question is not encouraging.

This article is not a defense of vibe coding and not an attack on it. It is an attempt to give engineering leaders an accurate picture of where the risks actually come from, when the approach genuinely makes sense, and what separates teams that use AI-assisted development well from teams that accumulate silent technical debt until something breaks.

What "Vibe Coding" Actually Means - and What It Does Not

The term came from Andrej Karpathy, who described it as a mode of development where you "fully give in to the vibes" - describing what you want to the model, accepting the output, and iterating through prompts rather than writing code line by line. The idea was not that LLMs would replace engineering judgment. It was that for certain tasks, intent-driven generation could move faster than traditional implementation.

Since then the term has expanded to cover almost everything involving AI code generation: Cursor sessions, GitHub Copilot completions, agentic frameworks that scaffold entire features from a single prompt, and full product prototypes built over a weekend. That range matters when evaluating whether vibe coding is "bad" - because a senior engineer using Copilot to draft a test suite and a junior developer asking Claude to build an authentication module and shipping it without review are both "vibe coding" in the loose sense, and they carry completely different risk profiles.

The definition worth working with: vibe coding is AI-assisted development where the human describes intent and accepts generated output, with varying levels of review and validation before that output reaches production. The risk is not in the generation. The risk is in everything that happens - or does not happen - between generation and deployment.

Where Vibe Coding Actually Goes Wrong

Code Quality and Technical Debt

LLMs generate code that looks correct. It usually compiles. It often runs. What it does not do is understand your existing architecture, your team's naming conventions, your module boundaries, or the three design decisions made six months ago that constrain how a new feature should be built.

The result is code that works in isolation and causes problems in context. Teams report the same pattern repeatedly: AI-generated modules that function correctly but do not fit how the rest of the system is structured, forcing rewrites or workarounds when the next feature touches the same area. The debt is invisible until it is not.

The deeper problem is ownership. When a developer writes a hundred lines of code, they understand it - at least at the time of writing. When an LLM generates five hundred lines and a developer accepts the output with light review, the team has shipped code that nobody fully understands. Debugging it later, extending it, or explaining it to a new hire costs more than the original time saved.

Security Vulnerabilities

This is the area where the risks are most documented and most serious. LLMs generate insecure code patterns with regularity - not because the models are malicious, but because they are trained on a large corpus of code that includes insecure patterns, and because they have no native awareness of the security context of the codebase they are writing for.

The specific failure modes are consistent: hardcoded credentials, missing input validation, SQL injection exposure through string interpolation, insecure deserialization, overpermissioned IAM roles in infrastructure code, and missing authentication checks on API endpoints. The NIST Secure Software Development Framework explicitly addresses AI-generated code as an emerging risk category in the secure SDLC. The OpenSSF Scorecard framework similarly flags automated code generation as a supply chain risk factor when not accompanied by appropriate review controls.

The OWASP Top 10 categories - injection, broken authentication, security misconfiguration - appear in AI-generated code with enough frequency that treating LLM output as automatically safe is a documented error. AppRecode's analysis of vibe coding security risks covers specific vulnerability patterns in more detail.

Accountability and Ownership

When something breaks in production, two questions matter immediately: what caused it, and who understands the code well enough to fix it. AI-generated code complicates both.

Code review exists partly for functional reasons - catching bugs before they ship - and partly for knowledge transfer: making sure more than one person understands what was built and why. Vibe coding workflows frequently skip or compress the review step, which means production incidents involving AI-generated code often surface in environments where nobody has context on the affected module.

For regulated industries, this is more than a process concern. Undocumented, untested, unreviewed code in a production environment is a compliance exposure. Healthcare, fintech, and SaaS products serving enterprise customers increasingly face audit requirements that assume human review of production code - requirements that vibe coding workflows, by design, may not satisfy.

Skill Erosion in Junior Teams

Senior engineers who use AI-assisted development as a starting point, review the output critically, rewrite what does not fit, and understand the generated code before approving it are using the tool appropriately. That is not the only pattern.

Junior developers in vibe coding workflows may ship code they cannot explain, debug, or extend - and may not realize this is a problem until a production incident forces the question. The skill gap between "can generate code" and "can own code in a production system" is real, and AI generation does not close it. In some cases it widens it by removing the practice opportunities that build the underlying understanding.

When Vibe Coding Is Not Bad

The critique above is genuine, but so is the use case. Vibe coding is not bad in absolute terms. It is risky in specific contexts, and well-suited in others.

Rapid prototyping and MVP development is where it performs best. When the goal is to validate a product hypothesis with a working prototype, code quality and long-term maintainability are secondary to speed. A two-day prototype that proves product-market fit before a team spends three months building the wrong thing is valuable - and the code quality of that prototype is irrelevant if the hypothesis fails.

Internal tooling, scripts, and one-off utilities are similarly appropriate. Code that will be used by a few people, maintained by one team, and replaced when requirements change does not need the same standards as code in a customer-facing production system.

Senior engineers using it as a starting point - generating a scaffold, a draft implementation, or a test suite that they then critically review and adapt - get real velocity benefits without the accountability gap. The human is still in the loop at the level that matters.

The transition sentence between concern and legitimacy is this: vibe coding is not bad. Vibe coding without the engineering foundation to catch what AI-assisted development gets wrong is bad. The teams that use it well were already doing other things well first.

The Infrastructure Gap Most Teams Miss

This is the core of the actual problem. The risks of vibe coding are not primarily generation risks. They are delivery pipeline risks.

Teams with strong CI/CD infrastructure, automated testing coverage, static analysis and dependency scanning, and consistent code review processes can absorb AI-generated code because every line passes through the same gates as human-written code. The pipeline does not care whether a function was written by a developer or an LLM - it runs the tests, checks the security patterns, and blocks the merge if something fails.

Teams without that infrastructure are exposed regardless of whether they use vibe coding. AI-assisted development does not create the gap. It makes the gap more consequential, because it increases the volume and speed of code generation faster than human review can scale.

The DORA State of DevOps Report has documented for years that elite-performing engineering teams achieve both speed and stability simultaneously - not through trade-offs, but through strong delivery foundations. When AI-assisted development is layered on top of those foundations, teams capture the velocity benefits without proportionally increasing risk. When it is layered on top of weak foundations, the velocity benefit accrues immediately and the risk exposure builds slowly until it surfaces in a difficult production incident.

DevSecOps - embedding security scanning and automated validation into the delivery pipeline rather than treating security as a pre-launch review step - becomes more important in vibe coding workflows, not less. The AWS Well-Architected Framework's operational excellence pillar makes the same argument from a different direction: automation of feedback and validation is what makes fast delivery sustainable.

Vibe Coding vs Traditional Coding - The Frame That Actually Helps

The comparison that matters is not "vibe coding vs traditional coding." Both have failure modes. Both have legitimate contexts. The comparison that helps engineering leaders make decisions is between AI-assisted development with proper controls and AI-assisted development without them.

Dimension	Vibe Coding Without Controls	Traditional Development	Vibe Coding With Controls
Speed	High	Lower	High
Code quality	Variable	High	High
Security exposure	High	Manageable	Manageable
Long-term maintainability	Low	High	High
Best suited for	Throwaway prototypes	Production systems	Production systems at speed

AppRecode's breakdown of vibe coding vs traditional coding covers the comparison in more operational depth - particularly for teams trying to decide where to draw the boundary in a hybrid workflow.

AppRecode: Helping Teams Use AI-Assisted Development Safely

The risks around vibe coding are real, but most of them are infrastructure problems, not AI problems. Teams that already have strong CI/CD pipelines, automated security scanning, and clear code review processes find AI-assisted development genuinely useful. Teams without those foundations expose themselves to significant technical debt and security risk the moment AI-generated code reaches production at speed.

AppRecode works with engineering teams to build and optimize the delivery infrastructure that makes modern development - including AI-assisted workflows - safe and scalable. The company has completed 50+ projects since 2019, with 30+ engineers working across DevOps, cloud infrastructure, and AI-assisted development.

Services directly relevant to teams evaluating or already using vibe coding:

Vibe coding development services - structured AI-assisted product development with proper engineering controls built into the workflow from the start, not added afterward.

DevOps development and consulting - CI/CD pipeline optimization, security scanning integration, infrastructure automation, and the delivery foundations that determine whether fast development stays safe development.

DevSecOps implementation - embedding security into the pipeline rather than treating it as a review gate before launch.

AppRecode's approach starts with understanding the current state: where the team is generating risk, where the bottlenecks are, and what a practical improvement path looks like. The portfolio covers projects across startup and scale-up environments where delivery speed and reliability both needed to improve.

The goal is not to restrict how teams work. It is to make the infrastructure capable of supporting how they want to work.

What Engineering Teams Should Actually Do

If your team is evaluating vibe coding or already using it at scale, these are the practical starting points:

Audit the delivery pipeline first. Before expanding AI-assisted development, understand what your current CI/CD coverage, automated testing depth, and security scanning capabilities actually are. Vibe coding amplifies existing gaps.

Define where AI-generated code is and is not permitted. Prototype code and production code have different standards. Make that distinction explicit. "We use AI assistance for scaffolding and draft implementations, which require full review before merge" is a policy. "We use AI assistance" is not.

Treat code review as non-negotiable for AI output. The review step is where human judgment catches what generation gets wrong. Compressing or skipping it to preserve the speed benefit of vibe coding is where the risk accumulates.

Implement automated static analysis and dependency scanning. Tools like Semgrep, Snyk, or Dependabot do not require any additional developer effort in the review cycle - they run in the pipeline and surface problems before a human needs to catch them manually.

Be deliberate about junior developer exposure. AI generation is most valuable when the developer using it can evaluate the output critically. Build that expectation into your process rather than discovering it is missing after a production incident.

Conclusion

Vibe coding is not inherently bad. The question the term triggers - should engineering teams be concerned? - has an honest answer: yes, under specific conditions, and for specific reasons that are well understood.

The teams hurt most by vibe coding are teams where the delivery infrastructure was already thin. The teams that benefit most are teams where CI/CD, code review, and security scanning were already working - and who now get to add velocity without adding proportional risk.

The shift worth making is not toward or against AI-assisted development. It is toward the engineering foundations that determine whether any development approach - AI-assisted or traditional - produces software that holds up.

If your team is adopting AI-assisted development and wants to make sure the delivery infrastructure can support it safely - or if vibe coding has already created technical debt you need to address - AppRecode can help assess the current state and design a practical path forward.

FAQ

Is vibe coding suitable for production applications?

With proper controls, yes. AI-generated code can reach production safely when it passes through code review, automated testing, and security scanning. Teams that deploy vibe-coded output directly without these steps take on technical debt and security risk that compounds over time.

What are the biggest security risks of vibe coding?

The most consistent failure modes are hardcoded credentials, missing input validation, SQL injection vulnerabilities, and overpermissioned infrastructure configurations. LLMs generate plausible-looking code that does not account for security context unless explicitly constrained - and even then requires review.

Is vibe coding bad for junior developers specifically?

It carries real risks for junior developers who cannot critically evaluate AI output. Without the underlying knowledge to judge what was generated, juniors can ship code they do not understand - creating debugging and maintenance problems that become visible later. Senior oversight and mandatory review significantly reduce this exposure.

When does vibe coding actually make sense?

Prototyping, MVP development, internal tooling, and exploratory work - contexts where speed matters more than long-term maintainability and where a developer with domain knowledge reviews and refines the output. It is least appropriate for security-sensitive modules, core business logic, and code that will be maintained by multiple teams.

What should teams do before scaling vibe coding?

Audit CI/CD coverage, testing depth, and security scanning capabilities. Define where AI-generated code is and is not permitted. Make code review non-negotiable for all AI output. Implement automated static analysis. The delivery infrastructure should be in place before velocity increases, not built in response to the problems that follow.

Top comments (1)

Harjot Singh • May 31

The honest answer to "is vibe coding bad" is that it's a power tool, dangerous in proportion to how much you skip the verify step. For a throwaway prototype it's pure upside; for production with a team it goes bad exactly when people ship the plausible-looking diff without understanding it. The teams that use it well treat AI output like a junior's PR: review it, test it, own it, not paste-and-pray. The tool isn't the risk, the missing verification discipline is. That's the line I build into Moonshift, generate freely but gate on proof-it-works before anything ships. What's your team's rule for what AI-generated code has to clear before it merges?