<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Bryan |</title>
    <description>The latest articles on DEV Community by Bryan | (@bryanpage).</description>
    <link>https://dev.to/bryanpage</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3835882%2F896a0409-9b17-429a-b26a-6c4bfe32e406.jpg</url>
      <title>DEV Community: Bryan |</title>
      <link>https://dev.to/bryanpage</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/bryanpage"/>
    <language>en</language>
    <item>
      <title>AI Agents Are Lying to You</title>
      <dc:creator>Bryan |</dc:creator>
      <pubDate>Fri, 15 May 2026 04:10:57 +0000</pubDate>
      <link>https://dev.to/bryanpage/ai-agents-are-lying-to-you-22o7</link>
      <guid>https://dev.to/bryanpage/ai-agents-are-lying-to-you-22o7</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffurn3xvxejdi7eukywya.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Ffurn3xvxejdi7eukywya.jpg" alt=" " width="784" height="1168"&gt;&lt;/a&gt; Every AI coding tool on the market has the same pitch. Describe what you want and we'll build it. Cursor, Copilot, Devin. They all promise autonomous code generation. And they all have the same problem.&lt;/p&gt;

&lt;p&gt;You can't verify what they did.&lt;/p&gt;

&lt;p&gt;They generate code. Sometimes it works. Sometimes it doesn't. But you never actually know why it worked, what decisions were made along the way, or whether the output matches what you asked for. You're trusting a black box with your codebase.&lt;/p&gt;

&lt;p&gt;That's not autonomy. That's hope.&lt;/p&gt;

&lt;p&gt;The Verification Problem:&lt;br&gt;
Here's what happens when you use a typical AI coding agent. You write a prompt. The agent generates code. You read through it, maybe. You ship it, probably.&lt;/p&gt;

&lt;p&gt;That third step is where everything falls apart. You're reviewing AI generated code with human eyes, trying to catch mistakes in logic you didn't write. It's like proofreading a legal contract in a language you half speak. You'll catch the obvious errors. You'll miss the ones that matter.&lt;/p&gt;

&lt;p&gt;And the agent won't tell you what it got wrong. It can't. It doesn't have a verification layer. It generated output and moved on. There's no audit trail. No execution log. No proof that the code it wrote actually satisfies the intent you described.&lt;/p&gt;

&lt;p&gt;If you can't audit it, you don't own it.&lt;/p&gt;

&lt;p&gt;Context Blind Execution&lt;br&gt;
The deeper issue is context. Current AI agents operate without persistent awareness of what they've already done, what failed, or why. Every prompt is a fresh start. Every session is amnesia.&lt;/p&gt;

&lt;p&gt;The same mistake gets made across runs because there's no memory of past failures. There's no way to trace why a decision was made three steps ago. When something breaks, you're debugging code you didn't write with zero execution history.&lt;/p&gt;

&lt;p&gt;It's not that these tools are useless. They're genuinely fast at generating boilerplate. But speed without verification is just technical debt with extra steps.&lt;/p&gt;

&lt;p&gt;What Verifiable Execution Looks Like&lt;br&gt;
I'm building BuildOrbit to solve this. It's a verifiable execution runtime for AI agents. Every action the agent takes is logged, traceable, and auditable.&lt;/p&gt;

&lt;p&gt;The architecture is built on three layers of truth.&lt;/p&gt;

&lt;p&gt;Intent Truth. What you actually asked for. Your prompt is parsed into a structured intent that becomes the canonical reference for the entire run. Not a suggestion. A contract.&lt;/p&gt;

&lt;p&gt;Execution Truth. What the agent actually did. Every phase of the pipeline is recorded. What code was generated, what decisions were made, what was verified and what wasn't. This is the authoritative record. If there's a conflict between what the agent said it did and what actually happened, the execution log wins.&lt;/p&gt;

&lt;p&gt;Reality Truth. What actually shipped. The final deployed state is compared against intent and execution. Did the output match the request? Can you prove it?&lt;/p&gt;

&lt;p&gt;Each layer checks the others. The agent can't silently hallucinate a feature, skip a requirement, or paper over a failure. If something goes wrong, you know exactly where, when, and why.&lt;/p&gt;

&lt;p&gt;Why This Matters&lt;br&gt;
This isn't academic. If you're building anything real with AI agents, anything that touches production, handles user data, or needs to work reliably, you need to be able to answer one question.&lt;/p&gt;

&lt;p&gt;Can you prove your agent did what you asked?&lt;/p&gt;

&lt;p&gt;Right now, with every major AI coding tool, the answer is no. You can look at the output and guess. You can run tests after the fact. But you can't trace the decision chain from intent to execution to deployment.&lt;/p&gt;

&lt;p&gt;BuildOrbit makes that traceable. Every run produces a complete audit trail. When something fails, you see the phase it failed at, the reasoning the agent used, and the exact point where execution diverged from intent.&lt;/p&gt;

&lt;p&gt;No black boxes. No blind trust. No "it works on my machine."&lt;/p&gt;

&lt;p&gt;The Honest Version&lt;br&gt;
I'm one person. BuildOrbit is pre revenue. I don't have a team or a Series A or a wall of testimonials. I'm building this in public because I think the problem is real and the current solutions aren't solving it.&lt;/p&gt;

&lt;p&gt;I'm not claiming to have reinvented software engineering. I'm saying that if we're going to let AI agents write our code, we should at minimum be able to verify what they wrote and why.&lt;/p&gt;

&lt;p&gt;That bar is shockingly low. And almost nobody is clearing it.&lt;/p&gt;

&lt;p&gt;If you want to see it in action: buildorbit.polsia.app&lt;/p&gt;

</description>
      <category>ai</category>
      <category>webdev</category>
      <category>productivity</category>
      <category>opensource</category>
    </item>
    <item>
      <title>The Problem With Modern Development: We're Deploying Code We Don't Truly Understand</title>
      <dc:creator>Bryan |</dc:creator>
      <pubDate>Fri, 20 Mar 2026 18:58:08 +0000</pubDate>
      <link>https://dev.to/bryanpage/peeling-back-the-black-box-of-github-deployments-3me1</link>
      <guid>https://dev.to/bryanpage/peeling-back-the-black-box-of-github-deployments-3me1</guid>
      <description>&lt;h2&gt;
  
  
  Modern development has a subtle problem:
&lt;/h2&gt;

&lt;p&gt;We can deploy faster than we can explain what actually happened.&lt;/p&gt;

&lt;p&gt;Push to GitHub, wait a minute, and your app is live. Platforms like Vercel and Netlify made that experience feel effortless. For small projects, that magic is great. It removes friction and lets you ship.&lt;/p&gt;

&lt;p&gt;But as systems grow, that same magic starts to work against you.&lt;/p&gt;

&lt;p&gt;Builds fail without clear explanations. Runtime behavior drifts from what you expected. Logs feel incomplete. Configuration exists somewhere, but not somewhere you can actually reason about.&lt;/p&gt;

&lt;p&gt;At that point, developers end up doing the same three things:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;digging through proprietary logs&lt;/li&gt;
&lt;li&gt;guessing at the real build environment&lt;/li&gt;
&lt;li&gt;fighting configuration they cannot fully see&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is the problem Hostack is being built to solve.&lt;/p&gt;

&lt;h2&gt;
  
  
  Hostack: Push-to-Deploy Without the Black Box
&lt;/h2&gt;

&lt;p&gt;Hostack is a deploy-from-GitHub platform designed to keep the simplicity developers want while restoring the visibility they need.&lt;/p&gt;

&lt;p&gt;The goal is not to make deployments feel harder.&lt;/p&gt;

&lt;p&gt;The goal is to make them understandable.&lt;/p&gt;

&lt;p&gt;With Hostack, you still get a fast GitHub-driven deployment flow, but you can also see:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;how the project was detected&lt;/li&gt;
&lt;li&gt;which build environment ran&lt;/li&gt;
&lt;li&gt;what commands executed&lt;/li&gt;
&lt;li&gt;what artifact was produced&lt;/li&gt;
&lt;li&gt;what got promoted live&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In other words: the convenience stays, but the black box goes away.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Core Flow
&lt;/h2&gt;

&lt;p&gt;The Hostack workflow is intentionally simple:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Repo Hook&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
A GitHub Action or webhook triggers a deployment on push.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Smart Detection&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Hostack analyzes the repo to detect the framework, package manager, runtime, and likely build plan.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Job Queuing&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The deployment is queued and assigned to a worker instead of being executed inline by the API.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Transparent Build&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The app is built in an isolated, ephemeral environment with observable logs and deterministic inputs.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Global Deployment&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The resulting artifact is promoted to the edge or a runtime environment.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Simple on the surface. Much more understandable underneath.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;Most deployment pain does not come from shipping code.&lt;/p&gt;

&lt;p&gt;It comes from not knowing why a deployment behaved the way it did.&lt;/p&gt;

&lt;p&gt;That usually shows up in three places:&lt;/p&gt;

&lt;h2&gt;
  
  
  1. Framework Detection Should Help, Not Hide
&lt;/h2&gt;

&lt;p&gt;One of the first design questions behind Hostack was:&lt;/p&gt;

&lt;p&gt;How smart should the platform be?&lt;/p&gt;

&lt;p&gt;If detection is entirely heuristic, you get convenience but lose trust. The system becomes another opaque platform making decisions on your behalf.&lt;/p&gt;

&lt;p&gt;If configuration is entirely manual, you get clarity but lose speed.&lt;/p&gt;

&lt;p&gt;So Hostack uses a hybrid approach.&lt;/p&gt;

&lt;p&gt;By default, it inspects your &lt;code&gt;package.json&lt;/code&gt;, lockfiles, and framework config files to infer:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;framework&lt;/li&gt;
&lt;li&gt;package manager&lt;/li&gt;
&lt;li&gt;install command&lt;/li&gt;
&lt;li&gt;build command&lt;/li&gt;
&lt;li&gt;runtime type&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But that detection is not the final word.&lt;/p&gt;

&lt;p&gt;Projects can also define a &lt;code&gt;hostack.yaml&lt;/code&gt; file that acts as the source of truth when you need explicit control. That lets the platform stay fast by default without becoming mysterious.&lt;/p&gt;

&lt;p&gt;Transparency means you should never have to ask:&lt;/p&gt;

&lt;p&gt;"Why did it decide to build my app this way?"&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Workers Need Clean Isolation
&lt;/h2&gt;

&lt;p&gt;Monorepos expose the next major problem quickly: environment drift.&lt;/p&gt;

&lt;p&gt;A React frontend, a Go API, a Node worker, and an n8n workflow set should not all be forced through the same generic deployment environment.&lt;/p&gt;

&lt;p&gt;Hostack handles this through ephemeral worker isolation.&lt;/p&gt;

&lt;p&gt;Instead of one bloated execution environment, the platform can assign a purpose-built builder image per job.&lt;/p&gt;

&lt;p&gt;Examples:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;React app -&amp;gt; &lt;code&gt;node:20-alpine&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;n8n workflow deployment -&amp;gt; a custom image with &lt;code&gt;n8n-cli&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;specialized pipelines -&amp;gt; builder images aligned to the actual runtime&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That gives you three important properties:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Clean&lt;/strong&gt;: each build starts in a fresh environment&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Reproducible&lt;/strong&gt;: the same image and inputs can be rerun&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Auditable&lt;/strong&gt;: you can see what executed and how&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is a much better foundation than hoping a long-lived shared environment behaves predictably.&lt;/p&gt;

&lt;h2&gt;
  
  
  3. n8n Should Be Treated Like Code
&lt;/h2&gt;

&lt;p&gt;Automation is infrastructure. It should not live outside version control as an awkward side system.&lt;/p&gt;

&lt;p&gt;That is why Hostack treats n8n as a first-class deployment target.&lt;/p&gt;

&lt;p&gt;With &lt;code&gt;ubie-oss/n8n-cli&lt;/code&gt;, workflows can live alongside application code in GitHub. That means workflow changes become reviewable, diffable, and deployable in the same system as everything else.&lt;/p&gt;

&lt;p&gt;Using flags like &lt;code&gt;--git-diff&lt;/code&gt; and &lt;code&gt;--externalize&lt;/code&gt;, Hostack can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;deploy only the workflows that changed&lt;/li&gt;
&lt;li&gt;extract complex JavaScript nodes into separate files&lt;/li&gt;
&lt;li&gt;make workflow logic easier to review in pull requests&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That moves automation out of the "hidden ops corner" and into the normal engineering workflow.&lt;/p&gt;

&lt;h2&gt;
  
  
  Where Hostack Is Going
&lt;/h2&gt;

&lt;p&gt;Hostack is not just about deploying code.&lt;/p&gt;

&lt;p&gt;It is about giving developers a clearer control plane for what happens after code leaves their machine.&lt;/p&gt;

&lt;p&gt;The next layer of work is focused on even more operational transparency:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Raw Build Logs&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Real-time, unfiltered output from workers&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Dockerfile Export&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
The exact build recipe used during deployment, so you can inspect it or run it yourself&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;PR-Level Workflow Diffs&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
A way to see exactly what changed in automation before merging&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Queue + Worker Visibility&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Clear separation between control plane orchestration and background execution&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Rollback That Actually Feels Safe&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Promotion of known-good artifacts instead of rebuilding old code and hoping for the same result&lt;/p&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Why We're Building It This Way
&lt;/h2&gt;

&lt;p&gt;The best deployment platforms made shipping fast.&lt;/p&gt;

&lt;p&gt;Hostack is trying to make fast deployments understandable again.&lt;/p&gt;

&lt;p&gt;That means:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;no pretending invisible defaults are always good enough&lt;/li&gt;
&lt;li&gt;no hiding build behavior behind convenience&lt;/li&gt;
&lt;li&gt;no treating logs and rollback as afterthoughts&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Fast is good.&lt;/p&gt;

&lt;p&gt;Fast and explainable is better.&lt;/p&gt;

&lt;h2&gt;
  
  
  Join the Discussion
&lt;/h2&gt;

&lt;p&gt;Hostack is being built in the open, and the discussion is public:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/bry92/Hostack-Deploy/discussions/1#discussion-9699264" rel="noopener noreferrer"&gt;https://github.com/bry92/Hostack-Deploy/discussions/1#discussion-9699264&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If you've ever been frustrated by a deployment platform that felt magical right up until it broke, that's exactly the problem this project is trying to solve.&lt;/p&gt;

</description>
      <category>cicd</category>
      <category>devops</category>
      <category>github</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
