<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Savinu T Vijay</title>
    <description>The latest articles on DEV Community by Savinu T Vijay (@savinuvijay).</description>
    <link>https://dev.to/savinuvijay</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3970946%2F3bad189b-354d-4e38-927c-4c76d4d543e8.jpeg</url>
      <title>DEV Community: Savinu T Vijay</title>
      <link>https://dev.to/savinuvijay</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/savinuvijay"/>
    <language>en</language>
    <item>
      <title>What If LLMs Were Just the CPU? Rethinking AI Systems as Programs</title>
      <dc:creator>Savinu T Vijay</dc:creator>
      <pubDate>Mon, 08 Jun 2026 05:01:21 +0000</pubDate>
      <link>https://dev.to/savinuvijay/what-if-llms-were-just-the-cpu-rethinking-ai-systems-as-programs-35h6</link>
      <guid>https://dev.to/savinuvijay/what-if-llms-were-just-the-cpu-rethinking-ai-systems-as-programs-35h6</guid>
      <description>&lt;p&gt;Most AI frameworks today place the language model at the center of the system and everything revolves around the LLM.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;Need knowledge? Add RAG.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Need external actions? Add tools.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Need memory? Add a memory layer.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Need autonomy? Add agents.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The result often looks something like this:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqxvzoot2ihd00z4qjeaq.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqxvzoot2ihd00z4qjeaq.png" alt="Traditional LLM Setup" width="627" height="295"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The model becomes the orchestrator, planner, router, and execution engine all at once.&lt;/p&gt;

&lt;p&gt;While building AI applications over the past years, I started wondering if this was the right way to think about the problem.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;What if the LLM was not the center of the system?&lt;/em&gt;&lt;br&gt;
&lt;em&gt;What if it was simply one of several core components?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  The CPU Analogy
&lt;/h2&gt;

&lt;p&gt;In a traditional computer system, the CPU performs computation. But the CPU is not the entire computer.&lt;br&gt;
A complete system also needs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Memory&lt;/li&gt;
&lt;li&gt;Storage&lt;/li&gt;
&lt;li&gt;Input and Output&lt;/li&gt;
&lt;li&gt;Device Drivers&lt;/li&gt;
&lt;li&gt;Running Programs&lt;/li&gt;
&lt;li&gt;An Operating System&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The operating system coordinates everything and allows programs to execute using those resources.&lt;/p&gt;

&lt;p&gt;This led me to a simple thought:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;What if an LLM is just one component of an AI system, much like a CPU is just one component of a computer?&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The model performs &lt;strong&gt;reasoning&lt;/strong&gt; and &lt;strong&gt;generation&lt;/strong&gt;.&lt;br&gt;
But an AI system also needs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Knowledge retrieval&lt;/li&gt;
&lt;li&gt;State management&lt;/li&gt;
&lt;li&gt;Tool execution&lt;/li&gt;
&lt;li&gt;External integrations&lt;/li&gt;
&lt;li&gt;Workflow orchestration&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These should not be the responsibilities of the model itself. There needs to be a &lt;strong&gt;runtime&lt;/strong&gt; that is responsible for this.&lt;/p&gt;

&lt;h2&gt;
  
  
  Mapping AI Systems to Computer Systems
&lt;/h2&gt;

&lt;p&gt;The analogy started becoming surprisingly useful.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo4dsznfzx8bau0ndcrt1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo4dsznfzx8bau0ndcrt1.png" alt="Mapping AI Systems to Computer Systems" width="511" height="415"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Viewed through this lens, an AI application begins to look less like a &lt;strong&gt;prompt chain&lt;/strong&gt; and more like a &lt;strong&gt;program&lt;/strong&gt; executing over &lt;strong&gt;system resources&lt;/strong&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  A Program-Centric View
&lt;/h2&gt;

&lt;p&gt;Consider a simple &lt;strong&gt;Help Bot&lt;/strong&gt;.&lt;br&gt;
Most implementations are described as:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6lsoirt25g0elc2ur3l5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6lsoirt25g0elc2ur3l5.png" alt="Conventional Help Bot implementation" width="180" height="360"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;But another way to describe the same thing is:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fykbdncyffiwqv614ca4g.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fykbdncyffiwqv614ca4g.png" alt="Help Bot based on runtime" width="416" height="505"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The &lt;strong&gt;program&lt;/strong&gt; itself becomes the primary unit of execution, while the &lt;strong&gt;model&lt;/strong&gt; becomes just &lt;strong&gt;one of several resources&lt;/strong&gt; available to the &lt;strong&gt;runtime&lt;/strong&gt;. It may also need to use other resources such as knowledge retrieval, tool execution, or state management.&lt;/p&gt;

&lt;p&gt;This small shift in perspective has surprisingly large consequences.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why A Runtime Is Needed
&lt;/h2&gt;

&lt;p&gt;Once AI applications grow beyond a single prompt, they quickly require additional capabilities:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Multiple models&lt;/li&gt;
&lt;li&gt;Knowledge sources&lt;/li&gt;
&lt;li&gt;State&lt;/li&gt;
&lt;li&gt;Branching logic&lt;/li&gt;
&lt;li&gt;External tools&lt;/li&gt;
&lt;li&gt;Validation&lt;/li&gt;
&lt;li&gt;Reusable workflows&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;At that point, the challenge is no longer prompting.&lt;/p&gt;

&lt;p&gt;The challenge is &lt;strong&gt;orchestration&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The system needs something responsible for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Managing execution state&lt;/li&gt;
&lt;li&gt;Loading resources&lt;/li&gt;
&lt;li&gt;Executing workflow steps&lt;/li&gt;
&lt;li&gt;Handling inputs and outputs&lt;/li&gt;
&lt;li&gt;Coordinating tools and models
In other words: A &lt;strong&gt;Runtime&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  The Idea Behind GenOS
&lt;/h2&gt;

&lt;p&gt;This realization eventually led me to build GenOS.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;&lt;strong&gt;GenOS&lt;/strong&gt; is a local-first runtime for AI systems.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The goal of GenOS is not to be another prompt wrapper or agent framework. Instead, it explores what AI systems look like when they are treated as &lt;strong&gt;executable programs&lt;/strong&gt; running over a collection of resources, rather than centering everything around the language model.&lt;/p&gt;

&lt;p&gt;In &lt;strong&gt;GenOS&lt;/strong&gt;, these executable programs are represented as &lt;strong&gt;Projects&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;A GenOS &lt;strong&gt;Project&lt;/strong&gt; defines:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Inputs&lt;/li&gt;
&lt;li&gt;Outputs&lt;/li&gt;
&lt;li&gt;Workflow Graph&lt;/li&gt;
&lt;li&gt;Entry Node&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5ird7zerlar597zpnehj.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5ird7zerlar597zpnehj.png" alt="GenOS Architecture" width="800" height="533"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;Project&lt;/strong&gt; can use &lt;strong&gt;resources&lt;/strong&gt; such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Models for inference/compute&lt;/li&gt;
&lt;li&gt;Knowledge for storage&lt;/li&gt;
&lt;li&gt;State for memory&lt;/li&gt;
&lt;li&gt;Tools for external capabilities&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A &lt;strong&gt;Runtime Kernel&lt;/strong&gt; coordinates these resources and executes &lt;strong&gt;Projects&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;In this model, the &lt;strong&gt;Project&lt;/strong&gt; becomes the primary unit of execution, while models, knowledge, tools, and state become resources that the Project uses while it runs.&lt;/p&gt;

&lt;h2&gt;
  
  
  Projects as Modules
&lt;/h2&gt;

&lt;p&gt;One of the more interesting ideas that emerged during development was treating &lt;strong&gt;Projects&lt;/strong&gt; as &lt;strong&gt;reusable execution units&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;Project&lt;/strong&gt; can define &lt;strong&gt;Inputs &amp;amp; Outputs&lt;/strong&gt; and expose a workflow graph that other &lt;strong&gt;Projects&lt;/strong&gt; can invoke as a &lt;strong&gt;Module&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;This allows larger systems to be built from smaller reusable &lt;strong&gt;projects&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0nmomt5diuj3trrvbe1h.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0nmomt5diuj3trrvbe1h.png" alt="Modular approach for Customer Support PRoject" width="441" height="342"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Each GenOS &lt;strong&gt;Project&lt;/strong&gt; behaves like a program with a well-defined interface defined by its &lt;strong&gt;inputs&lt;/strong&gt; and &lt;strong&gt;outputs&lt;/strong&gt;, allowing complex systems to be composed from smaller, focused Projects.&lt;/p&gt;

&lt;h2&gt;
  
  
  Rethinking Agents
&lt;/h2&gt;

&lt;p&gt;An unexpected outcome of this design was a different perspective on agents.&lt;/p&gt;

&lt;p&gt;Many AI frameworks introduce agents as a special concept.&lt;/p&gt;

&lt;p&gt;However, if &lt;strong&gt;Projects&lt;/strong&gt; can invoke other &lt;strong&gt;Projects&lt;/strong&gt;, then a sophisticated agent can simply be a &lt;strong&gt;higher-level project&lt;/strong&gt; that coordinates other &lt;strong&gt;Projects&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;In this model:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxtu1ww6n563mbpf89xs7.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fxtu1ww6n563mbpf89xs7.png" alt="Agent = Project Composition" width="494" height="73"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;rather than a separate runtime abstraction.&lt;br&gt;
This keeps the architecture simple while still supporting complex behavior.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;The AI ecosystem currently focuses heavily on models.&lt;br&gt;
Models are important. But models are &lt;strong&gt;only one part&lt;/strong&gt; of a complete AI system.&lt;/p&gt;

&lt;p&gt;As applications become larger and more capable, concerns like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;State&lt;/li&gt;
&lt;li&gt;Knowledge&lt;/li&gt;
&lt;li&gt;Tools&lt;/li&gt;
&lt;li&gt;Reusability&lt;/li&gt;
&lt;li&gt;Orchestration
become increasingly important.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The question I wanted to explore was:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;What happens when we design &lt;strong&gt;AI systems&lt;/strong&gt; the same way we design &lt;strong&gt;software systems&lt;/strong&gt;?&lt;/em&gt;&lt;br&gt;
&lt;em&gt;Not around a single component, but around the interaction of many components coordinated by a runtime.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Looking Ahead
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;GenOS&lt;/strong&gt; is still in its early stages, but the idea continues to evolve.&lt;/p&gt;

&lt;p&gt;The goal is to provide a &lt;strong&gt;structured environment&lt;/strong&gt; in which language models can operate alongside knowledge, tools, state, and reusable workflows rather than being the center of the system.&lt;/p&gt;

&lt;p&gt;Perhaps the most interesting realization from building &lt;strong&gt;GenOS&lt;/strong&gt; has been this:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;The future of AI applications may not be about making the model responsible for everything.&lt;/em&gt;&lt;br&gt;
&lt;em&gt;It may be about building better runtimes around the model.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Because a CPU alone is not a computer.&lt;br&gt;
And an LLM alone is not an AI system.&lt;/p&gt;

&lt;h2&gt;
  
  
  Exploring the Idea
&lt;/h2&gt;

&lt;p&gt;GenOS is open source and still in its early stages, but it serves as a practical vehicle for exploring the ideas discussed in this article.&lt;/p&gt;

&lt;p&gt;If you're interested in runtime-oriented approaches to AI systems, I'd love to hear your thoughts, criticisms, and alternative perspectives.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Repository&lt;/strong&gt;: &lt;a href="https://github.com/sagenticlab/genos" rel="noopener noreferrer"&gt;https://github.com/sagenticlab/genos&lt;/a&gt; &lt;br&gt;
&lt;strong&gt;npm&lt;/strong&gt;: &lt;a href="https://www.npmjs.com/package/@sagentic/genos" rel="noopener noreferrer"&gt;https://www.npmjs.com/package/@sagentic/genos&lt;/a&gt; &lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>architecture</category>
      <category>opensource</category>
    </item>
  </channel>
</rss>
