<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Laksmana Tri Moerdani</title>
    <description>The latest articles on DEV Community by Laksmana Tri Moerdani (@ltmoerdani).</description>
    <link>https://dev.to/ltmoerdani</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F4014340%2F0baa7240-1b04-4af1-94db-12b0c988fee5.png</url>
      <title>DEV Community: Laksmana Tri Moerdani</title>
      <link>https://dev.to/ltmoerdani</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ltmoerdani"/>
    <language>en</language>
    <item>
      <title>How to Use 30+ AI Models in GitHub Copilot Chat for Free</title>
      <dc:creator>Laksmana Tri Moerdani</dc:creator>
      <pubDate>Sat, 04 Jul 2026 03:56:20 +0000</pubDate>
      <link>https://dev.to/ltmoerdani/how-to-use-30-ai-models-in-github-copilot-chat-for-free-4ifd</link>
      <guid>https://dev.to/ltmoerdani/how-to-use-30-ai-models-in-github-copilot-chat-for-free-4ifd</guid>
      <description>&lt;p&gt;Copilot Pro+ is $39 a month. The free tier caps you at 2,000 completions and two models. Want DeepSeek V4, Kimi K2.6, GLM 5.1, Qwen3.7 Max? Not in the default catalog.&lt;/p&gt;

&lt;p&gt;There's a VS Code API that lets extensions register custom providers into the native model picker. I built one that plugs in 30+ models. No Copilot subscription required. Same Chat UI, same Agent Mode, just more models to pick from.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why bother adding more models to Copilot Chat?
&lt;/h3&gt;

&lt;p&gt;A few reasons I kept running into:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;The free tier is thin.&lt;/strong&gt; Two models, 2,000 completions. Fine for a weekend. Falls apart on a real project.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Pro+ locks the interesting models behind $39.&lt;/strong&gt; Claude Opus, GPT-5.5 premium, Gemini 3.5 Pro. You pay the full monthly even if you only need them occasionally.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Open-weight models got good.&lt;/strong&gt; DeepSeek V4, Kimi K2.6, GLM 5.2, Qwen3.7 can hold their own against frontier models for most coding tasks. They're cheap or free, but wiring each one into VS Code manually is a chore.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Different tasks want different models.&lt;/strong&gt; Quick refactor vs. deep debugging vs. large-context review. One model doesn't fit all.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The extension sits in between. Free models when the task is simple. Pay-per-use when you need Claude Opus for one tricky prompt. Flat subscription when you're grinding through a refactor.&lt;/p&gt;

&lt;h3&gt;
  
  
  A few things to know before you start
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;You need GitHub Copilot Chat installed.&lt;/strong&gt; The free version works. You don't need Pro, Pro+, or Max.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;You need a VS Code account or local install.&lt;/strong&gt; VS Code 1.100 or newer.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;API keys are stored in VS Code SecretStorage.&lt;/strong&gt; They don't leave your machine.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Free models rotate.&lt;/strong&gt; Big Pickle is always free. DeepSeek V4 Flash Free, MiMo V2.5 Free, and Nemotron rotate in and out. The paid models stay put.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The extension is MIT, independent.&lt;/strong&gt; Not affiliated with GitHub, OpenCode, or any model provider.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Which plan fits your use case?
&lt;/h3&gt;

&lt;p&gt;Three options, same extension. Pick based on how much you code.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Just testing, or light use.&lt;/strong&gt; OpenCode Zen free models. $0. DeepSeek V4 Flash works at $0 balance. No card needed. Rate limits are low without balance, but enough to try the extension.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Daily coding.&lt;/strong&gt; OpenCode Go subscription. $10 a month, $5 the first month. DeepSeek V4 Pro, Kimi K2.6, GLM 5.1, Qwen3.7 Max, MiMo V2.5 Pro, MiniMax M3. Generous limits across 5-hour, weekly, and monthly windows.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Need Claude, GPT, or Gemini occasionally.&lt;/strong&gt; Zen pay-per-use. Add $20 balance. Claude Opus 4.7 ($5/$25 per 1M tokens), GPT-5.5 ($5/$30), Gemini 3.5 Flash ($0.50/$3). You pay only for what you use. Adding balance also improves rate limits on the free models.&lt;/p&gt;

&lt;p&gt;My honest take: start with the free tier. If you hit the rate limit more than twice a week, upgrade to Go. Add Zen balance only when you specifically need Claude or GPT for a task.&lt;/p&gt;

&lt;h3&gt;
  
  
  How to set it up
&lt;/h3&gt;

&lt;p&gt;Takes about 60 seconds.&lt;/p&gt;

&lt;h4&gt;
  
  
  1. Install GitHub Copilot Chat
&lt;/h4&gt;

&lt;p&gt;If you don't have it already:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;code &lt;span class="nt"&gt;--install-extension&lt;/span&gt; GitHub.copilot-chat
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Sign in with any GitHub account. The free tier works.&lt;/p&gt;

&lt;h4&gt;
  
  
  2. Install OpenCode for Copilot Chat
&lt;/h4&gt;

&lt;p&gt;From the marketplace: &lt;a href="https://marketplace.visualstudio.com/items?itemName=ltmoerdani.opencode-copilot-chat" rel="noopener noreferrer"&gt;OpenCode for Copilot Chat&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Or via command line:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;code &lt;span class="nt"&gt;--install-extension&lt;/span&gt; ltmoerdani.opencode-copilot-chat
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  3. Get an OpenCode API key
&lt;/h4&gt;

&lt;p&gt;Go to &lt;a href="https://opencode.ai/auth" rel="noopener noreferrer"&gt;opencode.ai/auth&lt;/a&gt;, sign up, and copy your API key.&lt;/p&gt;

&lt;p&gt;You don't need to add a payment method to start. The free models work at $0 balance.&lt;/p&gt;

&lt;h4&gt;
  
  
  4. Add the provider to Copilot Chat
&lt;/h4&gt;

&lt;p&gt;Open Copilot Chat in VS Code (&lt;code&gt;Cmd+Shift+I&lt;/code&gt; or &lt;code&gt;Ctrl+Shift+I&lt;/code&gt;). Click the model name in the picker.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Model picker → "Add Models..." → OpenCode Zen → paste API key
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Press Enter to accept the default group name. The models appear in the picker.&lt;/p&gt;

&lt;h4&gt;
  
  
  5. Pick a model and start chatting
&lt;/h4&gt;

&lt;p&gt;Pick any OpenCode model from the dropdown. Start with &lt;code&gt;DeepSeek V4 Flash Free&lt;/code&gt; if you want to test without paying anything.&lt;/p&gt;

&lt;p&gt;That's it. Tool-calling, Agent Mode, file edits, terminal commands. All work natively because the extension forwards tool-call format correctly per endpoint.&lt;/p&gt;

&lt;h3&gt;
  
  
  What you actually get
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;&lt;/th&gt;
&lt;th&gt;Copilot Free&lt;/th&gt;
&lt;th&gt;Copilot Pro+ $39/mo&lt;/th&gt;
&lt;th&gt;OpenCode Extension&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Models&lt;/td&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;Premium only&lt;/td&gt;
&lt;td&gt;30+&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Free model&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;DeepSeek V4 Flash&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Reasoning controls&lt;/td&gt;
&lt;td&gt;None&lt;/td&gt;
&lt;td&gt;GitHub decides&lt;/td&gt;
&lt;td&gt;You set per model&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Agent Mode&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Vision, PDF, Audio&lt;/td&gt;
&lt;td&gt;Limited&lt;/td&gt;
&lt;td&gt;Limited&lt;/td&gt;
&lt;td&gt;Per-model&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Provider&lt;/td&gt;
&lt;td&gt;GitHub&lt;/td&gt;
&lt;td&gt;GitHub&lt;/td&gt;
&lt;td&gt;OpenCode Zen or Go&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  The models
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Free (no payment needed):&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Big Pickle (always free, 200K context)&lt;/li&gt;
&lt;li&gt;DeepSeek V4 Flash Free (200K context)&lt;/li&gt;
&lt;li&gt;MiMo V2.5 Free (rotating)&lt;/li&gt;
&lt;li&gt;Nemotron 3 Super Free (rotating)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Go subscription, $10/mo ($5 first month):&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DeepSeek V4 Pro (1M context, 384K output, reasoning off to max)&lt;/li&gt;
&lt;li&gt;Kimi K2.6 (262K context, reasoning on/off)&lt;/li&gt;
&lt;li&gt;GLM 5.1 (202K context)&lt;/li&gt;
&lt;li&gt;Qwen3.7 Max (1M context, thinking budget 4K to 82K)&lt;/li&gt;
&lt;li&gt;MiMo V2.5 Pro (1M context)&lt;/li&gt;
&lt;li&gt;MiniMax M3 (512K context)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Zen pay-per-use (add balance):&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Claude Opus 4.7 ($5/$25 per 1M tokens)&lt;/li&gt;
&lt;li&gt;Claude Sonnet 4.6 ($3/$15)&lt;/li&gt;
&lt;li&gt;GPT-5.5 ($5/$30)&lt;/li&gt;
&lt;li&gt;GPT-5.4 ($0.75 to $30 depending on variant)&lt;/li&gt;
&lt;li&gt;Gemini 3.5 Flash ($0.50/$3)&lt;/li&gt;
&lt;li&gt;Grok 4 (256K context)&lt;/li&gt;
&lt;li&gt;Mistral Large, Llama 4 Maverick, Sonar Pro, Command R+&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Thinking controls per model family
&lt;/h3&gt;

&lt;p&gt;Each family has its own reasoning knob. You set it from the model picker, no config file.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;DeepSeek:&lt;/strong&gt; off / low / medium / high / max&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Qwen:&lt;/strong&gt; &lt;code&gt;thinking_budget&lt;/code&gt; from 4,096 to 81,920 tokens&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MiMo:&lt;/strong&gt; low / medium / high&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;MiniMax:&lt;/strong&gt; on / off&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;GLM and Kimi:&lt;/strong&gt; on / off&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I usually keep DeepSeek on &lt;code&gt;high&lt;/code&gt; for debugging, &lt;code&gt;medium&lt;/code&gt; for refactors, &lt;code&gt;off&lt;/code&gt; for quick questions. Qwen's &lt;code&gt;thinking_budget&lt;/code&gt; is handy when you want reasoning but need to cap token cost on a long session.&lt;/p&gt;

&lt;h3&gt;
  
  
  How the routing actually works
&lt;/h3&gt;

&lt;p&gt;Different model families speak different protocols. The extension routes each one to its native endpoint.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;GPT models        → OpenAI /responses
Gemini            → Google :streamGenerateContent?alt=sse
Claude, MiniMax   → Anthropic /messages
DeepSeek, Qwen,
  Kimi, GLM, MiMo → /chat/completions
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Tool-call format gets translated per endpoint. OpenAI uses &lt;code&gt;tool_calls&lt;/code&gt;, Anthropic uses &lt;code&gt;tool_use&lt;/code&gt; content blocks. Agent Mode (read files, edit, run terminal) keeps working because the translation happens in the streaming layer.&lt;/p&gt;

&lt;p&gt;There's also a retry layer. If the upstream API rejects a parameter because &lt;code&gt;models.dev&lt;/code&gt; metadata is stale, the extension parses the error, patches the request body, and retries once. This handles thinking config mismatches and temperature rejections without requiring a code release.&lt;/p&gt;

&lt;h3&gt;
  
  
  Honest limitations
&lt;/h3&gt;

&lt;p&gt;Worth being upfront about these:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Session cost doesn't show in VS Code's native session popover.&lt;/strong&gt; VS Code 1.126 doesn't convert BYOK usage data parts into &lt;code&gt;IChatUsage&lt;/code&gt; progress events yet. The extension tracks cost in its own status bar. When VS Code fixes this, the data will flow through automatically.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Free models have low rate limits without balance.&lt;/strong&gt; Adding $20 to Zen improves this, but if you want truly free, expect to wait during peak hours.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Some models need specific configurations.&lt;/strong&gt; Kimi K2.7 Code rejects &lt;code&gt;temperature&lt;/code&gt; and forces thinking on. GLM only accepts &lt;code&gt;off&lt;/code&gt;, &lt;code&gt;high&lt;/code&gt;, or &lt;code&gt;max&lt;/code&gt;. The extension handles these per-model quirks, but if a new model drops with a new quirk, it might 400 until metadata catches up.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Vision support varies.&lt;/strong&gt; GLM, MiniMax, and a few others don't support image input. The extension filters this based on &lt;code&gt;models.dev&lt;/code&gt; metadata.&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Conclusion
&lt;/h3&gt;

&lt;p&gt;If you're on Copilot Free and keep hitting the limit, or you're on Pro+ and keep wishing for a model that isn't in the catalog, this is a way out. The free tier is enough to test. The Go subscription is the cheapest way to get daily access to the open-weight catalog. Zen pay-per-use covers the Claude/GPT/Gemini cases without a $39 commitment.&lt;/p&gt;

&lt;p&gt;Same Chat UI you already use. More models. Lower bill most months.&lt;/p&gt;

&lt;p&gt;Repo is &lt;a href="https://github.com/ltmoerdani/opencode-copilot-chat" rel="noopener noreferrer"&gt;MIT on GitHub&lt;/a&gt;. Contributions welcome. Four external contributors have already shipped features, bug fixes, and docs.&lt;/p&gt;

&lt;h3&gt;
  
  
  Links
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://marketplace.visualstudio.com/items?itemName=ltmoerdani.opencode-copilot-chat" rel="noopener noreferrer"&gt;Marketplace listing&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://github.com/ltmoerdani/opencode-copilot-chat" rel="noopener noreferrer"&gt;GitHub repo&lt;/a&gt; (MIT, contributions welcome)&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://opencode.ai/docs" rel="noopener noreferrer"&gt;OpenCode docs&lt;/a&gt; (API key setup, pricing details)&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://marketplace.visualstudio.com/items?itemName=ltmoerdani.cline-copilot-chat" rel="noopener noreferrer"&gt;Cline Copilot Chat&lt;/a&gt; (sister extension for ClinePass models)&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>vscode</category>
      <category>opensource</category>
      <category>tutorial</category>
    </item>
  </channel>
</rss>
