<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Manish</title>
    <description>The latest articles on DEV Community by Manish (@mkagenius).</description>
    <link>https://dev.to/mkagenius</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2726700%2F1a61be83-1804-409c-8e0a-89ad399b2074.jpg</url>
      <title>DEV Community: Manish</title>
      <link>https://dev.to/mkagenius</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/mkagenius"/>
    <language>en</language>
    <item>
      <title>Small win: Made CodeRunner and got appreciated by Google Gemini team!</title>
      <dc:creator>Manish</dc:creator>
      <pubDate>Wed, 02 Jul 2025 05:29:06 +0000</pubDate>
      <link>https://dev.to/mkagenius/small-win-made-coderunner-and-got-appreciated-by-google-gemini-team-1fij</link>
      <guid>https://dev.to/mkagenius/small-win-made-coderunner-and-got-appreciated-by-google-gemini-team-1fij</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F063zj9evikq8ucxz3jk2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F063zj9evikq8ucxz3jk2.png" alt="Image description" width="800" height="940"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A small story to motivate you guys to work on cutting edge frontier tech:&lt;/p&gt;

&lt;p&gt;You might have heard about Gemini CLI[1] release a few days back. I managed to integrate Apple Containers (also released recently) to execute the code generated by AI safely.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/google-gemini/gemini-cli" rel="noopener noreferrer"&gt;Gemini CLI&lt;/a&gt; supports code execution itself but via shell, which can be a security issue. &lt;a href="https://github.com/BandarLabs/coderunner" rel="noopener noreferrer"&gt;CodeRunner&lt;/a&gt; solves this by running it in a safe isolated container.&lt;/p&gt;

&lt;p&gt;Got appreciation from Google team:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuqadik03fb8vzexzpch2.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fuqadik03fb8vzexzpch2.png" alt="Image description" width="640" height="330"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;cperry is from the Google team who commented on my reply with "&amp;lt;3 amazing". I will consider it a win.&lt;/p&gt;

&lt;p&gt;Some asked me,&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;That is amazing. Could you tell me how you did it and how you even thought of it in the first place? I'm a student so please eli5&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Context:&lt;/strong&gt; Everyday something new is released, I read about it and in depth -- doesn't take a lot of time now as I have been doing this for a few years.&lt;/p&gt;

&lt;p&gt;When Apple Container was released I had already read about WASM[1], docker[2] etc. The native apple containers are kind of a big thing I felt, so I and my friend created CodeRunner[4] for apple containers 4 days back.&lt;/p&gt;

&lt;p&gt;I had also seen a trend of MCPs[3] -- initially I brushed it off but it seems there are a lot of tech built around it. So, converted my APIs (which eventually execute code) to MCP APIs which do the same thing but are now MCP compatible.&lt;/p&gt;

&lt;p&gt;Cut to Gemini CLI release, I went to check if they have MCP support and since I had read about MCPs before I knew it would be a quick change to make it work for Gemini and indeed they did have support and after that it was a matter of config change!&lt;/p&gt;

&lt;p&gt;Moral of the story is, be ready, opportunities will come and then you strike.&lt;/p&gt;

&lt;p&gt;And if somehow we magically reach 500 stars on github -- who knows it might become part of official Gemini CLI. Lot of small wins can lead to a big win!&lt;/p&gt;

&lt;p&gt;WASM: &lt;a href="https://en.wikipedia.org/wiki/WebAssembly" rel="noopener noreferrer"&gt;https://en.wikipedia.org/wiki/WebAssembly&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;docker: &lt;a href="https://en.wikipedia.org/wiki/Docker_(software)" rel="noopener noreferrer"&gt;https://en.wikipedia.org/wiki/Docker_(software)&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;MCP: &lt;a href="https://en.wikipedia.org/wiki/Model_Context_Protocol" rel="noopener noreferrer"&gt;https://en.wikipedia.org/wiki/Model_Context_Protocol&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;CodeRunner (mine): &lt;a href="https://github.com/BandarLabs/coderunner" rel="noopener noreferrer"&gt;https://github.com/BandarLabs/coderunner&lt;/a&gt;&lt;/p&gt;

</description>
      <category>gemini</category>
      <category>google</category>
      <category>containers</category>
      <category>startup</category>
    </item>
    <item>
      <title>Gemini is so good, I have let it control/use my phone</title>
      <dc:creator>Manish</dc:creator>
      <pubDate>Mon, 20 Jan 2025 14:35:56 +0000</pubDate>
      <link>https://dev.to/mkagenius/gemini-is-so-good-i-have-let-it-controluse-my-phone-4mf6</link>
      <guid>https://dev.to/mkagenius/gemini-is-so-good-i-have-let-it-controluse-my-phone-4mf6</guid>
      <description>&lt;h2&gt;
  
  
  Click3: LLMs have taken control of my phone
&lt;/h2&gt;

&lt;p&gt;Or &lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Draft a mail to &lt;a href="mailto:someone@gmail.com"&gt;someone@gmail.com&lt;/a&gt; and ask for lunch next Saturday.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Or&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Find out what's my rating in Uber?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Or &lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Start a 3+2 game on lichess&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;All of these tasks require you to know the UI, like where to find the ratings - "is it in the profile section or is it even there in the app?" and do multiple clicks afterwards.&lt;/p&gt;

&lt;p&gt;But with this &lt;a href="https://github.com/BandarLabs/clickclickclick" rel="noopener noreferrer"&gt;framework&lt;/a&gt;, you can just type in plaintext and see the LLM do it for you.&lt;/p&gt;

&lt;p&gt;&lt;iframe width="710" height="399" src="https://www.youtube.com/embed/ZVUFaQ2iD8c"&gt;
&lt;/iframe&gt;
&lt;/p&gt;

&lt;h2&gt;
  
  
  How does it work?
&lt;/h2&gt;

&lt;p&gt;It has three separate components, as seen below and each have their own separation of concerns:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;Planner&lt;/code&gt;: Plans the next step (given current screenshot and previous actions)&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;Finder&lt;/code&gt;: Finds specific UI elements (whatever Planner asks it to find)&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;Executor&lt;/code&gt;: Clicks, scrolls, types etc&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo6yr7f4rutq05a6a7ea5.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo6yr7f4rutq05a6a7ea5.png" alt="Architecture diagram" width="800" height="506"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  OpenAI / Gemini / Local LLM
&lt;/h2&gt;

&lt;p&gt;I have added supports for all of the above, to be precise, the following is the recommended models for each component. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fx5zz68t7pdidnv00bn5g.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fx5zz68t7pdidnv00bn5g.png" alt="Model recommendation table" width="800" height="497"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;You can use your own keys and run it locally, the support for local LLM is what I am most excited about and the &lt;code&gt;Molmo&lt;/code&gt; MLX for MacOS is a great start, I feel.&lt;/p&gt;

&lt;h2&gt;
  
  
  Open-sourced
&lt;/h2&gt;

&lt;p&gt;This project is open sourced for everyone to use and contribute. You can also check out more demos in the &lt;code&gt;README&lt;/code&gt;: &lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/BandarLabs/clickclickclick" rel="noopener noreferrer"&gt;https://github.com/BandarLabs/clickclickclick&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What do you guys think?
&lt;/h2&gt;

&lt;p&gt;What are the use cases you can think of for this? For starters, I think it can be used to "create overlays of walkthrough over any app" or "automate testing of any functionality of an app" for developers.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>🎙️I built "GitPodcast" to help devs understand 💡any GitHub repo easily</title>
      <dc:creator>Manish</dc:creator>
      <pubDate>Fri, 17 Jan 2025 07:04:12 +0000</pubDate>
      <link>https://dev.to/mkagenius/i-built-gitpodcast-to-help-devs-understand-any-github-repo-easily-1pkn</link>
      <guid>https://dev.to/mkagenius/i-built-gitpodcast-to-help-devs-understand-any-github-repo-easily-1pkn</guid>
      <description>&lt;p&gt;Understanding any new codebase was a daunting task for me while listening to any podcast is something that comes very naturally to me. So, what if we combine the two ideas - that is understand any GitHub repository via a Podcast. &lt;/p&gt;

&lt;h2&gt;
  
  
  GitPodcast
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://gitpodcast.com" rel="noopener noreferrer"&gt;Gitpodcast&lt;/a&gt;: In this tool you can enter any GitHub url, and get the podcast within a minute. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkjzjuspaj4d40jcfaouc.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkjzjuspaj4d40jcfaouc.png" alt="Image showing gitpodcast screenshot" width="800" height="548"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Users Feedback
&lt;/h2&gt;

&lt;p&gt;Users wanted longer podcast, so I have added the ability to get upto 10 minutes of podcast (Do people want even longer podcast or is this enough?)&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyj1v223nkinr60ebstqi.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fyj1v223nkinr60ebstqi.png" alt="Image showing ability to choose podcast length" width="296" height="148"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Talking to users really helps you understand what exactly they want, and this was the main feature they wanted instead of the options to add multiple voices which I was thinking as the main missing feature.&lt;/p&gt;

&lt;p&gt;Got the top 3 product award from producthunt on the day of the launch. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdwy5ykppq746uhitl0hk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fdwy5ykppq746uhitl0hk.png" alt="A badge showing product of the day award" width="574" height="186"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  How to use
&lt;/h2&gt;

&lt;p&gt;Two ways to use it:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;Just replace &lt;code&gt;hub&lt;/code&gt; in the url with &lt;code&gt;podcast&lt;/code&gt; (when you are at any github repository)&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;or Paste the repo link in the website input box&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;

&lt;h2&gt;
  
  
  What do you guys think?
&lt;/h2&gt;

&lt;p&gt;You can check it out here: &lt;a href="https://gitpodcast.com" rel="noopener noreferrer"&gt;https://gitpodcast.com&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Open-source
&lt;/h2&gt;

&lt;p&gt;The code is open source and you can host it yourself with your keys - &lt;a href="https://github.com/BandarLabs/gitpodcast" rel="noopener noreferrer"&gt;https://github.com/BandarLabs/gitpodcast&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;(In case you are logged in to github already, please do leave a star there!)&lt;/p&gt;

</description>
      <category>productivity</category>
      <category>opensource</category>
      <category>ai</category>
      <category>podcast</category>
    </item>
  </channel>
</rss>
