<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Aneesh Lade</title>
    <description>The latest articles on DEV Community by Aneesh Lade (@aneesh_lade_2605).</description>
    <link>https://dev.to/aneesh_lade_2605</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3942641%2F585bb14e-7330-4476-a135-08dd4d240afe.jpeg</url>
      <title>DEV Community: Aneesh Lade</title>
      <link>https://dev.to/aneesh_lade_2605</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/aneesh_lade_2605"/>
    <language>en</language>
    <item>
      <title>Week 2</title>
      <dc:creator>Aneesh Lade</dc:creator>
      <pubDate>Thu, 04 Jun 2026 21:03:13 +0000</pubDate>
      <link>https://dev.to/aneesh_lade_2605/week-2-hm9</link>
      <guid>https://dev.to/aneesh_lade_2605/week-2-hm9</guid>
      <description>&lt;p&gt;Hello everyone! It has been a busy week, but I've made some exciting progress on my machine learning journey. Here is what I've been up to:&lt;/p&gt;

&lt;h2&gt;
  
  
  Kaggle Orbit Wars &amp;amp; AWS
&lt;/h2&gt;

&lt;p&gt;I completed the baseline implementation for the Kaggle Orbit Wars competition and initially hit a score of around 1030. My score has dipped slightly over the past few days, so I am currently brainstorming ways to improve it. &lt;/p&gt;

&lt;p&gt;This week also marked my very first time using AWS! I used it to extract data for reinforcement learning. Transparency check: I spent exactly $7.58 USD on AWS resources during the process.&lt;/p&gt;

&lt;h2&gt;
  
  
  Paper Reading &amp;amp; RL Insights
&lt;/h2&gt;

&lt;p&gt;I spent a lot of time reading research papers this week. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;AlphaZero:&lt;/strong&gt; I was initially excited about using the self-play mechanism from AlphaZero. However, because this specific game has rock-paper-scissors dynamics, standard self-play might not work effectively.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AlphaStar:&lt;/strong&gt; This led me to the AlphaStar paper, which uses self-play combined with &lt;strong&gt;League Training&lt;/strong&gt;. &lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The engineering behind AlphaStar is incredible. Two specific concepts really stood out to me: &lt;strong&gt;Pointer Networks&lt;/strong&gt; and &lt;strong&gt;V-trace off-policy correction&lt;/strong&gt;. I was also impressed by their use of an &lt;strong&gt;LSTM core&lt;/strong&gt; to handle long-term memory. &lt;/p&gt;

&lt;h2&gt;
  
  
  Next Steps
&lt;/h2&gt;

&lt;p&gt;Moving forward, I plan to leverage Kaggle, AWS, and GCP credits to train different components of my model. I am giving myself total freedom to experiment, imagine, and test unconventional solutions.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Random life update to close out the week: I used to have long hair because I was insecure about my forehead, but I finally decided to shave it all off at home by myself. It honestly feels really weird right now, but it's a fresh start!&lt;/em&gt;&lt;/p&gt;

</description>
      <category>aws</category>
      <category>beginners</category>
      <category>devjournal</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Week 1</title>
      <dc:creator>Aneesh Lade</dc:creator>
      <pubDate>Wed, 27 May 2026 17:20:41 +0000</pubDate>
      <link>https://dev.to/aneesh_lade_2605/week-1-31lj</link>
      <guid>https://dev.to/aneesh_lade_2605/week-1-31lj</guid>
      <description>&lt;p&gt;Hi everyone! It’s been a week since my first post. &lt;/p&gt;

&lt;p&gt;In this past week, I’ve read through research papers on AlphaZero, AlphaGo and many interesting research papers. I have also started implementing my strategy for the Kaggle Orbit Wars competition. &lt;/p&gt;

&lt;p&gt;On top of that, I landed an internship at a startup! My work will focus on MCP (Model Context Protocol), learning and understanding how it works, and implementing it so it can interface with LLMs and a custom simulator they’ve built. &lt;/p&gt;

&lt;p&gt;Looking ahead to next week, I have two main goals:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Kaggle Orbit Wars:&lt;/strong&gt; Complete a baseline for the strategy I've decided on.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Internship:&lt;/strong&gt; Dive seriously into my new role. The team mentioned that this is a relatively new topic, so if we do good work, there’s a chance we might publish a paper. That’s one of the main reasons I'm so excited!&lt;/li&gt;
&lt;/ol&gt;

</description>
      <category>career</category>
      <category>devjournal</category>
      <category>machinelearning</category>
      <category>mcp</category>
    </item>
    <item>
      <title>Hello World</title>
      <dc:creator>Aneesh Lade</dc:creator>
      <pubDate>Wed, 20 May 2026 16:28:34 +0000</pubDate>
      <link>https://dev.to/aneesh_lade_2605/hello-world-ob4</link>
      <guid>https://dev.to/aneesh_lade_2605/hello-world-ob4</guid>
      <description>&lt;p&gt;Hey everyone, I'm Aneesh. &lt;/p&gt;

&lt;p&gt;Today, I’m launching this developer log as a personal accountability challenge. From this week onward, I am committing to publishing &lt;strong&gt;at least one technical post every single week&lt;/strong&gt; (and more than that if I run into breakthroughs or major roadblocks worth sharing). &lt;/p&gt;

&lt;p&gt;My goal is to document the raw, unfiltered engineering journey, the bugs, the design choices, the math derivations, and the late-night simulation wins.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I'm Building: Kaggle Orbit Wars
&lt;/h2&gt;

&lt;p&gt;Right now, my primary focus is the &lt;strong&gt;Kaggle Orbit Wars&lt;/strong&gt; simulation challenge. It's a brutal 2D real-time strategy environment where you have to conquer rotating planets, navigate gravitational paths around a central sun, and optimize fleet trajectories. With about a month left until the final submission deadline in June, I am aggressively iterating on my agent's state estimation and decision-making logic.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I'm Learning: UC Berkeley's CS 285
&lt;/h2&gt;

&lt;p&gt;To back up my practical work with deep theoretical foundations, I am currently working through &lt;strong&gt;UC Berkeley's CS 285 (Deep Reinforcement Learning)&lt;/strong&gt; course. Shifting from hard-coded heuristics to understanding advanced policy gradients, Q-learning value functions, and model-based RL is completely reshaping how I think about designing autonomous agents.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why I'm Doing This
&lt;/h2&gt;

&lt;p&gt;I'm skipping the corporate noise of traditional social media. This space is going to be my open-source lab notebook. If you are also grinding through Orbit Wars, studying RL, or building autonomous systems, follow along or drop a comment—let's build together.&lt;/p&gt;

&lt;p&gt;See you next week for the first deep-dive update!&lt;/p&gt;

</description>
      <category>devjournal</category>
      <category>learning</category>
      <category>productivity</category>
      <category>softwaredevelopment</category>
    </item>
  </channel>
</rss>
