<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Goutam Sharma</title>
    <description>The latest articles on DEV Community by Goutam Sharma (@goutam_sharma_8032653abc9).</description>
    <link>https://dev.to/goutam_sharma_8032653abc9</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3955165%2F47d7c6ba-caae-446a-9d67-62d241fb386a.jpg</url>
      <title>DEV Community: Goutam Sharma</title>
      <link>https://dev.to/goutam_sharma_8032653abc9</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/goutam_sharma_8032653abc9"/>
    <language>en</language>
    <item>
      <title># Starting My Journey into AI Research 🚀</title>
      <dc:creator>Goutam Sharma</dc:creator>
      <pubDate>Wed, 27 May 2026 20:54:49 +0000</pubDate>
      <link>https://dev.to/goutam_sharma_8032653abc9/-starting-my-journey-into-ai-research-2bde</link>
      <guid>https://dev.to/goutam_sharma_8032653abc9/-starting-my-journey-into-ai-research-2bde</guid>
      <description>&lt;p&gt;Hello everyone!&lt;/p&gt;

&lt;p&gt;I am a 3rd year Computer Science student, and recently I started exploring the world of AI research, especially in:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Vision Language Models (VLMs)&lt;/li&gt;
&lt;li&gt;Spatial AI&lt;/li&gt;
&lt;li&gt;Semantic Segmentation&lt;/li&gt;
&lt;li&gt;Multimodal Learning&lt;/li&gt;
&lt;li&gt;Scene Understanding&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;At first, research looked very complicated to me because of research papers, mathematical concepts, and large AI architectures. But after reading papers and experimenting with datasets and models, I realized research is mainly about curiosity and solving problems.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Am Currently Exploring
&lt;/h2&gt;

&lt;p&gt;Recently, I have been learning about:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;How VLMs understand images and text together&lt;/li&gt;
&lt;li&gt;Spatial reasoning in AI systems&lt;/li&gt;
&lt;li&gt;Scene graph understanding&lt;/li&gt;
&lt;li&gt;Data curation and annotation pipelines&lt;/li&gt;
&lt;li&gt;Reducing hallucinations in multimodal models&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  My Goal
&lt;/h2&gt;

&lt;p&gt;I want to work on advanced AI systems that can understand the real world more accurately, especially for applications like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Robotics&lt;/li&gt;
&lt;li&gt;Autonomous systems&lt;/li&gt;
&lt;li&gt;Smart surveillance&lt;/li&gt;
&lt;li&gt;Human-AI interaction&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  What I Learned So Far
&lt;/h2&gt;

&lt;p&gt;One important thing I learned is:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Research is not about knowing everything.&lt;br&gt;
It is about continuously learning and improving ideas.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Technologies I Am Using
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Python&lt;/li&gt;
&lt;li&gt;PyTorch&lt;/li&gt;
&lt;li&gt;Hugging Face&lt;/li&gt;
&lt;li&gt;OpenCV&lt;/li&gt;
&lt;li&gt;Transformers&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;This is just the beginning of my research journey, and I am excited to keep learning, building, and sharing my progress with the community.&lt;/p&gt;

&lt;p&gt;If you are also starting in AI research, feel free to connect with me!&lt;/p&gt;

&lt;h1&gt;
  
  
  ai #machinelearning #research #computervision
&lt;/h1&gt;

</description>
      <category>ai</category>
      <category>computervision</category>
      <category>nlp</category>
      <category>multimodalai</category>
    </item>
  </channel>
</rss>
