<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Limen4ik</title>
    <description>The latest articles on DEV Community by Limen4ik (@limen4ik).</description>
    <link>https://dev.to/limen4ik</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3914164%2Fdcaf7bba-ccf4-4f72-baba-5596b520d74c.webp</url>
      <title>DEV Community: Limen4ik</title>
      <link>https://dev.to/limen4ik</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/limen4ik"/>
    <language>en</language>
    <item>
      <title>Hey community! We are 13yo and we just released our "home" version of DeepSeek. What do you think?</title>
      <dc:creator>Limen4ik</dc:creator>
      <pubDate>Tue, 05 May 2026 17:42:27 +0000</pubDate>
      <link>https://dev.to/limen4ik/hey-community-we-are-13yo-and-we-just-released-our-home-version-of-deepseek-what-do-you-think-2p0m</link>
      <guid>https://dev.to/limen4ik/hey-community-we-are-13yo-and-we-just-released-our-home-version-of-deepseek-what-do-you-think-2p0m</guid>
      <description>&lt;div class="ltag__link--embedded"&gt;
  &lt;div class="crayons-story "&gt;
  &lt;a href="https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3" class="crayons-story__hidden-navigation-link"&gt;How Two 13-Year-Olds Distilled DeepSeek-V4 Reasoning into Qwen3.5-2B&lt;/a&gt;


  &lt;div class="crayons-story__body crayons-story__body-full_post"&gt;
    &lt;div class="crayons-story__top"&gt;
      &lt;div class="crayons-story__meta"&gt;
        &lt;div class="crayons-story__author-pic"&gt;

          &lt;a href="/limen4ik" class="crayons-avatar  crayons-avatar--l  "&gt;
            &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3914164%2Fdcaf7bba-ccf4-4f72-baba-5596b520d74c.webp" alt="limen4ik profile" class="crayons-avatar__image" width="200" height="200"&gt;
          &lt;/a&gt;
        &lt;/div&gt;
        &lt;div&gt;
          &lt;div&gt;
            &lt;a href="/limen4ik" class="crayons-story__secondary fw-medium m:hidden"&gt;
              Limen4ik
            &lt;/a&gt;
            &lt;div class="profile-preview-card relative mb-4 s:mb-0 fw-medium hidden m:inline-block"&gt;
              
                Limen4ik
                
              
              &lt;div id="story-author-preview-content-3616474" class="profile-preview-card__content crayons-dropdown branded-7 p-4 pt-0"&gt;
                &lt;div class="gap-4 grid"&gt;
                  &lt;div class="-mt-4"&gt;
                    &lt;a href="/limen4ik" class="flex"&gt;
                      &lt;span class="crayons-avatar crayons-avatar--xl mr-2 shrink-0"&gt;
                        &lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3914164%2Fdcaf7bba-ccf4-4f72-baba-5596b520d74c.webp" class="crayons-avatar__image" alt="" width="200" height="200"&gt;
                      &lt;/span&gt;
                      &lt;span class="crayons-link crayons-subtitle-2 mt-5"&gt;Limen4ik&lt;/span&gt;
                    &lt;/a&gt;
                  &lt;/div&gt;
                  &lt;div class="print-hidden"&gt;
                    
                      Follow
                    
                  &lt;/div&gt;
                  &lt;div class="author-preview-metadata-container"&gt;&lt;/div&gt;
                &lt;/div&gt;
              &lt;/div&gt;
            &lt;/div&gt;

          &lt;/div&gt;
          &lt;a href="https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3" class="crayons-story__tertiary fs-xs"&gt;&lt;time&gt;May 5&lt;/time&gt;&lt;span class="time-ago-indicator-initial-placeholder"&gt;&lt;/span&gt;&lt;/a&gt;
        &lt;/div&gt;
      &lt;/div&gt;

    &lt;/div&gt;

    &lt;div class="crayons-story__indention"&gt;
      &lt;h2 class="crayons-story__title crayons-story__title-full_post"&gt;
        &lt;a href="https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3" id="article-link-3616474"&gt;
          How Two 13-Year-Olds Distilled DeepSeek-V4 Reasoning into Qwen3.5-2B
        &lt;/a&gt;
      &lt;/h2&gt;
        &lt;div class="crayons-story__tags"&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/ai"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;ai&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/opensource"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;opensource&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/machinelearning"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;machinelearning&lt;/a&gt;
            &lt;a class="crayons-tag  crayons-tag--monochrome " href="/t/python"&gt;&lt;span class="crayons-tag__prefix"&gt;#&lt;/span&gt;python&lt;/a&gt;
        &lt;/div&gt;
      &lt;div class="crayons-story__bottom"&gt;
        &lt;div class="crayons-story__details"&gt;
          &lt;a href="https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left"&gt;
            &lt;div class="multiple_reactions_aggregate"&gt;
              &lt;span class="multiple_reactions_icons_container"&gt;
                  &lt;span class="crayons_icon_container"&gt;
                    &lt;img src="https://assets.dev.to/assets/sparkle-heart-5f9bee3767e18deb1bb725290cb151c25234768a0e9a2bd39370c382d02920cf.svg" width="24" height="24"&gt;
                  &lt;/span&gt;
              &lt;/span&gt;
              &lt;span class="aggregate_reactions_counter"&gt;2&lt;span class="hidden s:inline"&gt; reactions&lt;/span&gt;&lt;/span&gt;
            &lt;/div&gt;
          &lt;/a&gt;
            &lt;a href="https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3#comments" class="crayons-btn crayons-btn--s crayons-btn--ghost crayons-btn--icon-left flex items-center"&gt;
              Comments


              2&lt;span class="hidden s:inline"&gt; comments&lt;/span&gt;
            &lt;/a&gt;
        &lt;/div&gt;
        &lt;div class="crayons-story__save"&gt;
          &lt;small class="crayons-story__tertiary fs-xs mr-2"&gt;
            2 min read
          &lt;/small&gt;
            
              &lt;span class="bm-initial"&gt;
                

              &lt;/span&gt;
              &lt;span class="bm-success"&gt;
                

              &lt;/span&gt;
            
        &lt;/div&gt;
      &lt;/div&gt;
    &lt;/div&gt;
  &lt;/div&gt;
&lt;/div&gt;

&lt;/div&gt;


</description>
      <category>ai</category>
      <category>llm</category>
      <category>machinelearning</category>
      <category>showdev</category>
    </item>
    <item>
      <title>How Two 13-Year-Olds Distilled DeepSeek-V4 Reasoning into Qwen3.5-2B</title>
      <dc:creator>Limen4ik</dc:creator>
      <pubDate>Tue, 05 May 2026 16:54:14 +0000</pubDate>
      <link>https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3</link>
      <guid>https://dev.to/limen4ik/how-two-13-year-olds-distilled-deepseek-v4-reasoning-into-qwen35-2b-6h3</guid>
      <description>&lt;p&gt;Hello everyone!&lt;br&gt;
&lt;strong&gt;We are two 13-year-old students from Russia&lt;/strong&gt;, and we want to show&lt;br&gt;
you our model: &lt;strong&gt;QwenSeek-2B!&lt;/strong&gt;&lt;br&gt;
Based on: Qwen3.5-2B, which we fine-tuned on ~8K&lt;br&gt;
reasoning examples from DeepSeek-V4-Flash using the "Unsloth" Framework.&lt;/p&gt;

&lt;p&gt;For us, this is a big result! We released the model on Hugging Face, and the&lt;br&gt;
GGUF version has already gained over &lt;strong&gt;1000+ downloads!&lt;/strong&gt; We are just incredibly&lt;br&gt;
happy about this!!!&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Training Details:&lt;/strong&gt;&lt;br&gt;
Total epochs: 1 (250 steps) &lt;br&gt;
Training time: ~10 hours (at a speed of 0.01 it/s...) &lt;br&gt;
Total examples: ~8K (input and output)&lt;br&gt;
Dataset: &lt;a href="https://huggingface.co/datasets/Jackrong/DeepSeek-V4-Distill-8000x" rel="noopener noreferrer"&gt;https://huggingface.co/datasets/Jackrong/DeepSeek-V4-Distill-8000x&lt;/a&gt;&lt;br&gt;
Hardware: T4 x 1 (Kaggle) &lt;br&gt;
Context window size: 4096&lt;/p&gt;

&lt;p&gt;We will continue to release new models, even cooler! And maybe in the near&lt;br&gt;
future, we will train a model from &lt;strong&gt;scratch&lt;/strong&gt;! But for now, we are looking for good&lt;br&gt;
GPUs, maybe we will even apply somewhere for GPU Grants!&lt;/p&gt;

&lt;p&gt;By the way, the training took 10 hours, right? But we faced huge obstacles&lt;br&gt;
before we could finally achieve the result! For a couple of days, we suffered&lt;br&gt;
from...: &lt;br&gt;
&lt;strong&gt;There were times when the loss spiked to 3+...&lt;/strong&gt;&lt;br&gt;
&lt;strong&gt;There were times with&lt;br&gt;
OOM in the middle of training...&lt;/strong&gt; &lt;br&gt;
And other minor bugs... But we handled it!&lt;br&gt;
&lt;em&gt;Also: We trained in FP32 because Qwen3.5 refused to work with FP16, and BF16&lt;br&gt;
didn't work on T4... 🤣&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;We just decided to make a "Home" DeepSeek! A Mini DeepSeek! &lt;br&gt;
&lt;em&gt;That’s how it&lt;br&gt;
is, 13-year-old students are slowly teaching AI! :)&lt;/em&gt;&lt;br&gt;
And now... If you want, you&lt;br&gt;
can try our models, or just take a look:&lt;br&gt;
&lt;a href="https://huggingface.co/faunix/QwenSeek-2B" rel="noopener noreferrer"&gt;https://huggingface.co/faunix/QwenSeek-2B&lt;/a&gt;&lt;br&gt;
And also the GGUF version:&lt;br&gt;
&lt;a href="https://huggingface.co/faunix/QwenSeek-2B-GGUF" rel="noopener noreferrer"&gt;https://huggingface.co/faunix/QwenSeek-2B-GGUF&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Try running it, see how it thinks and maybe send us Feedback!&lt;/strong&gt; &lt;br&gt;
We will be very grateful! Ask your questions, ask... &lt;br&gt;
&lt;em&gt;And let's go build open AI&lt;br&gt;
together!&lt;/em&gt; :)&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>machinelearning</category>
      <category>python</category>
    </item>
  </channel>
</rss>
