<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: deeplearning</title>
    <description>The latest articles tagged 'deeplearning' on DEV Community.</description>
    <link>https://dev.to/t/deeplearning</link>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/tag/deeplearning"/>
    <language>en</language>
    <item>
      <title>SAM.MD: Zero-shot medical image segmentation capabilities of the SegmentAnything Model</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 11:50:28 +0000</pubDate>
      <link>https://dev.to/paperium/sammd-zero-shot-medical-image-segmentation-capabilities-of-the-segmentanything-model-gmh</link>
      <guid>https://dev.to/paperium/sammd-zero-shot-medical-image-segmentation-capabilities-of-the-segmentanything-model-gmh</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Monocular Depth Estimation using Diffusion Models</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 10:40:28 +0000</pubDate>
      <link>https://dev.to/paperium/monocular-depth-estimation-using-diffusion-models-3md7</link>
      <guid>https://dev.to/paperium/monocular-depth-estimation-using-diffusion-models-3md7</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Dropout: Switch Off Neurons to Stop Overfitting</title>
      <dc:creator>Devanshu Biswas</dc:creator>
      <pubDate>Tue, 30 Jun 2026 10:03:20 +0000</pubDate>
      <link>https://dev.to/dev48v/dropout-switch-off-neurons-to-stop-overfitting-3f9a</link>
      <guid>https://dev.to/dev48v/dropout-switch-off-neurons-to-stop-overfitting-3f9a</guid>
      <description>&lt;p&gt;Dropout is almost absurdly simple — randomly switch off neurons during training — yet it was one of the biggest anti-overfitting wins in deep learning. Here's why it works, visualized.&lt;/p&gt;

&lt;p&gt;🎲 &lt;strong&gt;Watch neurons drop (toggle the rate):&lt;/strong&gt; &lt;a href="https://dev48v.infy.uk/dl/day20-dropout.html" rel="noopener noreferrer"&gt;https://dev48v.infy.uk/dl/day20-dropout.html&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What it does
&lt;/h2&gt;

&lt;p&gt;On each training step, each hidden neuron is kept with probability (1−p) and &lt;strong&gt;zeroed out&lt;/strong&gt; with probability p (say p=0.5). A different random subset drops every step. The demo grays out a fresh random set of neurons each pass and cuts their edges.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why that helps
&lt;/h2&gt;

&lt;p&gt;Neurons can't rely on any specific other neuron being present, so they can't &lt;strong&gt;co-adapt&lt;/strong&gt; into a fragile memorized solution — each must learn a feature that's useful on its own. It's like training a huge &lt;strong&gt;ensemble of subnetworks&lt;/strong&gt; that share weights. Result: a smaller train/val gap (less overfitting) — which the two accuracy curves in the demo show.&lt;/p&gt;

&lt;h2&gt;
  
  
  Train vs inference (the gotcha)
&lt;/h2&gt;

&lt;p&gt;You drop during &lt;strong&gt;training only&lt;/strong&gt;. At inference, all neurons are on. To keep the expected activations consistent, &lt;strong&gt;inverted dropout&lt;/strong&gt; scales the kept activations by 1/(1−p) during training, so inference needs no change.&lt;/p&gt;

&lt;h2&gt;
  
  
  Modern note
&lt;/h2&gt;

&lt;p&gt;With batch norm (Day 19) and huge datasets, dropout is needed less in conv nets — but it's still standard in Transformers (attention + feed-forward). It's regularization, alongside L2 (Day 17).&lt;/p&gt;

&lt;p&gt;🔨 Built from scratch (mask = rand &amp;gt; p → scale by 1/(1−p) → off at eval) on the page: &lt;a href="https://dev48v.infy.uk/dl/day20-dropout.html" rel="noopener noreferrer"&gt;https://dev48v.infy.uk/dl/day20-dropout.html&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Part of DeepLearningFromZero. 🌐 &lt;a href="https://dev48v.infy.uk" rel="noopener noreferrer"&gt;https://dev48v.infy.uk&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>deeplearning</category>
      <category>ai</category>
      <category>beginners</category>
    </item>
    <item>
      <title>Project: Cancer Classification Model</title>
      <dc:creator>Ваграм Катранян</dc:creator>
      <pubDate>Tue, 30 Jun 2026 09:56:42 +0000</pubDate>
      <link>https://dev.to/vagram123/project-cancer-classification-model-20i7</link>
      <guid>https://dev.to/vagram123/project-cancer-classification-model-20i7</guid>
      <description>&lt;p&gt;A year ago, I developed a study prototype of a neural network that combines two types of data:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;medical images in DICOM format;&lt;/li&gt;
&lt;li&gt;clinical tabular data (patient age, tumor size, biopsy results).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal of the model is to analyze both images and numerical data simultaneously to classify cancer presence.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key Features&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Multimodality&lt;/strong&gt;: the model processes both images and tabular features.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Attention mechanism&lt;/strong&gt;: highlights the most important features to improve accuracy.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;GPU/CPU support&lt;/strong&gt;: training can be performed on a regular computer or on a GPU.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Evaluation metrics&lt;/strong&gt;: AUC, F1, Precision, Recall — to measure performance objectively.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Engineering design&lt;/strong&gt;: separate classes for dataset, model, training, and logging.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In summary: this project gave me hands‑on experience with medical data and showed how Python can be applied not only in backend development but also in machine learning tasks.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>machinelearning</category>
      <category>showdev</category>
    </item>
    <item>
      <title>Проект: Модель для классификации рака</title>
      <dc:creator>Ваграм Катранян</dc:creator>
      <pubDate>Tue, 30 Jun 2026 09:53:12 +0000</pubDate>
      <link>https://dev.to/vagram123/proiekt-modiel-dlia-klassifikatsii-raka-34g0</link>
      <guid>https://dev.to/vagram123/proiekt-modiel-dlia-klassifikatsii-raka-34g0</guid>
      <description>&lt;p&gt;Год назад я разработал учебный прототип нейросети, которая объединяет два типа данных:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;медицинские снимки в формате DICOM;&lt;/li&gt;
&lt;li&gt;клинические табличные данные (возраст пациента, размер опухоли, результаты биопсии).&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Задача модели — анализировать изображения и цифры одновременно, чтобы классифицировать наличие рака.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Особенности реализации&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Мультимодальность&lt;/strong&gt;: модель работает сразу с изображениями и табличными признаками.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Механизм внимания&lt;/strong&gt; (attention): помогает выделять наиболее важные признаки и повышает точность.&lt;/li&gt;
&lt;li&gt;** Поддержка GPU/CPU**: обучение возможно как на обычном компьютере, так и на графическом процессоре.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Метрики качества&lt;/strong&gt;: AUC, F1, Precision, Recall — для объективной оценки работы модели.&lt;/li&gt;
&lt;li&gt;** Инженерная структура**: отдельные классы для датасета, модели, обучения и логирования.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Вывод&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;проект дал мне опыт работы с медицинскими данными и показал, как Python можно применять в задачах машинного обучения.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>machinelearning</category>
      <category>showdev</category>
    </item>
    <item>
      <title>Data Science vs AI: Which Field Has Better Career Growth in 2026?</title>
      <dc:creator>Subhalaxmi Paikaray</dc:creator>
      <pubDate>Tue, 30 Jun 2026 09:47:31 +0000</pubDate>
      <link>https://dev.to/subha_74/data-science-vs-ai-which-field-has-better-career-growth-in-2026-19e2</link>
      <guid>https://dev.to/subha_74/data-science-vs-ai-which-field-has-better-career-growth-in-2026-19e2</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fy3sz8rsbmnoo6k6g4prz.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fy3sz8rsbmnoo6k6g4prz.png" alt=" " width="800" height="533"&gt;&lt;/a&gt;&lt;br&gt;
Artificial Intelligence (AI) and Data Science are two of the fastest-growing technology domains today. From startups to Fortune 500 companies, organizations are investing heavily in intelligent systems, predictive analytics, and automation. As a result, students often ask one important question:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Should I choose Data Science or Artificial Intelligence?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The truth is, there isn't a universal answer. Both fields offer exciting career opportunities, competitive salaries, and long-term growth. However, they focus on different skills, solve different business problems, and lead to different career paths.&lt;/p&gt;

&lt;p&gt;If you're planning a career in technology, this guide will help you understand the differences and decide which field aligns with your goals.&lt;/p&gt;

&lt;h2&gt;
  
  
  Understanding Data Science
&lt;/h2&gt;

&lt;p&gt;Data Science focuses on extracting meaningful insights from data. Every day, businesses collect enormous amounts of information—from customer behavior and sales performance to website traffic and financial transactions.&lt;/p&gt;

&lt;p&gt;A Data Scientist analyzes this data to answer important business questions and support decision-making.&lt;/p&gt;

&lt;p&gt;Typical responsibilities include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Collecting and cleaning data&lt;/li&gt;
&lt;li&gt;Building dashboards&lt;/li&gt;
&lt;li&gt;Performing statistical analysis&lt;/li&gt;
&lt;li&gt;Identifying business trends&lt;/li&gt;
&lt;li&gt;Creating predictive models&lt;/li&gt;
&lt;li&gt;Visualizing insights&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Popular tools include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Python&lt;/li&gt;
&lt;li&gt;SQL&lt;/li&gt;
&lt;li&gt;R&lt;/li&gt;
&lt;li&gt;Power BI&lt;/li&gt;
&lt;li&gt;Tableau&lt;/li&gt;
&lt;li&gt;Excel&lt;/li&gt;
&lt;li&gt;Apache Spark&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you enjoy mathematics, statistics, and problem-solving, Data Science can be an excellent career choice.&lt;/p&gt;

&lt;h2&gt;
  
  
  Understanding Artificial Intelligence
&lt;/h2&gt;

&lt;p&gt;Artificial Intelligence focuses on building systems that can perform tasks requiring human intelligence.&lt;/p&gt;

&lt;p&gt;Rather than simply analyzing data, AI enables machines to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Learn from experience&lt;/li&gt;
&lt;li&gt;Understand language&lt;/li&gt;
&lt;li&gt;Recognize images&lt;/li&gt;
&lt;li&gt;Generate content&lt;/li&gt;
&lt;li&gt;Make predictions&lt;/li&gt;
&lt;li&gt;Automate complex workflows&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI professionals often work with technologies such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Machine Learning&lt;/li&gt;
&lt;li&gt;Deep Learning&lt;/li&gt;
&lt;li&gt;Natural Language Processing (NLP)&lt;/li&gt;
&lt;li&gt;Computer Vision&lt;/li&gt;
&lt;li&gt;Large Language Models (LLMs)&lt;/li&gt;
&lt;li&gt;Generative AI&lt;/li&gt;
&lt;li&gt;AI Agents&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The rapid adoption of &lt;strong&gt;Agentic AI&lt;/strong&gt;, &lt;strong&gt;AI Copilots&lt;/strong&gt;, &lt;strong&gt;Multimodal AI&lt;/strong&gt;, and &lt;strong&gt;Generative AI&lt;/strong&gt; is creating exciting opportunities for developers and engineers worldwide.&lt;/p&gt;

&lt;h2&gt;
  
  
  Key Differences Between Data Science and AI
&lt;/h2&gt;

&lt;p&gt;Although these fields overlap, their primary objectives differ.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="https://rcm.ac.in/data-science-specialization/" rel="noopener noreferrer"&gt;Data Science&lt;/a&gt;&lt;/strong&gt; focuses on understanding and interpreting data to drive business decisions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;&lt;a href="https://rcm.ac.in/ai-blockchain-metaverse-nft-crypto/" rel="noopener noreferrer"&gt;Artificial Intelligence&lt;/a&gt;&lt;/strong&gt; focuses on building intelligent systems that learn, automate, and interact with users.&lt;/p&gt;

&lt;p&gt;Think of it this way:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data Science answers: &lt;strong&gt;"What happened and why?"&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;AI answers: &lt;strong&gt;"How can machines solve this problem automatically?"&lt;/strong&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Understanding this distinction makes it easier to choose the right learning path.&lt;/p&gt;

&lt;h2&gt;
  
  
  Which Skills Should You Learn?
&lt;/h2&gt;

&lt;h3&gt;
  
  
  For Data Science
&lt;/h3&gt;

&lt;p&gt;Employers often look for professionals who understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Statistics&lt;/li&gt;
&lt;li&gt;Probability&lt;/li&gt;
&lt;li&gt;Python&lt;/li&gt;
&lt;li&gt;SQL&lt;/li&gt;
&lt;li&gt;Data Visualization&lt;/li&gt;
&lt;li&gt;Business Analytics&lt;/li&gt;
&lt;li&gt;Machine Learning basics&lt;/li&gt;
&lt;li&gt;Dashboard Development&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Strong analytical thinking is one of the biggest advantages in this field.&lt;/p&gt;

&lt;h3&gt;
  
  
  For Artificial Intelligence
&lt;/h3&gt;

&lt;p&gt;AI professionals typically require knowledge of:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Python Programming&lt;/li&gt;
&lt;li&gt;Machine Learning&lt;/li&gt;
&lt;li&gt;Deep Learning&lt;/li&gt;
&lt;li&gt;Neural Networks&lt;/li&gt;
&lt;li&gt;Prompt Engineering&lt;/li&gt;
&lt;li&gt;TensorFlow&lt;/li&gt;
&lt;li&gt;PyTorch&lt;/li&gt;
&lt;li&gt;AI Model Deployment&lt;/li&gt;
&lt;li&gt;Large Language Models (LLMs)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;As AI technologies continue to evolve, continuous learning becomes an essential part of every AI career.&lt;/p&gt;

&lt;h2&gt;
  
  
  Career Opportunities
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Data Science Roles
&lt;/h3&gt;

&lt;p&gt;Popular job profiles include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Data Scientist&lt;/li&gt;
&lt;li&gt;Data Analyst&lt;/li&gt;
&lt;li&gt;Business Intelligence Analyst&lt;/li&gt;
&lt;li&gt;Analytics Consultant&lt;/li&gt;
&lt;li&gt;Data Engineer&lt;/li&gt;
&lt;li&gt;Product Analyst&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These professionals work across industries such as finance, healthcare, e-commerce, education, logistics, and retail.&lt;/p&gt;

&lt;h3&gt;
  
  
  AI Career Roles
&lt;/h3&gt;

&lt;p&gt;Artificial Intelligence opens opportunities such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;AI Engineer&lt;/li&gt;
&lt;li&gt;Machine Learning Engineer&lt;/li&gt;
&lt;li&gt;NLP Engineer&lt;/li&gt;
&lt;li&gt;Computer Vision Engineer&lt;/li&gt;
&lt;li&gt;Generative AI Developer&lt;/li&gt;
&lt;li&gt;AI Solutions Architect&lt;/li&gt;
&lt;li&gt;Robotics Engineer&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The increasing demand for AI-powered products means these roles are expected to remain highly valuable over the coming years.&lt;/p&gt;

&lt;h2&gt;
  
  
  Which Field Has Better Career Growth?
&lt;/h2&gt;

&lt;p&gt;Both careers offer strong growth, but the answer depends on your interests.&lt;/p&gt;

&lt;p&gt;Choose &lt;strong&gt;Data Science&lt;/strong&gt; if you enjoy:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Working with data&lt;/li&gt;
&lt;li&gt;Finding business insights&lt;/li&gt;
&lt;li&gt;Building dashboards&lt;/li&gt;
&lt;li&gt;Solving analytical problems&lt;/li&gt;
&lt;li&gt;Supporting business strategy&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Choose &lt;strong&gt;Artificial Intelligence&lt;/strong&gt; if you enjoy:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Programming&lt;/li&gt;
&lt;li&gt;Building intelligent applications&lt;/li&gt;
&lt;li&gt;Automation&lt;/li&gt;
&lt;li&gt;Robotics&lt;/li&gt;
&lt;li&gt;Generative AI&lt;/li&gt;
&lt;li&gt;Developing AI-powered software&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In reality, many companies now expect professionals to understand both disciplines. A Data Scientist often applies Machine Learning models, while an AI Engineer frequently works with large datasets.&lt;/p&gt;

&lt;p&gt;The boundaries between these fields are becoming increasingly interconnected.&lt;/p&gt;

&lt;h2&gt;
  
  
  Industry Trends to Watch in 2026
&lt;/h2&gt;

&lt;p&gt;Technology is evolving rapidly, and several trends are shaping the future of both AI and Data Science.&lt;/p&gt;

&lt;p&gt;Some of the biggest trends include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Generative AI&lt;/li&gt;
&lt;li&gt;AI Agents&lt;/li&gt;
&lt;li&gt;AI-Assisted Development&lt;/li&gt;
&lt;li&gt;Responsible AI&lt;/li&gt;
&lt;li&gt;Explainable AI (XAI)&lt;/li&gt;
&lt;li&gt;MLOps&lt;/li&gt;
&lt;li&gt;Predictive Analytics&lt;/li&gt;
&lt;li&gt;Edge AI&lt;/li&gt;
&lt;li&gt;Real-Time Data Processing&lt;/li&gt;
&lt;li&gt;AI Automation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Students who stay updated with these technologies will have a competitive advantage in the job market.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Practical Learning Matters
&lt;/h2&gt;

&lt;p&gt;Learning theories alone isn't enough to build a successful technology career.&lt;/p&gt;

&lt;p&gt;Employers increasingly prefer candidates who can demonstrate practical experience through:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Capstone projects&lt;/li&gt;
&lt;li&gt;Hackathons&lt;/li&gt;
&lt;li&gt;GitHub portfolios&lt;/li&gt;
&lt;li&gt;Open-source contributions&lt;/li&gt;
&lt;li&gt;AI model development&lt;/li&gt;
&lt;li&gt;Data analytics dashboards&lt;/li&gt;
&lt;li&gt;Industry internships&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Hands-on learning helps students understand how technologies are applied in real business environments.&lt;/p&gt;

&lt;p&gt;Recognizing this industry shift, institutions such as the &lt;strong&gt;&lt;a href="https://rcm.ac.in/" rel="noopener noreferrer"&gt;Regional College of Management (RCM)&lt;/a&gt;&lt;/strong&gt; are integrating project-based learning, internships, AI-focused coursework, Data Science, Full Stack Development, and industry collaborations into their technology programs. This practical approach helps students develop both technical expertise and workplace-ready skills.&lt;/p&gt;

&lt;h2&gt;
  
  
  Final Thoughts
&lt;/h2&gt;

&lt;p&gt;Choosing between Data Science and Artificial Intelligence isn't about selecting the "better" field—it's about choosing the one that matches your interests and long-term career goals.&lt;/p&gt;

&lt;p&gt;Data Science empowers organizations to make smarter decisions through data, while Artificial Intelligence focuses on building systems that can learn, automate, and solve complex problems.&lt;/p&gt;

&lt;p&gt;As businesses continue adopting AI-powered technologies, professionals who combine programming, analytics, machine learning, cloud computing, and problem-solving skills will remain in high demand.&lt;/p&gt;

&lt;p&gt;No matter which path you choose, keep learning, build real-world projects, contribute to open-source communities, and stay curious. In today's tech landscape, adaptability is one of the most valuable skills you can have.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What would you choose—Data Science or Artificial Intelligence?&lt;/strong&gt; Share your thoughts and career goals in the comments below!&lt;/p&gt;

</description>
      <category>deeplearning</category>
      <category>machinelearning</category>
      <category>datascience</category>
      <category>ai</category>
    </item>
    <item>
      <title>Meituan Open-Sources 1.6T-Parameter LongCat-2.0 Trained on Domestic Chips</title>
      <dc:creator>gentic news</dc:creator>
      <pubDate>Tue, 30 Jun 2026 09:38:15 +0000</pubDate>
      <link>https://dev.to/gentic_news/meituan-open-sources-16t-parameter-longcat-20-trained-on-domestic-chips-a4i</link>
      <guid>https://dev.to/gentic_news/meituan-open-sources-16t-parameter-longcat-20-trained-on-domestic-chips-a4i</guid>
      <description>&lt;blockquote&gt;
&lt;p&gt;Meituan open-sourced 1.6T-parameter LongCat-2.0 trained on 50,000 domestic ASICs, claiming China's first full-process domestic-chip trillion-parameter model.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Meituan open-sourced LongCat-2.0, a 1.6 trillion-parameter LLM trained entirely on domestic chips. The model claims to be China's first trillion-parameter AI fully pre-trained and inferred on a 50,000-card ASIC cluster.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Key facts&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;1.6 trillion parameters in LongCat-2.0.&lt;/li&gt;
&lt;li&gt;1 million-token context window.&lt;/li&gt;
&lt;li&gt;50,000-card domestic ASIC cluster used for training.&lt;/li&gt;
&lt;li&gt;DeepSeek V4-pro also has 1.6 trillion parameters.&lt;/li&gt;
&lt;li&gt;Open-sourced on Tuesday by Meituan.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Food delivery giant Meituan on Tuesday open-sourced LongCat-2.0, a large language model boasting 1.6 trillion parameters and a 1 million-token context window &lt;a href="https://www.scmp.com/tech/tech-trends/article/3358854/china-debuts-biggest-ai-model-trained-local-chips-meituan-releases-longcat-20" rel="noopener noreferrer"&gt;According to SCMP&lt;/a&gt;. The Beijing-based company claimed this is the industry's first trillion-parameter model to complete full-process training and inference on a 50,000-card domestic computing power cluster built with AI ASIC superpods.&lt;/p&gt;

&lt;h3&gt;
  
  
  Beyond Inference
&lt;/h3&gt;

&lt;p&gt;While DeepSeek's V4-pro (1.6 trillion parameters, launched April 2026) relied on home-grown chips only for inference, Meituan says LongCat-2.0 used domestic hardware for both pre-training and inference. Pre-training is far more computationally intensive — it involves digesting massive datasets to learn basic patterns. This marks a significant step for China's push to move domestic chips beyond inference workloads.&lt;/p&gt;

&lt;h3&gt;
  
  
  The Hardware Question
&lt;/h3&gt;

&lt;p&gt;Meituan did not disclose the specific ASIC vendor or chip performance metrics. The claim of a 50,000-card cluster raises questions about interconnect efficiency and training stability at scale on non-Nvidia hardware. DeepSeek's V4-pro, by contrast, used domestic chips only for inference — a less demanding task — while likely relying on Nvidia or other foreign GPUs for pre-training, though DeepSeek has not confirmed that.&lt;/p&gt;

&lt;h3&gt;
  
  
  Open-Source and Context
&lt;/h3&gt;

&lt;p&gt;LongCat-2.0 is open-sourced, following Meituan's earlier LongCat-1.0 release. The 1 million-token context window matches frontier models like DeepSeek V4 (which achieved 500K context with FlashMemory optimization in June 2026) and positions LongCat for long-document and enterprise RAG use cases. Meituan has not published benchmark results on standard evaluations like MMLU, HumanEval, or SWE-Bench.&lt;/p&gt;

&lt;h2&gt;
  
  
  What to watch
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fthnrv4ikbfbqax2fhztg.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fthnrv4ikbfbqax2fhztg.jpeg" alt="Meituan Releases LongCat-Flash-Thinking-2601, Setting a New Benchmark ..." width="800" height="337"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Watch for benchmark results from Meituan on standard evaluations like MMLU, HumanEval, and SWE-Bench. Also track whether DeepSeek responds with a fully domestic-chip pre-training claim for its next model, potentially V5.&lt;/p&gt;




&lt;p&gt;&lt;strong&gt;Source:&lt;/strong&gt; &lt;a href="https://www.scmp.com/tech/tech-trends/article/3358854/china-debuts-biggest-ai-model-trained-local-chips-meituan-releases-longcat-20" rel="noopener noreferrer"&gt;scmp.com&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Originally published on &lt;a href="https://gentic.news/article/meituan-open-sources-1-6t" rel="noopener noreferrer"&gt;gentic.news&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>research</category>
      <category>deeplearning</category>
    </item>
    <item>
      <title>Quantum Tagging: Authenticating Location via Quantum Information andRelativistic Signalling Constraints</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 09:30:28 +0000</pubDate>
      <link>https://dev.to/paperium/quantum-tagging-authenticating-location-via-quantum-information-andrelativistic-signalling-1o26</link>
      <guid>https://dev.to/paperium/quantum-tagging-authenticating-location-via-quantum-information-andrelativistic-signalling-1o26</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Directed clustering in weighted networks: a new perspective</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 08:20:28 +0000</pubDate>
      <link>https://dev.to/paperium/directed-clustering-in-weighted-networks-a-new-perspective-2a8l</link>
      <guid>https://dev.to/paperium/directed-clustering-in-weighted-networks-a-new-perspective-2a8l</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case ofAutomatic Genre Identification</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 07:10:29 +0000</pubDate>
      <link>https://dev.to/paperium/chatgpt-beginning-of-an-end-of-manual-linguistic-data-annotation-use-case-ofautomatic-genre-12hk</link>
      <guid>https://dev.to/paperium/chatgpt-beginning-of-an-end-of-manual-linguistic-data-annotation-use-case-ofautomatic-genre-12hk</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Modeling and Analysis of Uplink Non-Orthogonal Multiple Access (NOMA) inLarge-Scale Cellular Networks Using Poisson Cluster Proc</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 06:00:29 +0000</pubDate>
      <link>https://dev.to/paperium/modeling-and-analysis-of-uplink-non-orthogonal-multiple-access-noma-inlarge-scale-cellular-3c97</link>
      <guid>https://dev.to/paperium/modeling-and-analysis-of-uplink-non-orthogonal-multiple-access-noma-inlarge-scale-cellular-3c97</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
    <item>
      <title>Role of Digital Twin in Optical Communication: Fault Management, HardwareConfiguration, and Transmission Simulation</title>
      <dc:creator>Paperium</dc:creator>
      <pubDate>Tue, 30 Jun 2026 04:50:28 +0000</pubDate>
      <link>https://dev.to/paperium/role-of-digital-twin-in-optical-communication-fault-management-hardwareconfiguration-and-2g6h</link>
      <guid>https://dev.to/paperium/role-of-digital-twin-in-optical-communication-fault-management-hardwareconfiguration-and-2g6h</guid>
      <description>&lt;p&gt;{{ $json.postContent }}&lt;/p&gt;

</description>
      <category>ai</category>
      <category>deeplearning</category>
      <category>computerscience</category>
      <category>machinelearning</category>
    </item>
  </channel>
</rss>
