DEV Community

# deepseek

DeepSeek is a Chinese artificial intelligence company that developed DeepSeek-R1, an open-source AI model that rivals leading models such as OpenAI's o1 in performance. The model was created in under two months at a cost of less than $6 million, significantly less than its competitors spent.

Posts

DeepSeek-V3: The 671B MoE Model You Can Run Locally in 2026

9 min read
I Was the Retrieval Layer

3 min read
DeepSeek V4: What's Inside, How It Compares, and Where It Actually Wins

9 min read
27 days to the DeepSeek V4-Pro cliff: what a 4x price jump looks like in production

5 min read
One Year After DeepSeek and The $600 Billion Question

9 min read
dsk++ — I rewrote a forgotten DeepSeek library to be fully async (and added a bunch of stuff)

2 min read
DeepSeek V4 Day: It's About Infra, Not the Model

8 min read
Why Does DeepSeek Pursue Alpha in Finance?

7 min read
Build the eval set before you swap the model.

4 min read
1.6 Trillion Parameters Just Went Open Source. What About the Other Direction?

5 min read
DeepSeek-R1 Reasoning API: Production Guide with Chain-of-Thought (2026)

7 min read
Fixing the Missing <think> Tag Glitch When Running DeepSeek V3.2 GGUF on CPU

1 min read
DeepSeek V4's Real Innovation Isn't Scale—It's Memory Architecture

3 min read
DeepSeek V4 Released: Open-Source 1.6T MoE, 1M Context, Apache 2.0 — and It's Already on the API

6 min read
I Stumbled Into a 40x Cost Reduction by Switching to Chinese AI Models

2 min read