DEV Community

ninghonggang
ninghonggang

Posted on

Why the December 2025 AI IDE Rankings Are Scoring the Wrong Category

I spent the morning reading two December 2025 Juejin AI IDE ranking posts back to back, and the thing that finally crystallized for me is that the Chinese-language AI IDE roundups are still scoring the same eight or nine autocomplete-first tools, while the actual market has moved on to Cursor, Claude Code, and the agent runtime layer, and almost nobody is calling out the gap. The first list scored Tencent CodeBuddy at 9.6, Sourcegraph Cody at 8.2, Replit Ghostwriter at 8.0, Codeium at 7.8, Tabnine at 7.6, Amazon CodeWhisperer at 7.5, JetBrains AI Assistant at 7.4, Blackbox at 7.2. The second list was nearly identical, with the only real difference being that they swapped CodiumAI in for Tencent CodeBuddy and called it a nine-tool ranking. Seven of the names appeared in both, with the same scores to the decimal, which is the part that has been rattling around in my head all morning.

The piece that pushed me over the edge was noticing that almost none of the eight tools showing up in both rankings are tools I have heard any of my English-speaking colleagues mention in the last six months. Cursor has eaten the dev mindshare on Twitter and dev.to. Claude Code has eaten the engineering-team mindshare on Hacker News and Reddit. GitHub Copilot is still the default for the average bootcamp graduate, sure, but the conversation in 2026 is about agent runtimes, not autocomplete-with-chat. To be fair CodeBuddy and Sourcegraph Cody are real products with real users, and I am taking the 9.6 score column with a grain of salt because the post that gave it that number was transparently written by a CodeBuddy advocate, but the shape of the list is the part that matters. The Chinese-language IDE rankings are scoring a 2024 product category, and the roundup format is doing a poor job of reflecting where the engineers I know are actually spending money.

The meta-pattern I want to put down before I forget it is that there is now a visible gap between what the consumer-facing AI IDE roundups are scoring and what the GitHub trending recaps are surfacing, and the gap is getting wider, not narrower. The two December 2025 IDE rankings spent most of their column space on autocomplete latency, response speed, and the standard enterprise-feature checklist of SSO and SOC2 compliance. The April and October 2025 GitHub trending recaps surfaced Agent-S for GUI automation, Graphiti for memory graphs, supermemory for persistent context, FastMCP for plugin frameworks, claude-cookbooks as the Anthropic reference, and most recently AGENTS.md as a proposed standard for handing project context to AI coding agents. None of those are autocomplete tools. None of them are in either of the two Juejin IDE rankings. I am a little skeptical of any AI IDE ranking that scores tools on the 2024 axis and ignores the 2026 axis, because the engineers picking tools right now are not buying autocomplete with chat. They are buying agent runtimes that can call into a memory layer and write to a structured project context file.

Honestly I think the practical takeaway here is that if you are reading a Chinese-language AI IDE ranking to decide what to pay for in 2026, you are reading the wrong document. The Chinese roundups are still useful for tracking the domestic-only products like Tencent CodeBuddy and the Chinese versions of CodeWhisperer, and they are fine for getting a read on enterprise procurement preferences in the Asia-Pacific region. But for the question of which IDE you should be coding in tomorrow, the answer is almost certainly going to come from the GitHub trending recaps and the English-language reviews of Cursor and Claude Code, not from a list that scored Codeium at 7.8 in December 2025 and has not been updated since. I have not stress-tested CodeBuddy or Cody the way I have with Cursor and Claude Code, so I want to actually run them for a quarter before I oversell or undersell them, but the fact that two separate Chinese-language rankings produced the same seven-tool overlap with the same decimal scores tells me the roundup format itself is being copy-pasted rather than written from scratch, and that is a problem I want to call out before more people make tooling decisions based on it.

I will reassess in three months. The last time I said that I was mostly bouncing between Cursor and Claude Code for coding and ChatGPT for everything else, which is still roughly where I land. What has changed is that I now treat the GitHub trending recaps as the source of truth for what to pay attention to, and the Juejin IDE roundups as a lagging indicator for the Asia-Pacific enterprise market specifically, and I think that split is going to age well. Give it six months and I expect the Chinese-language roundups to start including Cursor and Claude Code and the agent runtime repos in their lists, and when that happens I will know the format has finally caught up with the market.

Top comments (0)