GokuScraper悟空爬虫

Posted on May 21

Why Can't the Chinese Internet Nurture a "Generous Hugging Face"?

#ai #opensource #machinelearning #discuss

Why Can't the Chinese Internet Nurture a "Generous Hugging Face"?

If you are an AI developer, Hugging Face is probably your go-to place for "freebies."

Llama, Gemma, Qwen... you can download model weight files that are tens or hundreds of gigabytes in size with a single click, completely unrestricted.

Want to upload your fine-tuned models? Go ahead, it's totally free for public repositories, and even private ones have a free tier.

Want to showcase a live demo? Spaces is free to use, and they even throw in some computing power. Developers happily take advantage of all this, while Hugging Face gives it away effortlessly.

Some people even use it as a cloud drive or CDN. The experience feels a lot like GitHub—store your code freely, long live the open-source spirit!!!

But if you turn your attention to China, the picture looks completely different.

ModelScope, WiseModel, OpenI... these platforms are certainly working hard to build open-source ecosystems, but you’ll notice a subtle difference: download speeds are strictly calculated and controlled, uploading files requires a stricter review process, and various "anti-freeloader" mechanisms lurk in the background, ready to throttle you if you aren't careful.

The overall vibe can be summed up in a few words: strictly managed and meticulously calculated.

This raises a puzzling question. They are all AI model hosting platforms, both catering to developers looking for free resources—so why are their postures so different?

Is Hugging Face just "dumb and rich," or do domestic platforms lack the "bigger picture"? How exactly is the underlying math of these costs calculated?

The answer lies in something seemingly inconspicuous but actually incredibly heavy: public network bandwidth.

In this article, we’ll cut through this angle and peel back the layers: who is really paying for Hugging Face’s "generosity"? Why is public network bandwidth in China absurdly expensive? How does the price gap in residential broadband spawn the PCDN gray-market arbitrage? And why did telecom carriers in 2026 crack down so hard on "freeloaders"?

Hugging Face’s “Generosity” Is Paid For By Tech Giants

At first glance, Hugging Face seems like the ultimate charity in the AI world.

Developers can upload models freely, often moving weight files in the tens or hundreds of gigabytes, with unlimited and unthrottled downloads.

By 2026, the number of public models on HF had surpassed 2.5 million. This level of generosity would make even GitHub call it "big brother"—after all, how big is a code repository? The weight file of a mainstream 70B model in 2026 is equivalent in size to hundreds of thousands of code repos.

So, in the minds of many developers, Hugging Face is like a living Bodhisattva with more money than sense.

But if you believe that a unicorn with a secondary market valuation approaching $9 billion is surviving purely out of the goodness of its heart by "running on love," you're looking at the business world through rose-colored glasses.

Peeling back this layer of free offerings, HF’s strategy for monetizing B2B operations in 2026 was already highly mature:

First, the Enterprise Hub:

Pfizer, Bloomberg, and even Apple and Tesla... these tech giants won't put their core models on a public platform. They need private deployments, extremely strict permission management, and SLA guarantees. Pay up, and HF sets it all up flawlessly.

Second, Compute Reselling and Inference Endpoints:

By 2026, model deployment is where the real money is. HF rents cloud-based GPUs on an hourly basis, letting you turn models into production-ready APIs with a single click. Just like that, it became the world’s largest middleman for AI compute.

Third, Extending from Software to Hardware:

In 2025, HF acquired the French robotics startup Pollen Robotics. Today’s HF doesn't just let you download code; it lets you download action datasets for robots. It has started selling its own open-source hardware, aiming to plant a flag in the physical world.

You see, HF isn't avoiding taking money. It just points its "free" offerings at end-user developers and reserves its "fees" for enterprise clients with budgets. This is a classic "build the ecosystem first, harvest the B-side later" playbook.

But this playbook isn't enough to explain how it can "burn" cash so lavishly. Its real trump card lies in its list of strategic financial backers.

Although its Series D round in 2023 capped out at $4.5 billion, entering 2026, a "luxury syndicate of backers" comprising Google, Amazon, NVIDIA, Salesforce, Intel, AMD, and Qualcomm continues to inject cash. This is not ordinary financial investment; it is collective strategic life support:

For NVIDIA: HF is the place where developers globally download models and run inference. More models running means more demand for GPUs—the money they invest in HF is essentially buying an "entry ticket" for their CUDA ecosystem.

For Cloud Giants (AWS/GCP/Azure): HF's Spaces and Inference Endpoints run on AWS and GCP. Bandwidth? Compute? Provided directly at "internal rates" or even "resource credits," reducing costs to almost zero. If you don't use HF, you might just use their cloud directly anyway. Giving it to HF buys a good reputation for "supporting the open-source ecosystem" and boosts developer retention.

This is the underlying logic behind HF’s "generosity": The money it loses is the "military budget" in the global war for AI supremacy.

In this logical chain, HF plays the role not of a "bandwidth buyer," but of an "internet tollgate."

When developers globally make it a default habit to push code and models to HF, it controls the Strait of Hormuz of the AI world. Whoever wants developer attention and habits must give HF money, resources, and help it burn cash.

So, Hugging Face isn't squandering money. It’s using strategic losses to bet on becoming the foundational standard of AI infrastructure. Once this ecosystem is built, the data value, network effects, and switching costs will each serve as an ironclad economic moat.

It’s burning cash to build an empire.

With the Same Playbook, Why Can't Domestic Platforms Keep Up?

Understanding HF’s underlying logic, looking back at domestic platforms may leave you even more confused.

ModelScope, WiseModel, OpenI... don't they want to emulate HF? Don’t they want to wave a wand and let developers upload and download freely to secure the ecosystem first?

It's not that they don't want to; the math just doesn't add up.

The starting point of the contradiction is a cognitive bias that is hard for an average person to notice—we think "broadband is cheap." 1000M fiber optic internet at home costs a few dozen bucks a year, and downloading a movie takes seconds.

Based on this standard, how expensive could platform bandwidth really be?

Sorry, but the stuff the platform uses and the "broadband" in your home are two entirely different commodities.

The "Dual Pricing" of Public Network Bandwidth

What you have installed at home is called "residential broadband." It has two hidden attributes you might not know: First is shared overselling—100 households in a building share one main outlet, and the carrier bets that you won't all be downloading at top speed simultaneously. Second is upload throttling—1000M refers to downstream traffic. Upload speeds are usually capped around 30M to 50M. Furthermore, your contract clearly states, "For residential use only, commercial use is prohibited."

What AI model hosting platforms need, however, is IDC (Internet Data Center) bandwidth. What does that require? It must be dedicated, symmetrical, and full-duplex. If someone downloads a model, you have to upload it. Upload and download speeds must be identical. And it's not for one household; it's for thousands or tens of thousands of simultaneous users. This bandwidth also needs to be BGP multi-line—ensuring fast speeds regardless of whether the visitor is using China Telecom, China Unicom, or China Mobile.

The price? Residential 1000M costs a few dozen bucks a year. IDC’s dedicated 1000M BGP bandwidth costs tens of thousands or even over a hundred thousand RMB a year. A thousand-fold price difference.

It's the same bottle of water. Tap water boiled at home and mineral water sold at KTV are both H₂O, but their cost and pricing logic aren't on the same spreadsheet.

Sky-High "Toll Fees"

If it were just expensive, that would be one thing. But what really drives domestic BGP bandwidth to astronomical prices is the "siloed" layout of the top three national carriers.

China Telecom, China Unicom, and China Mobile each built their own network. In many areas, these three networks do not interconnect freely—or rather, they can connect, but they charge a "toll fee," technically known as a peer-to-peer settlement.

Suppose you are a platform and only bought bandwidth from China Telecom to save money. When Unicom and Mobile users come to download models, the data packets have to cross from the Telecom network to the other two. Every time it crosses, the carriers settle the cost. If it crosses too much, the user doesn't experience speed, but lag—packet loss, latency, crawling along at a few KB.

When the user experience falls apart, platforms have no choice but to bite the bullet and pay for BGP. BGP is more expensive because it is effectively renting right-of-way from all three carriers simultaneously. Data can use anyone's network, efficiently finding the optimal path, and all settlement costs are included. It's expensive not because of the tech, but because of the coordination, settlement, and invisible "toll fees."

The "Cross-Subsidy" Calculation

Okay, at this point you might ask: Why can residential broadband be squashed so cheap while data center bandwidth can't drop its prices?

This gets to the deeper operational logic of China's telecom industry: Cross-subsidization.

In China, broadband is not just a commodity; it has the attributes of a quasi-public good. Carriers have a strict mandate for "universal service"—even a village sitting at 4,000 meters above sea level needs fiber optics and 4G coverage. From a purely economic standpoint, such projects wouldn’t break even in a hundred years.

Who covers those massive losses? Since prices for residential broadband are compressed to rock bottom, they have to make it up somewhere else. Enterprise users, especially those buying IDC and BGP bandwidth, become the highly anticipated "cash cows." Carriers take the infrastructure costs they lose on the consumer side and tack them onto the price of business products, using the "whales" to subsidize the "retail investors."

Therefore, the bandwidth domestic AI platforms purchase doesn't just reflect its own value; it inadvertently bears part of the societal cost of universal service. HF in the US or Europe can use peering to exchange traffic at extremely low costs, but domestic platforms must pay hard cash for every megabyte, while also helping amortize the fiber-optic bill for distant villages.

Compliance Costs

Finally, there is an extra expense unique to the domestic market: Content moderation.

When HF hosts a model, copyright and open-source licenses are the developer’s responsibility. But in China, platforms must take responsibility for the safety of uploaded content. Every model file and every online Space demo requires a backend running sensitive word filters, image safety scans, and illegal content blocking. The larger the file, the more compute and time these scans consume.

Think about it: how many GPU hours does it take to just open and check a 70B parameter model that is hundreds of gigabytes in size? Foreign platforms largely dodge these moderation costs or bear less responsibility, but for domestic platforms, it’s a non-negotiable hard expense.

So It's Actuarial Precision, Not Being Cheap

Laying out these four layers makes the ledger clear.

Hugging Face’s bandwidth and compute costs are directly wiped out as "ecosystem credits" by their deep-pocketed backers. Domestic platforms face a thousand-fold premium on IDC bandwidth, interconnect settlements from three carriers, universal service costs hidden in the bill, and the unavoidable expense of compliance audits.

When every megabyte of traffic comes with a price tag of real gold and silver, you can’t afford not to calculate with precision.

It's not an issue of having "no vision." It's just that the people running the numbers are genuinely burning their own cash.

The Temptation of the Gray Area: When People Use Residential Broadband for Commercial Jobs

Earlier we mentioned the thousand-fold price gap between residential and commercial broadband.

To the average person, this gap is just an exasperated sigh about "how expensive enterprise internet is." But to another group of people, it's a crack shining with gold—as long as you can "package" residential traffic as a commercial good, the difference is pure profit.

Thus, a massive gray-market industry quietly sprung up.

Residential broadband may have plenty of flaws—upload throttling, dynamic IPs, deeply nested NATs—but they are all trumped by one word: Cheap. When you can buy a whole year of 1000M residential broadband for $70, while an enterprise pays tens of thousands for the exact same speed commercially, clever folks will naturally wonder: is there a way to pool thousands of cheap pipes into a commercial torrent that can be sold for profit?

The answer is yes. This business is called PCDN (Peer-to-Peer Content Delivery Network).

How Does PCDN Work? — The Case of OneCloud and JD Cloud Wireless Router

The name PCDN is all too familiar to hardware-obsessed tech geeks. Whether it’s OneCloud or JD Cloud Wireless Router, they are essentially running the exact same hustle.

The game is incredibly simple: You buy a set-top box and plug it in at home, or install a client on your NAS or router. It sits there quietly in the background, consuming a fraction of your upload bandwidth and a bit of your idle hard drive space. In exchange, you get a "power subsidy" of a few dimes to a few bucks a day.

That’s the user’s perspective. What are the manufacturers doing behind the scenes?

They collect hundreds of thousands or even millions of these "boxes" nationwide, weaving them into a "distributed bandwidth network" covering almost every neighborhood in every city. Then, they knock on the doors of video streaming sites and hand over a quote: Hey, aren't you trying to serve HD video to users on iQIYI, Bilibili, and Douyu? Aren’t you paying carriers tens or hundreds of millions in commercial CDN bandwidth fees every year? Look, I'll use my "residential network" to handle your delivery for a third of the price.

Looking at that quote, it’s hard for a streaming site not to be tempted. After all, when a user plays a video, they pull the data from a nearby local node, and the experience feels nearly identical. But the site can save millions of actual dollars a year. Why wouldn't they do it?

And so, the profit loop is closed: Users get electricity subsidies, PCDN vendors pocket the arbitrage, and streaming sites save their budgets.

The Only Loser: The Telecom Carriers

Now, look at this picture from the top floor of a telecom carrier’s office.

You’ve worked tirelessly to lay fiber optics and build infrastructure, fulfilling "universal service" commitments in remote villages even if it means losing money. Your business model was supposed to be simple: use affordable residential broadband to cover the masses, and use pricey commercial broadband to make the profits back from enterprise clients.

Instead, a group of people exploited a loophole in this pricing system. They took your cheap residential water pipes, hooked them up to your high-priced commercial reservoir, and siphoned off the toll fees you were supposed to collect from enterprise clients.

From the carrier’s perspective, what is this called? This isn't technical innovation; it's commercial arbitrage. In plain English, they're fleecing the landlord.

The foundation of the PCDN business isn't advanced tech; it is the fact that the price of civilian broadband in China has been suppressed by the state to levels far below market value. It’s a business model built on price control arbitrage—essentially the exact same logic as using subsidized industrial electricity to mine Bitcoin.

Residential broadband agreements explicitly state "For residential use only, commercial use is prohibited," but for a long time, carriers looked the other way. After all, they were trying to grab market share, installation numbers were key KPIs, and it wasn't wise to pursue these issues too aggressively.

But now, the AI era has arrived. Large model files are routinely tens of gigabytes, and multimodal training data is dealing in astronomical numbers. Traffic consumption is orders of magnitude larger than in the video era. If carriers don't start hauling in the nets now, it won’t just be video traffic leaking into PCDN. Even the distribution of large models could get devoured by this "army of ants" residential network.

It’s no longer a matter of a few bucks for an electricity bill. It’s threatening the carriers' fundamental revenue structure.

Therefore, the rules had to tighten. The gray areas had to be illuminated. And this time, carriers were genuinely preparing for a fatal crackdown.

How Does Bandwidth Cost Shape Our Internet?

In the previous sections, we unpacked the economics, gray-market arbitrage, and regulatory tug-of-war riding on a broadband pipe. But the story so far is missing one final puzzle piece—how do all these things added together actually shape the internet we are currently experiencing?

Hugging Face becoming the global "default repo" for AI hinges on one easily overlooked prerequisite: its bandwidth costs were strategically erased by tech giants.

Domestic platforms don't have that prerequisite. When every gigabyte of a model download has to be tallied as hard costs, a purely free, unlimited model is fundamentally impossible from day one. It’s not an issue of "not wanting to copy HF"; the ledger is right there, and there's no way around it.

Therefore, China's "Hugging Face equivalents" are destined to take a path that commercializes earlier and hits the ground sooner. While they are still in the phase of building an ecosystem, they are forced to consider: Should this download button be throttled? Should we charge a small distribution fee for this large model file? Where do we draw the line between public and private repositories so that we can attract developers without bankrupting ourselves?

The tighter this ledger is balanced, the narrower the space for free and open access becomes.

Taking it a step further, high bandwidth costs could become an invisible barrier blocking domestic large models from going global. When domestic developers want to publish their fine-tuned Qwen or DeepSeek models to international markets, who pays the cross-border bandwidth bill? If hosted on a domestic platform, the download speeds will make foreign users want to smash their keyboards. If uploaded to HF, the data and models are handed over to someone else's infrastructure. This dilemma is essentially a structural scar etched into the industry framework by bandwidth costs.

The current domestic model—using low residential prices to fulfill universal service obligations, and using high commercial prices to recoup costs—was formed during a specific historical period. It efficiently solved the mission of "getting 1.4 billion people online." But as the AI era arrives, as every developer wants to distribute tens of gigabytes of models, and as PCDN undermines the pricing system with its "ant colony" approach, the cracks in this model are becoming increasingly obvious.

Raising prices blindly merely patches up immediate leaks. The long-term problem is that high bandwidth costs raise the barrier of entry for the entire ecosystem. The development, distribution, and iteration of AI all require massive traffic exchanges. If the cost of exchange is too high, the speed of progress slows down.

Is there a solution? The peering culture abroad serves as a point of reference—carriers, and large enterprises working with carriers, swapping traffic for free to lower the flow costs of the entire network. But this requires competition, it requires more players entering the market, and it requires breaking down the invisible walls between the silos.

That’s not easy. But only tackling the tough issues will decide the foundation of the internet for the next decade.

When we talk about bandwidth, we are actually talking about the infrastructure fairness of this era—who gets to build it, who can afford to use it, and who gets shut out. The answer to that is vastly more important than the price of a strand of fiber optic cable.

Traffic Isn't Free; Someone Is Just Picking Up the Check

Late at night, you click the download button on a multi-gigabyte model. The progress bar starts moving, and you turn around to pour yourself a glass of water. When you return, the model is resting quietly on your hard drive.

Those tens of gigabytes of data traversed Pacific subsea cables, were exchanged for free at some IX (Internet Exchange), sprinted across the backbone networks of AWS or Google Cloud, and finally arrived at your home router. You didn't spend a single cent in the entire process.

In that fleeting moment, you wouldn't think about what stands behind those bytes of data: Google's strategic investments, NVIDIA's ecosystem layout, interconnect settlements among the top three Chinese carriers, infrastructure subsidies for remote mountain villages, and some PCDN gamer who just got their internet cut off for anomalous upload traffic.

Hugging Face's "free access" is the war chest of tech giants fighting for ecosystem dominance, burning bright and decisive. Domestic platforms' "limitations" are born of survival rationality under high cost pressures and strict compliance demands, calculating every penny against their will. The rise and fall of PCDN represents a gray-market weed that sprouted in the thousand-fold price crack between "residential" and "commercial" broadband, only to be uprooted.

These three things seem disconnected, but they all point to the same truth: Not a single byte on the internet is truly free. It is merely being paid for in a place you cannot see.

Sometimes the person paying the bill is a strategic investor; sometimes it's an enterprise paying commercial bandwidth fees; sometimes it's the infrastructure budget for a far-flung village; sometimes it's crossfire catching a tech enthusiast; and sometimes it's an average person paying a monthly broadband bill, never suspecting they belong to the crowd doing the subsidizing.

Understanding bandwidth isn't about understanding the transmission speed of a fiber optic cable. It is about understanding how pricing, game theory, subsidies, arbitrage, and regulation all crash together simultaneously on a single wire.

And that, perhaps, is half of what makes up the internet.

Thank you all for reading! If you found this interesting, please like, share, and hit the 'in-look' button without hesitation!

To see my articles as soon as they drop in the future, don’t forget to add a star ⭐, so you won't lose track of the page.

Alright, that's it for today.