I shipped a Weibo scraper on Apify eight months ago. Fifteen customers pay me for it now, another thirty-four use the free tier, and in the last sixteen days they pulled 136,400 posts through it. I built it because every Western social-listening tool I evaluated (Synthesio, Brandwatch, Meltwater) quoted four to five figures a year for China coverage that was thinner than what you get from one tuned Apify run.
The whole pitch is one number: $0.005 per post. Pay only for items you actually take. The Apify free plan covers your first ~1,000 mentions before you spend a cent.
Weibo Scraper on Apify Store →
What's actually in the box
Four modes. All return normalized JSON. No Weibo login. No API key from Weibo. No VPN.
- hot_search: the live hot-topics list, i.e. what 580M+ monthly active users are looking at right now. The single most-watched signal in Chinese social.
- search: keyword search across public posts. Brand names, ticker symbols, product launches, Chinese or English.
- post_comments: every public comment on a given post. Sentiment grenades and viral crises live here.
- user_posts: full posting history of any public account. KOL vetting, executive watch, competitor monitoring.
Output is flat JSON — post text, author handle, timestamp, repost / comment / like counts, media URLs. Push it straight into a warehouse, a Pandas DataFrame, or a Slack alert with a 30-line script.
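As a rough sketch of the "Slack alert" end of that pipeline, here is one way to turn a batch of posts into alert lines. It assumes only the three fields the quick-start snippet reads (createdAt, repostsCount, text). The function name build_alerts, the 1,000-repost threshold, and the sample records are hypothetical, and actually posting to Slack is left out.

```python
def build_alerts(posts, min_reposts=1000):
    """Return one alert line per post whose repost count meets the threshold."""
    alerts = []
    for post in posts:
        reposts = post.get("repostsCount", 0)
        if reposts >= min_reposts:
            snippet = post.get("text", "")[:80]
            alerts.append(f"[{post.get('createdAt', '?')}] {reposts} reposts: {snippet}")
    return alerts


# Hypothetical sample records, shaped like the fields used in the quick start.
sample_posts = [
    {"createdAt": "2024-05-01T09:00:00+08:00", "repostsCount": 2500, "text": "特斯拉 降价"},
    {"createdAt": "2024-05-01T09:05:00+08:00", "repostsCount": 12, "text": "今天天气不错"},
]

alerts = build_alerts(sample_posts)
for line in alerts:
    print(line)
```

Swap sample_posts for the items streamed from a dataset and send each line to whatever channel you already use.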
What people actually pay for
I see what runs every day on this actor. The patterns paying customers settle into:
1. Equity / sentiment signal on China-listed names — ~$25/day, ~$750/month
A small fund or research desk covering BABA, NIO, PDD, BILI, JD, BEKE, LI, XPEV, KWEB constituents, or any China-exposed Western name. Scheduled search over 30-50 tickers and brand names, ~5,000 posts a day, fed into a sentiment model. Sentiment shifts on Weibo lead the Hong Kong open by hours. Dedicated enterprise social-listening contracts that even attempt China coverage start near $30K/year, and most don't index Weibo deeply.
2. Brand monitoring for Western brands in China — ~$15/day, ~$450/month
A consumer brand with China exposure (Apple, Tesla, Nike, Starbucks, LVMH, Lululemon, any DTC brand on Tmall) needs ~3,000 mentions/day on brand and product-line keywords. The post_comments mode catches crisis posts before they trend. Synthesio / Brandwatch / Talkwalker contracts that include China typically run $30K-$100K/year. The same daily mention stream costs about $15 a day here, or roughly $5,400 a year.
3. KOL / influencer due diligence — ~$1 per KOL
Before you wire 50,000-200,000 RMB to a Weibo influencer for a sponsorship, run user_posts against the handle. Look at posting cadence, real engagement (not vanity follower counts), brand affinity history, controversy flags. One avoided bad deal pays for years of usage.
4. AI / LLM training data — ~1M posts = $5,000
Real-world, conversational, dialect-rich Mandarin from public posts. Filtered Weibo subsets sell on data marketplaces for $20K-$50K and ship stale by months. Pull fresh data on the topics and time windows you care about, own the pipeline, and the per-post cost is a small fraction of either marketplace data or annotator-collected datasets.
5. M&A and pre-deal diligence on Chinese targets — $200-$500 one-off
A pre-LOI sentiment pull on a Chinese target: employee chatter, customer complaints, founder reputation, Glassdoor-equivalent venting. Boutique diligence firms bill $25K-$75K for the equivalent exercise. As a banker or consultant, even a "$500 in cost, $30K invoice" framing is a 60x markup the client is happy to pay for.
6. Crisis monitoring / hourly brand watch — ~$50/month
Schedule a six-times-a-day run on brand keywords. hot_search catches a viral crisis the moment it crosses into the public consciousness — typically a 4-12 hour head start on Western media coverage. For a brand worth eight figures, that gap is the difference between "managed" and "case study."
7. Macro / consumer-trend reading from the hot list — ~$5/day
The hot search list is the cheapest macro signal in Chinese markets. Tariff reactions, regulatory rumblings, viral consumer products, celebrity scandals that wreck brand deals — all surface here first. Hedge fund quants, geopolitical analysts, and morning-brief writers all bake this in.
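For the KOL check in use case 3, "real engagement" can be approximated with a few lines over a user_posts result. This is a minimal sketch, not the actor's own scoring: repostsCount appears in the quick start, but commentsCount and likesCount are assumed names by analogy, so confirm them against the output schema on the actor page. The sample numbers are made up.

```python
from statistics import median


def engagement_summary(posts):
    """Summarize interactions per post; missing counters are treated as 0."""
    totals = [
        p.get("repostsCount", 0) + p.get("commentsCount", 0) + p.get("likesCount", 0)
        for p in posts
    ]
    return {
        "posts": len(totals),
        "median_interactions": median(totals) if totals else 0,
    }


# Hypothetical posting history for one handle.
history = [
    {"repostsCount": 10, "commentsCount": 5, "likesCount": 100},
    {"repostsCount": 0, "commentsCount": 1, "likesCount": 9},
    {"repostsCount": 20, "commentsCount": 10, "likesCount": 70},
]

summary = engagement_summary(history)
print(summary)
```

The median is used rather than the mean so that one viral post does not make an otherwise quiet account look active.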
The number that matters: $0.005 per item
You pay per item returned. No subscription, no surprise overage.
- 1,000 mentions: $5
- 10,000 mentions: $50
- 100,000 mentions: $500
The Apify free plan gives you ~$5/month in platform credit, which covers your first ~1,000 mentions on this actor. You validate the data fits your use case before you spend a cent.
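The arithmetic is simple enough to put in a budget check. A small sketch, using the $0.005 per-item price and the approximate $5 free credit quoted above; the function name estimate_cost is illustrative, and it ignores any other usage drawing on the same credit.

```python
MILLS_PER_ITEM = 5  # $0.005 per item, expressed in thousandths of a dollar


def estimate_cost(items, free_credit_usd=0.0):
    """Estimated out-of-pocket cost in USD for a number of returned items."""
    gross_usd = items * MILLS_PER_ITEM / 1000
    return max(0.0, gross_usd - free_credit_usd)


for n in (1_000, 10_000, 100_000):
    print(f"{n:>7,} items: ${estimate_cost(n):,.2f}")

# With the ~$5 monthly free credit applied, the first 1,000 items net to zero.
print(estimate_cost(1_000, free_credit_usd=5.0))
```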
Quick start
    from apify_client import ApifyClient

    client = ApifyClient("<YOUR_APIFY_TOKEN>")

    # Pull 1,000 posts mentioning Tesla. $5 flat.
    run = client.actor("zhorex/weibo-scraper").call(run_input={
        "mode": "search",
        "searchQuery": "特斯拉",
        "maxResults": 1000,
    })

    # Stream the results
    for post in client.dataset(run["defaultDatasetId"]).iterate_items():
        print(post["createdAt"], post["repostsCount"], post["text"][:120])
The same call shape works for hot_search, post_comments, and user_posts: change the mode and pass the input keys that mode expects. The exact input schema lives on the actor page.
Where the real money lives: recurring runs
One-shot pulls are fine for a diligence assignment. The customers who actually extract serious value from this are the ones running it on a schedule. Apify Schedules takes a cron expression and a saved input — the actor runs forever, the dataset accumulates, and you download it as JSON, CSV, or Excel.
The math gets compelling fast. Below is what my heaviest recurring customers actually run:
| Use case | Cron expression | Approx. monthly cost | What it replaces |
|---|---|---|---|
| Morning hot-search dump for the daily brief | `0 9 * * 1-5` (Asia/Shanghai) | ~$15 | A junior analyst's 30-min daily task |
| Brand mentions, every two hours | `0 */2 * * *` | ~$450 | $30K/yr Brandwatch contract for China only |
| Equity tickers, hourly (the highest-ROI cron on this list) | `0 * * * *` | ~$750 | A $120K/yr China sentiment analyst, half-replicated |
| Crisis watch, every 30 minutes | `*/30 * * * *` | ~$1,500 | A 24/7 PR monitoring agency contract |
| Overnight KOL sweep on 200 handles | `0 2 * * *` | ~$60 | $5K/mo influencer-vetting subscription |
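To sanity-check a row in that table, you can back out how many items each run would have to return to reach the quoted monthly cost. A rough sketch, assuming a 30-day month and the $0.005 per-item price; the helper name items_per_run is illustrative, and real runs will vary with how much is being posted.

```python
ITEMS_PER_DOLLAR = 200  # $0.005 per item


def items_per_run(monthly_cost_usd, runs_per_day, days_per_month=30):
    """Average items each run returns if the monthly bill matches the estimate."""
    total_items = monthly_cost_usd * ITEMS_PER_DOLLAR
    return total_items / (runs_per_day * days_per_month)


# (approx. monthly cost in USD, runs per day), taken from the table rows.
schedules = {
    "brand mentions, every two hours": (450, 12),
    "equity tickers, hourly": (750, 24),
    "crisis watch, every 30 minutes": (1500, 48),
}

for name, (cost, runs) in schedules.items():
    print(f"{name}: ~{items_per_run(cost, runs):.0f} items per run")
```

If a planned schedule implies far more items per run than your keywords are likely to produce, the monthly bill will come in lower, since empty runs cost nothing.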
Set the cron once, walk away, get paid in compounded insight. The customers running hourly equity cron jobs have been doing it for months without touching the config — the actor runs, the data lands, the alpha shows up in their dashboards. That's the only mode of use that actually justifies the time you invested learning the schema.
If you take one thing from this post: don't run it manually twice — wire the second run into a cron. The actor was built for that, the pricing was designed for that, and that's where every customer who renewed went.
Reliability and what happens when things break
You pay per item. If the actor returns nothing on a run, you pay nothing. If it returns 327 items, you pay for 327. That alignment is the whole reason I picked per-event pricing instead of a monthly subscription — my incentive to keep the thing working is exactly your incentive that it works.
I monitor the actor daily. When something upstream changes, I ship a fix within hours, not weeks. The Apify Store rating and issue history on the actor page are public.
Why I priced it like this
I shipped this eight months ago. By month two it was profitable. The most recent sixteen-day window: fifteen paying customers, 136,400 items returned, $697 revenue, $675 profit, 96.79% margin. The margin isn't there because the work is trivial — it's there because per-event pricing means I only earn when the data is actually delivered.
If you're evaluating Chinese social tools and the lowest quote you can get is $20K+, run a 1,000-mention probe through this actor first. You'll know inside ten minutes whether the data covers your use case. Worst case, you spend $5. Then wire it into a cron and forget about it.
Other Chinese-platform actors I run
Weibo is the macro signal layer for China. These cover the rest of the surface area:
- Xiaohongshu / RED scraper — lifestyle, beauty, female-skewing audience. The #1 platform for DTC brand launches in China.
- Bilibili scraper — long-form video, Gen Z, gaming / anime / tech vertical signal.
- Douban scraper — books, films, music, niche communities. The most "honest" review platform in China.
- Xueqiu scraper — retail-trader-heavy financial discussion. Equity-desk supplement to Weibo.
- Chinese Brand Monitor — composite brand signal across the platforms above.
All same pricing model. Pay per item. Schedule freely.
Compliance posture
- Only public Weibo posts. No private accounts, no DMs, no content behind a login wall.
- No login bypass. The actor does not log into Weibo on your behalf, and does not need an account to function.
- Optional cookies are user-supplied and only raise your personal rate limit. They are never required for the actor to work.
If your use case requires private data, this actor isn't it — and frankly nothing on the Apify Store will be.
If you actually run something interesting with it, leave a comment or open an issue on the actor page — I read all of them.