If you cover Chinese equities — A-shares in Shanghai or Shenzhen, H-shares on HKEX, ChiNext growth-tech, the STAR Market sci-tech innovation board, the Beijing Stock Exchange, or US-listed Chinese ADRs — you already know the data problem. China is the world's second-largest equity market by capitalization, and the most fragmented from a research-tooling perspective. Bloomberg, Wind, and Refinitiv each price their China bundles north of twenty thousand dollars per seat per year, English-language retail sources are sparse and frequently stale, and official exchange portals are split across five operators with inconsistent schemas.
This is the umbrella guide to building a China financial data stack out of structured, on-demand actors. It is the final article in our Phase 2 buyer-intent series and a companion to our deep dive on scraping A-share fundamentals from Eastmoney and our regional overview of Asian market data scrapers. Where the Eastmoney post drills into one source for A-share fundamentals, this post walks the entire China stack — every board, every venue, plus the offshore and ADR layers — and shows how to assemble them into a working workflow without writing a check to a terminal vendor.
The Problem: China Equity Data Is Structurally Fragmented
China-focused analysts juggle data from at least six distinct venues. The Shanghai Stock Exchange (SSE) lists mainland blue-chips on its main board plus the STAR Market for sci-tech innovation names. The Shenzhen Stock Exchange (SZSE) runs the main board, the SME board, and ChiNext — the Nasdaq-equivalent growth-tech venue. The Beijing Stock Exchange (BSE), launched late 2021, focuses on innovation-oriented SMEs promoted from NEEQ. Offshore, HKEX hosts H-shares (mainland-incorporated, HK-dual-listed) and red chips. Then there are US-listed Chinese ADRs — a shrinking pool that still includes Alibaba, JD.com, PDD, and dozens of smaller names, many now running parallel H-share secondary listings as delisting insurance.
None of these venues share a tape. Ticker conventions, sector taxonomies, and disclosure cadences all differ. Foreign access runs through Stock Connect (Northbound from Hong Kong into A-shares, Southbound from the mainland into H-shares), which adds its own quota wrinkles. A single research question — "show me the Chinese consumer-discretionary universe ranked by free cash flow yield" — requires five queries against five sources and a normalization layer the analyst builds themselves.
Terminal vendors solved this by buying the data wholesale and reselling a blended feed for twenty thousand dollars per seat at the low end and well over one hundred thousand for Wind enterprise. For a two-person family office, an emerging quant pod, an independent allocator, or a fintech building a China product, that is not a workable cost base. The on-demand actor approach unbundles the same primary sources into discrete jobs you only pay for when you run them.
Why China Equity Data Matters Right Now
Three structural shifts make this dataset more interesting today than three years ago. First, MSCI's progressive inclusion of A-shares in MSCI Emerging Markets and ACWI has pushed passive and benchmark-aware active money into mainland names. The inclusion factor remains well below full weight, meaning every step-up triggers measurable rebalance flow. FTSE Russell has followed a similar path. Tracking the actual A-share constituents, free-float adjustments, and inclusion factors is now table stakes for global EM strategy.
Second, ADR delisting risk under the Holding Foreign Companies Accountable Act reshaped offshore China stock work. The 2022 PCAOB cooperation agreement cooled the extreme scenarios, but ADR investors now systematically track each issuer's Hong Kong secondary or primary listing and monitor conversion mechanics. Mapping every ADR to its H-share equivalent and tracking volume migration between New York and Hong Kong sessions is an ongoing workflow.
Third, the policy-driven repricing of Chinese growth tech made venue-level differentiation more important. ChiNext and STAR Market trade with different daily price limits than main boards (twenty percent vs ten percent), different investor eligibility thresholds, and different IPO mechanics. Aggregating them as a single "China growth" basket loses information that a venue-aware data layer preserves.
The China Financial Data Stack: Venue by Venue
The table maps each major venue to coverage profile, listing population, trading hours in China Standard Time (UTC+8), and the NexGenData actor that targets it. Hours assume the standard continuous-auction session and exclude the noon trading break on mainland venues.
| Venue | Coverage | Listings | Hours (CST) | Use case | Actor |
|---|---|---|---|---|---|
| Shanghai Main Board (SSE) | A-share blue-chips, SOEs, large-cap financials | ~1,700 | 09:30–11:30, 13:00–15:00 | Core A-share fundamental screening | eastmoney-china-stock-screener |
| STAR Market (SSE) | Sci-tech innovation — semis, biotech, hard-tech | ~580 | 09:30–11:30, 13:00–15:00 | Growth-tech screens, IPO research | STAR Market screener (catalog) |
| Shenzhen Main Board (SZSE) | A-share mid/large-caps, manufacturing-heavy | ~1,500 | 09:30–11:30, 13:00–15:00 | Industrial and consumer screens | eastmoney-china-stock-screener |
| ChiNext (SZSE) | Growth-tech — internet, software, new energy | ~1,300 | 09:30–11:30, 13:00–15:00 | High-growth screens, momentum strategies | ChiNext screener (catalog) |
| Beijing Stock Exchange | Innovation SMEs, ex-NEEQ promotions | ~260 | 09:30–11:30, 13:00–15:00 | Small-cap discovery, policy flow | BSE screener (catalog) |
| HKEX (H-shares + red chips) | Dual-listed mainland + HK-domiciled China plays | ~2,600 | 09:30–12:00, 13:00–16:00 | Foreign-accessible China exposure | HKEX Hang Seng screener (catalog) |
| US ADRs (NYSE/Nasdaq) | Chinese issuers in ADR form — BABA, JD, PDD | ~250 | 09:30–16:00 ET | USD-denominated China, arbitrage | Chinese ADRs screener (catalog) |
| China-listed ETFs | Onshore ETF flow across A-shares, themes | ~900 | 09:30–11:30, 13:00–15:00 | Sector rotation, positioning | china-etf-flow-tracker |
| A-share insider activity | Mainland insider buys/sells, pledge events | full A-share | T+1 | Insider signal overlay, governance | china-ashare-insider-trades |
| HKEX insider + short interest | Part XV disclosures, short position reports | full HKEX | next session | Short-side research, dealing disclosures | hkex-insider-short-tracker |
| HKEX IPO calendar | Pipeline including A-to-H and homecomings | rolling 6 months | continuous | IPO subscription, allocation research | hkex-ipo-calendar |
Three actors point to the catalog landing page rather than a dedicated public URL. Those screeners ship to private beta customers; the catalog lists them under the China cluster and access-on-request takes a couple of business days. Everything else is publicly runnable on Apify today.
Example Workflow: Building a China Equity Screen for a Foreign Allocator
The scenario: a foreign allocator benchmarked to MSCI EM wants a long-bias screen for Chinese consumer-discretionary names trading at a discount to their three-year free-cash-flow average, with positive insider-buy signal and improving ETF flow. Output should cover A-shares, H-shares, and ADRs, deduplicated where the same economic asset trades in multiple venues.
Step 1 — Screen the A-share universe. Run eastmoney-china-stock-screener filtered to consumer-discretionary, with trailing-twelve-month free cash flow, FCF yield versus three-year median, and a two-billion CNY market-cap floor. Output is a CSV with venue tags so you can split SSE Main, SZSE Main, ChiNext, and STAR downstream. For the source-level methodology see the dedicated Eastmoney post.
Step 2 — Cross-check H-share dual listings. For each A-share that passes, query the HKEX universe for the matching H-share ticker. The HKEX listing often trades at a meaningful discount — the A-H premium — and a Stock Connect-eligible allocator can frequently access the cheaper venue. Use the HKEX screener from the NexGenData catalog (request access at apify.com/nexgendata) to enrich the A-share list with H-share tickers and live pricing.
Step 3 — Add US ADR equivalents. For names with parallel ADR listings (Alibaba NYSE BABA vs HKEX 9988, JD.com Nasdaq vs HKEX 9618) pull ADR pricing and ADV. The Chinese ADRs screener in the catalog returns the full mapping plus delisting-risk flags from SEC PCAOB filings. This matters for any allocator running a USD share class who wants China exposure without a Stock Connect operational footprint.
Step 4 — Layer institutional and insider signals. Run china-etf-flow-tracker for the consumer-discretionary thematic ETFs over twenty trading days. Persistent positive net creations are a useful positioning tailwind. Then run china-ashare-insider-trades filtered to your screened tickers — director and supervisor buying at the prevailing price is a stronger signal than the noisier US Form 4 universe because mainland disclosure rules require more granular reporting. For HKEX-listed duals, layer hkex-insider-short-tracker for both Part XV substantial-shareholder filings and weekly SFC short-position disclosures.
The four steps run unattended on a daily schedule, write to a single normalized SQL table keyed by ISIN, and produce a deduplicated long candidate list ready for fundamental review. Total Apify spend per nightly run, at typical actor pricing, lands in low single-digit dollars — three orders of magnitude below a terminal seat.
Use Cases Across the China Investing Stack
- Quant fund signal generation — combine A-share fundamentals with insider flags and ETF flow into a daily factor sleeve.
- Fundamental analyst due diligence — pull the full A-share, H-share, and ADR snapshot for any covered name in one normalized payload.
- ADR delisting research — track each ADR's HK secondary listing, conversion ratio, and volume migration as PCAOB status evolves.
- MSCI and FTSE rebalance trading — anticipate inclusion-factor changes and front-run passive flow with a venue-aware A-share screen.
- China ETF construction — build custom thematic baskets across A-share, H-share, and ADR universes with consistent sector mapping.
- Insider trading research — surface unusual director or large-shareholder activity ahead of earnings, M&A, or policy events.
- IPO subscription research — work the HKEX pipeline including A-to-H homecomings; the hkex-ipo-calendar and the regional apac-ipo-calendar-sweep cover the listing-pipeline view.
- Regulatory risk monitoring — overlay hk-sfc-enforcement-tracker on your HK-listed book to flag SFC enforcement actions.
- Sentiment overlays — combine the equity stack with china-trends-tracker for a domestic-narrative read on consumer brands.
- Cross-venue arbitrage — A-H premium, ADR-to-H spreads, and Northbound/Southbound Connect flow imbalances become tractable when the underlying data is structured.
Start With the A-Share Screener
The highest-leverage entry point into this stack is the A-share fundamentals screener. It covers the largest universe (SSE Main, SZSE Main, ChiNext, STAR in one query), it is the most expensive dataset to replicate manually, and it is the anchor every downstream actor enriches.
Run the Eastmoney China A-Share Screener on Apify →
For source-level methodology read the companion Eastmoney A-share scraping post first. Otherwise the actor ships with a default input that returns the full A-share universe with twenty core fundamental fields — you can be looking at the data in under five minutes.
Related Actors in the China and Asia Equity Cluster
- china-ashare-insider-trades — mainland director and supervisor disclosure feed.
- hkex-insider-short-tracker — HKEX Part XV filings plus weekly short positions.
- china-etf-flow-tracker — onshore ETF creations and redemptions.
- hkex-ipo-calendar — HK listing pipeline including A-to-H homecomings.
- hk-sfc-enforcement-tracker — SFC enforcement for HK-listed risk overlay.
- kospi-stock-screener — sister actor covering Korean KOSPI.
- japan-edinet-insider-filings — Japanese insider feed for the cross-North-Asia book.
- global-equities-screener-mcp — multi-venue screening as an MCP server.
For broader regional context see the Asian Market Data Scrapers overview and the rest of the Financial Data Tools category.
Frequently Asked Questions
Are A-share prices delayed?
Eastmoney-sourced fundamentals and end-of-day price snapshots are effectively real-time relative to the 15:00 CST close. Intraday prices carry the standard Eastmoney delay — typically near-real-time but not exchange-direct. For tick-level work use a direct exchange feed; for fundamental and end-of-day screening the actor stack is fit for purpose.
How do I track the A-H premium for dual-listed names?
Run the Eastmoney A-share screener and the HKEX H-share screener on the same daily schedule, join on the cross-listed mapping (the HKEX screener returns the A-share ticker as a field), and compute per-name premium as A-share price translated at CNY/HKD divided by H-share price minus one. The Hang Seng China AH Premium Index is the headline aggregate; per-issuer view is where alpha sits.
Can I screen by sector and industry?
Yes. The Eastmoney screener exposes the Eastmoney/CSRC industry taxonomy. HKEX uses Hang Seng Industry Classification (HSICS). Normalize both onto GICS downstream — no native GICS field on either, but mapping tables are stable.
What about Northbound and Southbound Connect flow?
Daily Stock Connect flow aggregates are published by HKEX and the mainland exchanges. Per-name Northbound holdings — the Shanghai-Hong Kong Stock Connect Aggregate disclosures — publish with one-day lag and are the single highest-signal foreign-positioning dataset for A-shares.
Do you cover STAR Market and ChiNext separately?
Yes. The Eastmoney screener returns a venue tag (SSE Main, SSE STAR, SZSE Main, SZSE ChiNext, BSE) per row. STAR Market and ChiNext run 20% daily price limits versus 10% main-board and have different eligibility thresholds, both of which matter for risk modelling. Dedicated catalog screeners go deeper on board-specific fields including STAR Market AT1.
How frequently is the data updated?
End-of-day fundamentals refresh after each 15:00 CST close. Insider disclosures are T+1 mainland, same-day or next-session HKEX. ETF flow refreshes after end-of-day NAV publication. IPO calendars refresh continuously off the HKEX disclosure portal. Schedule actors at whatever cadence your workflow needs.
Why not just use Bloomberg, Wind, or Refinitiv?
If you have terminal access and your firm pays for it, use it. The unbundled actor stack exists for users for whom a $20K+/seat/year bill is not justified by usage — independent allocators, emerging quant pods, fintech researchers, academic groups, family offices. It is also useful for terminal-equipped teams building proprietary screens outside vendor licensing review.
Are the actors safe for a corporate compliance footprint?
The actors scrape public disclosure pages — exchange portals, regulator filings, and Eastmoney's public site. They do not bypass paywalls or access non-public data. Compliance teams treat them like any web-scraping tool; Apify per-run audit logs satisfy most data-provenance requirements.
See also:New -- Short Interest Tracker
Top comments (0)