Ask any Western fund manager what their best source is for Bombay Stock Exchange equities, Hong Kong insider trades, or Singapore HDB resale prices, and the answer is usually a wince. Bloomberg covers headline indices well, Refinitiv has decent A-share fundamentals if your firm pays the seven-figure subscription, and Crunchbase's APAC startup coverage thins the moment you cross south of Tokyo or west of Mumbai. For everyone else — emerging-markets equity analysts, market-entry consultants, OSINT researchers, regional VC associates — Asia-Pacific public data is a patchwork of native-language portals, government registries, and exchange microsites that no single vendor stitches together affordably.
This post catalogues the Asian market data scrapers we build at NexGenData. They pull from East Money, the HK Companies Registry, MCA21, ACRA BizFile+, EDINET, the Korea Exchange, IDX, SET, PSE, Bursa Malaysia, HOSE, and the major regional social platforms — turning fragmented APAC sources into clean JSON or CSV. Whether you're tracking northbound Stock Connect flows, building a Vietnam consumer-electronics market map, or watching for SEBI enforcement, the goal is structured, refresh-on-demand data that doesn't require a Mandarin-speaking analyst and a Selenium farm.
Why APAC public data stays underserved
The reasons Western vendors under-index on Asia are structural. East Money, the de facto retail terminal for mainland Chinese investors, publishes mostly in Simplified Chinese behind aggressive bot mitigation. The HK Companies Registry charges per-document fees and gates structured exports. India's MCA21 requires CAPTCHA solves for bulk lookups. ACRA BizFile+ is priced for one-off lookups, not analyst workflows. EDINET returns XBRL bundles needing parsing, and Korea's DART lives behind a Korean-language UI. Add regional CDNs that block non-APAC IPs and the operational cost of running headless browsers across a dozen jurisdictions, and the math stops working for generalist vendors. The result: APAC equity coverage on the major Western platforms is a mile wide and an inch deep — Nikkei 225, Hang Seng, Sensex, KOSPI 200, and not much else with the same depth as the S&P 500 dataset.
That gap is where targeted scrapers shine. Each actor below focuses on one source, handles the local-language fields, the anti-bot dance, and the API or HTML quirks specific to that exchange or registry, and returns a normalised payload.
Why this data matters now
APAC weight in global equity benchmarks keeps climbing, MSCI inclusion factors for China A-shares have ratcheted up, and India's total listed market cap crossed $5T in 2024. Practical use cases driving demand for structured APAC data:
- China A-share research — northbound flow analysis, A/H premium tracking, CSI 300 vs ChiNext vs STAR Market rotation, SOE insider activity.
- India market-entry diligence — MCA company master data on local competitors, director-network mapping, SEBI filings, MagicBricks for physical footprint.
- Singapore property and family-office research — URA transactions, HDB resale, MAS-licensed institutions, ACRA UEN counterparty lookups.
- Southeast Asian e-commerce intel — JD.com, Made-in-China, Alibaba B2B catalogues for supplier discovery and category share estimates.
- Regional VC sourcing — IPO calendars across HKEX, SGX, KOSPI plus the APAC sweep for pre-listing diligence and comp sets.
- OSINT and forensic research — HK Companies Registry shell-tracing, Land Registry asset attribution, SFC enforcement, India MCA director-DIN graphs.
- M&A targeting — public filings, ownership disclosures, insider trade clusters, CNIPA patent grants for IP-adjacent deals.
- Macro and policy tracking — RBI MPC statements, MAS and SFC enforcement, exchange-level IPO pipelines as leading indicators.
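As one concrete instance of the use cases above, the A/H premium is simply the A-share price divided by the H-share price converted into CNY, minus one. The sketch below assumes hypothetical field names (`a_price_cny`, `h_price_hkd`) and an FX rate you supply yourself; the prices are illustrative, not market data, and real actor output may be shaped differently.

```python
from dataclasses import dataclass


@dataclass
class DualListing:
    """One company listed in both the mainland and Hong Kong (hypothetical schema)."""
    name: str
    a_price_cny: float  # mainland A-share price, in CNY
    h_price_hkd: float  # Hong Kong H-share price, in HKD


def ah_premium_pct(pair: DualListing, hkd_to_cny: float) -> float:
    """Return the A-share premium over the H-share, in percent."""
    h_price_cny = pair.h_price_hkd * hkd_to_cny
    return (pair.a_price_cny / h_price_cny - 1.0) * 100.0


# Illustrative numbers only.
pair = DualListing(name="ExampleCo", a_price_cny=11.5, h_price_hkd=10.0)
premium = ah_premium_pct(pair, hkd_to_cny=0.92)
print(f"{pair.name}: A-share premium {premium:.1f}%")
```

A positive value means the mainland line trades rich to Hong Kong; a negative value means the reverse.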
What's covered, grouped by sub-region
The roster below is organised by geography. Public actors link straight through; specialised regional actors not on the public marketplace are reachable from the NexGenData Apify catalog — message us or trigger a custom run.
China — financial markets
| Actor | Source | What it returns |
|---|---|---|
| Eastmoney A-Shares Screener | East Money / 东方财富 | Full Shanghai + Shenzhen A-share universe with quotes, PE, PB, market cap, sector tags. |
| China ETF Flow Tracker | East Money | Daily inflows, outflows, AUM, premium/discount across mainland-listed ETFs. |
| China A-Share Insider Trades | SSE / SZSE disclosures | Executive 增减持 records — buyer, role, share count, price, transaction window. |
| HKEX Hang Seng Screener | HKEX | Hang Seng constituents, sector breakdown, fundamentals, A/H pair flagging. |
| STAR Market Screener | SSE STAR / 科创板 | Innovation-board listings with R&D intensity, lockup status, sponsor banks. |
| ChiNext Screener | SZSE ChiNext / 创业板 | Growth-board names with fundamentals, IPO date, suspension flags. |
| Chinese ADRs Screener | NYSE / NASDAQ | US-listed China names with ADR ratio, sector, regulator status, delisting risk. |
China — social and e-commerce signal
| Actor | Source | What it returns |
|---|---|---|
| China Trends Tracker | Weibo + Baidu + Douyin | Unified daily trending topics across the three biggest discovery surfaces. |
| Bilibili Video Search | Bilibili / B站 | Keyword search results — title, uploader, views, likes, danmaku count. |
| RedNote (Xiaohongshu) Scraper | Xiaohongshu / 小红书 | Posts, engagement, hashtags from the dominant Gen-Z product-discovery platform. |
| Weibo Hot Search Tracker | Weibo / 微博 | Hourly hot-search board snapshots with rank, topic, view count. |
| Douyin Trending Tracker | Douyin / 抖音 | Trending video board with creator handle, view count, hashtag set. |
| Zhihu Q&A Tracker | Zhihu / 知乎 | Hot questions and answer engagement — useful for B2B and pro-consumer themes. |
| Made-in-China B2B Suppliers | Made-in-China.com | Supplier directory with category, location, certifications, capacity notes. |
| Alibaba B2B Supplier Finder | Alibaba.com | Wholesale supplier intel — Gold member status, transaction volume, MOQ. |
Hong Kong & Taiwan
| Actor | Source | What it returns |
|---|---|---|
| HKEX IPO Calendar | HKEX | Upcoming Hong Kong listings — pricing range, sponsor, lockup terms, debut date. |
| HKEX Insider & Short Tracker | HKEX CCASS + disclosures | Substantial-shareholder filings, director dealings, short-interest aggregates. |
| HK SFC Enforcement Tracker | Securities & Futures Commission | Enforcement notices, sanctions, license suspensions across HK financial sector. |
| HK Land Registry Transactions | HK Land Registry | Property transaction records with consideration, parties, date, address. |
| HK Companies Registry | HK Companies Registry | CR number, directors, officers, registered address, dissolution status. |
| HK Centaline Property Index | Centaline CCL | Weekly residential index — citywide and by sub-market for HK property cycle. |
| HK Trademark Search | HK IP Department | Trademark registry lookups — applicant, class, status, conflict checks. |
| Taiwan TWSE Screener | Taiwan Stock Exchange | TWSE-listed universe with fundamentals, dividend yield, foreign holdings. |
India
| Actor | Source | What it returns |
|---|---|---|
| NSE India Indices Screener | National Stock Exchange | Nifty 50, Nifty Next 50, sector indices — constituents and live fundamentals. |
| BSE India Stock Screener | Bombay Stock Exchange | Sensex and full BSE universe with quotes, fundamentals, group classification. |
| OGD India Companies Lookup | data.gov.in MCA master | Company master records — CIN, status, category, paid-up capital, address. |
| India MCA Companies | MCA21 | CIN-keyed director lookups, charges, recent filings, signatory history. |
| India MCA INC-22 / INC-32 Filings | MCA21 | Registered-office and incorporation filings — useful for fresh-funded leads. |
| India SEBI Filings Tracker | SEBI | Listed-company filings, takeover-code disclosures, enforcement orders. |
| India RBI Monetary Policy | Reserve Bank of India | MPC statements, repo decisions, governor speeches with date and text payload. |
| India MagicBricks Real Estate | MagicBricks | Listings — price, sqft, locality, builder, possession date across metros. |
Singapore
| Actor | Source | What it returns |
|---|---|---|
| Singapore ACRA Companies | ACRA BizFile+ | UEN lookup, directors, registered address, entity type, status. |
| SG HDB Resale Prices | data.gov.sg HDB | Flat resale transactions — block, sqm, lease, price, town, transaction month. |
| SG URA Property Transactions | URA REALIS | Private property caveats — project, type, area, price, tenure. |
| SG MAS Financial Institutions | MAS register | Licensed banks, capital-markets services holders, insurance, payment firms. |
| SG Rental Market Tracker | HDB + URA rental | HDB and private rental contracts with median rent by district and unit type. |
| SG SGX Stock Screener | SGX | STI constituents, sector splits, REIT yield, mainboard vs Catalist. |
| SG MAS Enforcement | MAS enforcement | Fines, prohibition orders, and regulator notices with date and party. |
| SG MyCareersFuture Jobs | MyCareersFuture.gov.sg | Job postings — employer, title, salary band, EP eligibility flag. |
Japan & Korea
| Actor | Source | What it returns |
|---|---|---|
| TSE Japan Stock Screener | Tokyo Stock Exchange | Nikkei 225 + Prime constituents, fundamentals, foreign ownership ratio. |
| Japan EDINET Insider Filings | EDINET | Insider trading disclosures, large-shareholder reports, parsed XBRL fields. |
| KOSPI Stock Screener | Korea Exchange | KOSPI listings — market cap, quotes, foreign holding %, sector classification. |
Southeast Asia
| Actor | Source | What it returns |
|---|---|---|
| HOSE Vietnam Stock Screener | Ho Chi Minh Exchange | VN30 + full HOSE universe with quotes, foreign room, sector tags. |
| IDX Indonesia Stock Screener | Indonesia Stock Exchange | LQ45 and full IDX list, fundamentals, ownership data. |
| SET Thailand Stock Screener | Stock Exchange of Thailand | SET50 constituents, ETF list, dividend yield, sector breakdown. |
| PSE Philippines Stock Screener | Philippine Stock Exchange | PSEi constituents, fundamentals, board lots, sector classification. |
| Bursa Malaysia Stock Screener | Bursa Malaysia | KLCI universe — quotes, syariah flag, sector splits, dividends. |
| APAC IPO Calendar Sweep | HKEX + SGX + KOSPI + others | Pan-Asia upcoming listings consolidated in one table with venue, sponsor, date. |
Example workflow: building an India market-entry brief
Suppose you're a strategy consultant scoping India entry for a European industrial-automation client. Crunchbase is thin past the funded startups, LinkedIn is noisy, and the official MCA portal won't let you bulk-query. Here's a 90-minute workflow that produces a defensible diligence pack.
Step 1 — define the competitor set. Run the BSE India Stock Screener filtered for Capital Goods / Industrial Manufacturing. Cross-reference with the NSE India Indices Screener. You now have a ranked universe of listed competitors with market cap, sector, and ticker.
Step 2 — pull company master data. For every listed competitor plus unlisted private rivals, run the OGD India Companies Lookup or India MCA Companies for CIN, registered address, paid-up capital, directors, charges, last filing date. This is the spine of the diligence pack.
Step 3 — overlay regulatory signal. Pipe the same CIN list through the India SEBI Filings Tracker to flag takeover-code disclosures, insider trades, and enforcement orders.
Step 4 — physical-footprint check. Use the India MagicBricks Real Estate actor to sweep commercial listings in competitor HQ cities — useful for facility-size and rental benchmarks.
Step 5 — macro overlay. Append the latest RBI Monetary Policy statement for rate-environment context.
Step 6 — export. Every actor returns CSV / JSON. Drop outputs into BigQuery, Snowflake, or a single workbook. The pipeline runs unattended on a weekly schedule, so the brief stays current through the deal cycle.
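The join behind steps 1 to 3 can be sketched in a few lines of Python. This is a minimal illustration, not the actors' real schema: every field name (`cin`, `ticker`, `market_cap_inr_cr`, `paid_up_capital_inr`, `type`) and every sample record is hypothetical, and it assumes you have already mapped each listed ticker to its CIN.

```python
from collections import defaultdict


def build_brief(listed, mca_master, sebi_filings):
    """Join screener rows, MCA master rows, and SEBI filings on CIN."""
    master_by_cin = {row["cin"]: row for row in mca_master}

    # Collect filing types per company so each row carries its own flags.
    filings_by_cin = defaultdict(list)
    for filing in sebi_filings:
        filings_by_cin[filing["cin"]].append(filing["type"])

    brief = []
    for company in listed:
        cin = company["cin"]
        master = master_by_cin.get(cin, {})  # unlisted or unmatched: empty
        brief.append({
            "cin": cin,
            "ticker": company["ticker"],
            "market_cap_inr_cr": company["market_cap_inr_cr"],
            "paid_up_capital_inr": master.get("paid_up_capital_inr"),
            "sebi_flags": sorted(set(filings_by_cin.get(cin, []))),
        })

    # Largest competitors first.
    brief.sort(key=lambda row: row["market_cap_inr_cr"], reverse=True)
    return brief


# Placeholder records, not real companies.
listed = [
    {"cin": "CIN-0001", "ticker": "AAA", "market_cap_inr_cr": 1200.0},
    {"cin": "CIN-0002", "ticker": "BBB", "market_cap_inr_cr": 4500.0},
]
mca_master = [{"cin": "CIN-0001", "paid_up_capital_inr": 50_000_000}]
sebi_filings = [
    {"cin": "CIN-0002", "type": "takeover_disclosure"},
    {"cin": "CIN-0002", "type": "enforcement_order"},
]

brief = build_brief(listed, mca_master, sebi_filings)
for row in brief:
    print(row)
```

Keying everything on CIN keeps the listed and unlisted halves of the competitor set in one table, and a missing master record shows up as an explicit `None` rather than a silently dropped row.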
The same shape works for a China A-share thesis (East Money + A-share insider trades + Trends tracker), a Singapore property pitch (URA + HDB + SG Rental Market Tracker), or a Vietnam consumer scan (HOSE + B2B suppliers + social signal).
Who uses these
- EM equity analysts building China A-share, India, or ASEAN coverage without a Bloomberg APAC seat.
- Fund managers tracking northbound Stock Connect flows, A/H premium, and KOSPI foreign ownership shifts.
- Market-entry consultants producing diligence packs for European or US clients scoping APAC expansion.
- Family offices and wealth platforms in Singapore and Hong Kong needing licensed-counterparty checks via MAS and SFC registers.
- VC associates sourcing pre-IPO and recently-listed APAC names from HKEX, SGX, KOSPI calendars and the consolidated APAC IPO sweep.
- OSINT and due-diligence firms running shell-company traces through HK Companies Registry, Land Registry, and India MCA director graphs.
- Trade-finance and supply-chain teams validating Chinese suppliers through Made-in-China, Alibaba B2B, and CNIPA patent records.
- Journalists covering APAC business needing primary-source structured data instead of vendor screenshots.
- Recruiters and talent intelligence mapping the Singapore tech market via MyCareersFuture and EP-flagged listings.
- Macro and policy desks tracking RBI MPC decisions, MAS and SFC enforcement, and regional IPO pipelines as leading indicators.
Start here
Browse the full set of Asian market actors on the NexGenData Apify catalog, or jump straight into the workhorse — the Eastmoney A-Shares Screener — to see the data shape before committing to a workflow. Most actors run on per-event pricing, so you can pull a few hundred records to validate fit before scaling up.
Related actors worth a look
- Japan EDINET Insider Filings — for Japan-focused activist or M&A workflows.
- HK Land Registry Transactions — asset attribution and HK property research.
- RedNote (Xiaohongshu) Scraper — Gen-Z product-discovery signal for consumer brands targeting mainland China.
- APAC IPO Calendar Sweep — single pan-regional IPO feed for syndicate desks.
- Made-in-China B2B Suppliers — sourcing and supplier-risk research.
- China Trends Tracker — combined Weibo / Baidu / Douyin signal for consumer thesis work.
Related reading: Free A-Share data scraping (Chinese-language guide) and Building a real-time FX dashboard for the currency overlay any APAC workflow needs.
FAQ
Why do Western data vendors miss so much APAC data?
Three structural reasons. Language — most APAC primary sources publish in CJK, Bahasa, Thai, or Vietnamese, and parsing requires per-source engineering. Anti-bot — mainland and Indian government sites block non-local IPs and rotate CAPTCHA. Economics — bespoke pipelines for ACRA, MCA21, EDINET, DART, IDX don't pencil out for generalist vendors.
Do these handle Chinese, Japanese, and Korean characters cleanly?
Yes. Every actor that targets a CJK source emits UTF-8 with the original character set preserved alongside any Romanised or English fields the source provides. East Money, EDINET, KOSPI, and HKEX actors have been hardened against the usual encoding pitfalls (mojibake, half-width vs full-width punctuation, traditional vs simplified mixing).
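If you want to apply a similar clean-up to your own CJK fields downstream, Unicode NFKC normalisation is a common starting point for the half-width vs full-width problem. The sketch below uses only Python's standard library and is not a description of how the actors themselves work; `normalise_field` is a hypothetical helper name.

```python
import unicodedata


def normalise_field(text: str) -> str:
    """Fold full-width Latin letters, digits, and punctuation to ASCII forms.

    NFKC leaves ordinary CJK ideographs untouched, so native-language
    names survive the clean-up.
    """
    return unicodedata.normalize("NFKC", text).strip()


# Full-width letters, digits, colon, full stop, and an ideographic space.
raw = "东方财富　ＰＥ：１２．５"
cleaned = normalise_field(raw)
print(cleaned)
```

NFKC does not repair mojibake or convert between traditional and simplified characters; those need separate handling. It also widens half-width katakana, which is usually what you want for Japanese fields.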
Can I monitor multiple regions in one workflow?
Yes — the most common pattern is a scheduled Apify task that fires three or four region-specific actors in sequence and writes a combined output to a single dataset, Google Sheet, or webhook. The APAC IPO Calendar Sweep is the in-house example of that pattern, pulling HKEX, SGX, KOSPI, and adjacent venues into one stream.
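The merge step of that pattern can be sketched as follows. It covers only the combining of per-venue outputs into one stream, not the scheduling, and the field names (`company`, `debut_date`) and sample rows are hypothetical placeholders rather than real listings or real actor output.

```python
from datetime import date


def combine_ipo_feeds(feeds):
    """Merge per-venue IPO rows into one list, tagged by venue, sorted by debut date."""
    combined = []
    for venue, rows in feeds.items():
        for row in rows:
            # Copy the row and tag it so the source venue survives the merge.
            combined.append({**row, "venue": venue})
    combined.sort(key=lambda row: (row["debut_date"], row["venue"]))
    return combined


# Placeholder rows only.
feeds = {
    "HKEX": [{"company": "Example HK Ltd", "debut_date": date(2025, 3, 14)}],
    "SGX": [{"company": "Example SG Pte", "debut_date": date(2025, 3, 3)}],
    "KOSPI": [{"company": "Example KR Co", "debut_date": date(2025, 3, 9)}],
}

combined = combine_ipo_feeds(feeds)
for row in combined:
    print(row["debut_date"].isoformat(), row["venue"], row["company"])
```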
How fresh is the China A-share data?
The Eastmoney A-Shares Screener refreshes within seconds of East Money's own page render — effectively real-time during mainland trading hours, with end-of-day snapshots persisted to your dataset. ETF flows and insider trades follow the underlying exchange disclosure cadence (daily for most fields, intra-day for flow tracking).
Are these compliant with PRC and APAC data laws?
The actors target only publicly disclosed data — exchange filings, government registries, public social platforms — without bypassing login walls or paywalls. PRC PIPL, India DPDPA, Singapore PDPA, and similar regimes principally regulate personal data; structured corporate, listing, and market data is generally outside those scopes. That said, downstream use is your responsibility — if you're republishing or selling derived datasets, get local counsel involved.
How do you handle anti-bot defences on Chinese sites?
A mix of rotating residential proxy pools sized for each region, fingerprint randomisation, request pacing tuned per target, and per-actor fall-through logic that retries with a different network path on classification failure. The infrastructure is shared across actors, so reliability improvements on one Chinese-source actor lift the rest.
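The fall-through idea, reduced to its control flow, looks roughly like the sketch below. It is a generic illustration and not the production logic: `Blocked`, `fetch_with_fallthrough`, the path labels, and the injected `fetch` callable are all hypothetical names, and the demo uses a fake fetcher rather than real network calls.

```python
import time
from typing import Callable, Sequence


class Blocked(Exception):
    """Raised by the fetch callable when a request is rejected as automated."""


def fetch_with_fallthrough(
    url: str,
    paths: Sequence[str],
    fetch: Callable[[str, str], str],
    pause_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Try each network path in order, pausing between attempts."""
    last_error = None
    for index, path in enumerate(paths):
        if index > 0:
            sleep(pause_seconds)  # pace retries instead of hammering the target
        try:
            return fetch(url, path)
        except Blocked as err:
            last_error = err  # fall through to the next path
    raise RuntimeError(f"all {len(paths)} network paths were blocked") from last_error


def fake_fetch(url: str, path: str) -> str:
    """Stand-in fetcher: the first path is always rejected."""
    if path == "path-a":
        raise Blocked(path)
    return f"fetched {url} via {path}"


result = fetch_with_fallthrough(
    "https://example.com/quotes",
    ["path-a", "path-b"],
    fake_fetch,
    sleep=lambda seconds: None,  # skip the real delay in this demo
)
print(result)
```

Injecting `fetch` and `sleep` keeps the retry policy testable without touching the network.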
What if I need a custom field, region, or source not listed?
Most actors expose configuration that already covers common variations (ticker lists, date ranges, region filters). If you need a genuinely new source — say a specific provincial registry or a niche exchange — message via the Apify console or the NexGenData catalog page and we'll spec it. Many of the listed actors started life as custom builds for a single customer.
Can I export to BI tools or warehouses?
Yes — every Apify run writes to a dataset you can pull as CSV, JSON, JSONL, or Excel, or push via webhook into BigQuery, Snowflake, Postgres, Google Sheets, Airtable, or any system that accepts a POST. The dashboard guides on this site walk through the FX and sentiment-pipeline patterns that work just as well for APAC data.
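If you pull the JSON and want to reshape it locally before loading a warehouse, the standard library is enough. The sketch below is one way to do it, assuming a list of dicts with possibly uneven keys; the records and field names are made up for illustration.

```python
import csv
import io
import json


def to_jsonl(items):
    """Serialise items one JSON object per line, keeping CJK text readable."""
    return "".join(json.dumps(item, ensure_ascii=False) + "\n" for item in items)


def to_csv(items):
    """Serialise items to CSV, using the union of keys in first-seen order."""
    fieldnames = []
    for item in items:
        for key in item:
            if key not in fieldnames:
                fieldnames.append(key)
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer, fieldnames=fieldnames, restval="", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(items)  # missing keys are filled with restval
    return buffer.getvalue()


# Placeholder records with uneven keys.
items = [
    {"ticker": "0001", "name": "示例公司", "pe": 12.5},
    {"ticker": "0002", "name": "Example Co", "sector": "Industrials"},
]

jsonl_text = to_jsonl(items)
csv_text = to_csv(items)
print(jsonl_text)
print(csv_text)
```

Setting `ensure_ascii=False` keeps native-language names as real characters instead of `\u` escapes, which makes spot checks in a warehouse console much easier.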
See also: New -- PSE Edge Disclosures