Every Trustpilot business page contains a hidden data layer that most scrapers completely ignore.
While other tools fight with JavaScript rendering, cookie walls, and anti-bot systems — there's a <script> tag sitting right in the HTML source that contains every single review in a machine-readable format.
It's called JSON-LD, and it's there because Trustpilot wants Google to read it.
What Is JSON-LD?
JSON-LD (Linked Data) is a standard that websites use to tell search engines about their content. When you see star ratings in Google search results — that's JSON-LD at work.
Trustpilot embeds this data in every business page:
<script type="application/ld+json">
{
"@type": "LocalBusiness",
"name": "Company Name",
"aggregateRating": {
"ratingValue": "4.2",
"reviewCount": "1847",
"bestRating": "5"
}
}
</script>
And on review pages, each review is a separate JSON-LD object:
{
"@type": "Review",
"author": {"@type": "Person", "name": "John D."},
"reviewRating": {"@type": "Rating", "ratingValue": "5", "bestRating": "5"},
"reviewBody": "Best customer service I have ever experienced...",
"datePublished": "2026-03-15"
}
Why This Is Better Than HTML Scraping
| Factor | HTML Parsing | JSON-LD |
|---|---|---|
| Breaks on redesign | Yes, constantly | Never (Schema.org standard) |
| Needs JavaScript | Often yes | No — it's in raw HTML |
| Data structure | Inconsistent | Consistent, standardized |
| Anti-bot risk | High | Zero (it's meant to be read) |
| Speed | Slow (render JS) | Fast (one HTTP request) |
JSON-LD follows the Schema.org specification, which hasn't changed its core review schema since 2013. It's the most stable scraping target on the internet.
What You Can Do With This Data
Competitive analysis at scale. Pull 5 competitors' Trustpilot reviews in 60 seconds. Build a comparison matrix: what features do their customers praise? What do they complain about?
Sentiment tracking over time. Schedule weekly scrapes. Plot average rating over months. Spot when a competitor's service quality drops — that's your moment to poach their customers.
Feature gap analysis. Extract every review mentioning a specific feature. Count positive vs negative mentions. Identify what's missing in the market.
Sales intelligence. "Your competitor has a 3.2 on Trustpilot. Here's what their customers say is broken. We solve exactly that." Data-backed sales pitch.
The Tool
I built a Trustpilot Review Scraper on Apify that uses JSON-LD exclusively. No headless browser, no JavaScript rendering, no anti-bot bypassing. Just clean, structured data extraction from a publicly available data layer.
Enter a Trustpilot business URL → get every review as structured JSON. Rating, text, author, date, response from business.
Combine With Other Data Sources
Reviews are one signal. For complete competitive intelligence, combine with:
- Google News Scraper — what's being said in the media?
- Reddit Scraper — unfiltered community opinions
- YouTube Comments — audience reactions to reviews/comparisons
- Indeed Jobs Scraper — are they hiring or laying off?
All 77 tools: Apify Store | GitHub
Need competitive review analysis done for you? I'll pull reviews for 5 competitors, structure the data, and deliver a comparison report. $20, 24h delivery.
Order via Payoneer | Email: Spinov001@gmail.com
Top comments (0)