Web Scraping With Rotating Proxies: Complete Setup Guide
If you have ever had your scraper blocked after a few hundred requests, you know the pain. Rotating proxies are the solution — they automatically cycle through different IP addresses so your requests appear to come from different users.
This guide covers proxy types, rotation strategies, and working Python code to build a robust scraping setup.
Why You Need Rotating Proxies
Websites detect and block scrapers using several signals:
- IP frequency: Too many requests from one IP
- Geographic patterns: Requests from data center IPs
- Behavioral analysis: Non-human request patterns
- Rate limiting: Hard caps on requests per IP
Rotating proxies solve all of these by distributing your requests across hundreds or thousands of IPs.
Residential vs Datacenter Proxies
| Feature | Residential | Datacenter |
|---|---|---|
| IP Source | Real ISP connections | Cloud servers |
| Detection Rate | Very low | Higher |
| Speed | Moderate | Fast |
| Cost | $5-15/GB | $1-3/GB |
| Best For | Protected sites | Simple targets |
| Reliability | High | Moderate |
Bottom line: Use residential proxies for sites with anti-bot protection. Use datacenter proxies for simple targets where speed matters more than stealth.
ThorData offers both residential and datacenter proxies with automatic rotation, making it easy to switch between them based on your target.
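The table above can be reduced to a simple selection rule in code. Here is a minimal sketch; the endpoint URLs and the list of protected domains are placeholders for illustration, not real provider values:

```python
from urllib.parse import urlparse

# Placeholder endpoints; substitute your provider's real proxy URLs
RESIDENTIAL = "http://user:pass@residential.proxy.example:9000"
DATACENTER = "http://user:pass@datacenter.proxy.example:8000"

# Domains known to run anti-bot protection (hypothetical examples)
PROTECTED_DOMAINS = {"tickets.example", "www.protected-shop.example"}

def choose_proxy(url):
    """Residential for known-protected domains, datacenter otherwise."""
    host = urlparse(url).netloc
    return RESIDENTIAL if host in PROTECTED_DOMAINS else DATACENTER
```

This keeps fast, cheap datacenter IPs as the default and only pays the residential premium where stealth is actually needed.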
Basic Proxy Rotation in Python
Here is a simple proxy rotator using a list of proxies:
```python
import time
from itertools import cycle

import requests


class ProxyRotator:
    def __init__(self, proxies):
        self.proxies = proxies
        self.proxy_pool = cycle(proxies)
        self.failed = set()

    def get_next_proxy(self):
        """Get the next working proxy from the pool."""
        for _ in range(len(self.proxies)):
            proxy = next(self.proxy_pool)
            if proxy not in self.failed:
                return proxy
        # All proxies have failed; reset and start over
        self.failed.clear()
        return next(self.proxy_pool)

    def fetch(self, url, max_retries=3):
        """Fetch a URL, rotating to a new proxy on failure."""
        for _ in range(max_retries):
            proxy = self.get_next_proxy()
            try:
                resp = requests.get(
                    url,
                    proxies={"http": proxy, "https": proxy},
                    timeout=15,
                    headers={"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"},
                )
                if resp.status_code == 200:
                    return resp
                if resp.status_code == 429:
                    # Rate limited: bench this proxy and back off briefly
                    self.failed.add(proxy)
                    time.sleep(2)
            except requests.exceptions.RequestException:
                self.failed.add(proxy)
        return None


# Usage
proxies = [
    "http://user:pass@proxy1.example.com:8080",
    "http://user:pass@proxy2.example.com:8080",
    "http://user:pass@proxy3.example.com:8080",
]
rotator = ProxyRotator(proxies)
response = rotator.fetch("https://example.com")
```
Smart Rotation With Backoff
Naive round-robin rotation is not enough for serious scraping. Here is a smarter approach with exponential backoff and proxy scoring:
```python
import random
import time
from collections import defaultdict

import requests


class SmartProxyRotator:
    def __init__(self, proxies):
        self.proxies = proxies
        self.scores = defaultdict(lambda: 100)  # Every proxy starts at 100
        self.last_used = defaultdict(float)
        self.cooldown = 2.0  # Minimum seconds between uses of one proxy

    def select_proxy(self):
        """Select a proxy weighted by score, respecting cooldowns."""
        now = time.time()
        available = [
            p for p in self.proxies
            if now - self.last_used[p] >= self.cooldown
        ]
        if not available:
            time.sleep(self.cooldown)
            available = self.proxies
        # Weighted random selection: higher-scoring proxies win more often
        weights = [max(self.scores[p], 1) for p in available]
        proxy = random.choices(available, weights=weights, k=1)[0]
        self.last_used[proxy] = time.time()
        return proxy

    def report_success(self, proxy):
        """Raise a proxy's score on success (capped at 100)."""
        self.scores[proxy] = min(self.scores[proxy] + 10, 100)

    def report_failure(self, proxy):
        """Lower a proxy's score on failure (floored at 0)."""
        self.scores[proxy] = max(self.scores[proxy] - 30, 0)

    def fetch(self, url, max_retries=5):
        """Fetch with score-weighted selection and exponential backoff."""
        for attempt in range(max_retries):
            proxy = self.select_proxy()
            try:
                resp = requests.get(
                    url,
                    proxies={"http": proxy, "https": proxy},
                    timeout=15,
                )
                if resp.status_code == 200:
                    self.report_success(proxy)
                    return resp
                self.report_failure(proxy)
            except requests.exceptions.RequestException:
                self.report_failure(proxy)
            time.sleep(min(2 ** attempt, 10))  # Exponential backoff between retries
        return None
```
Using ThorData Residential Proxies
Instead of managing proxy lists yourself, ThorData handles rotation automatically. You connect to a single endpoint and each request gets a different residential IP:
```python
import time

import requests

THORDATA_PROXY = "http://username:password@proxy.thordata.com:9000"

def scrape_with_thordata(urls):
    """Scrape multiple URLs with automatic IP rotation."""
    session = requests.Session()
    session.proxies = {
        "http": THORDATA_PROXY,
        "https": THORDATA_PROXY,
    }
    session.headers.update({
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"
    })
    results = []
    for url in urls:
        try:
            resp = session.get(url, timeout=20)
            results.append({
                "url": url,
                "status": resp.status_code,
                "content": resp.text[:500],
            })
        except requests.exceptions.RequestException as e:
            results.append({"url": url, "error": str(e)})
        time.sleep(1)  # Respectful pacing
    return results

# Scrape 100 product pages
urls = [f"https://example.com/product/{i}" for i in range(100)]
data = scrape_with_thordata(urls)
print(f"Success rate: {sum(1 for d in data if 'content' in d)}/{len(data)}")
```
Rate Limiting Best Practices
Even with rotating proxies, you should implement rate limiting to be respectful and avoid detection:
```python
import random
import time

class RateLimiter:
    def __init__(self, requests_per_minute=30, jitter=0.5):
        self.interval = 60.0 / requests_per_minute
        self.jitter = jitter
        self.last_request = 0.0

    def wait(self):
        """Sleep just long enough to stay under the request budget."""
        elapsed = time.time() - self.last_request
        delay = self.interval - elapsed
        if delay > 0:
            # Add random jitter so the traffic pattern looks less mechanical
            time.sleep(delay + random.uniform(0, self.jitter))
        self.last_request = time.time()

# Usage
limiter = RateLimiter(requests_per_minute=20)
for url in urls:
    limiter.wait()
    response = rotator.fetch(url)
```
Combining Proxies With ScraperAPI
For sites with heavy anti-bot protection (Cloudflare, DataDome), proxy rotation alone may not be enough. ScraperAPI combines proxy rotation with browser rendering and CAPTCHA solving:
```python
import requests

SCRAPERAPI_KEY = "your_key"

def scrape_protected_site(url):
    """Use ScraperAPI for heavily protected sites."""
    # Pass the target URL via params so it gets URL-encoded correctly
    resp = requests.get(
        "http://api.scraperapi.com",
        params={"api_key": SCRAPERAPI_KEY, "url": url, "render": "true"},
        timeout=60,
    )
    return resp.text if resp.status_code == 200 else None
```
Proxy Rotation Checklist
- Choose the right proxy type — residential for protected sites, datacenter for simple ones
- Implement smart rotation — score-based selection, not just round-robin
- Add rate limiting — 20-30 requests per minute is a safe starting point
- Use random delays — jitter makes your traffic pattern look more natural
- Monitor success rates — if yours drops below 90%, adjust your rotation strategy or proxy type
- Rotate user agents — combine IP rotation with header rotation
- Handle failures gracefully — retry with different proxies, not the same one
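The user-agent rotation item from the checklist can be sketched in a few lines. The User-Agent strings below are a small illustrative pool; in practice you would maintain a larger, regularly refreshed list:

```python
import random

USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36",
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36",
]

def random_headers():
    """Build request headers with a randomly chosen User-Agent."""
    return {
        "User-Agent": random.choice(USER_AGENTS),
        "Accept-Language": "en-US,en;q=0.9",
    }

# Pass fresh headers on each request, e.g.:
# requests.get(url, headers=random_headers(), proxies={...})
```

Pairing a new User-Agent with each rotated IP avoids the giveaway of hundreds of IPs all sending an identical browser fingerprint.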
Conclusion
Rotating proxies are essential for any serious web scraping project. Start with ThorData residential proxies for automatic rotation, add smart scoring and rate limiting in your Python code, and use ScraperAPI when you need CAPTCHA solving and JavaScript rendering on top.
The key is combining good proxy infrastructure with respectful scraping practices — rotate IPs, add delays, and handle errors gracefully.
Follow me for more web scraping tutorials and proxy management guides.