You want to track competitor prices automatically. Maybe you run an e-commerce store and need to stay competitive, or you're a deal hunter who wants alerts when prices drop. Either way, a price monitoring system is one of the most practical things you can build with Python.
I've built these systems for clients and for my own projects. Here's what actually works in 2026, what breaks, and how to keep it running reliably.
## The Architecture
A price monitoring system has four parts:
- Scraper — fetches product pages and extracts prices
- Storage — saves price history over time
- Scheduler — runs the scraper at regular intervals
- Alerting — notifies you when prices change
Let's build each one.
## Part 1: The Scraper
Start simple. For most product pages, requests + BeautifulSoup gets the job done:
```python
import requests
from bs4 import BeautifulSoup
from datetime import datetime

def scrape_price(url, css_selector):
    """Scrape a single product price from a URL."""
    headers = {
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
                      "AppleWebKit/537.36 (KHTML, like Gecko) "
                      "Chrome/131.0.0.0 Safari/537.36"
    }
    response = requests.get(url, headers=headers, timeout=15)
    response.raise_for_status()

    soup = BeautifulSoup(response.text, "html.parser")
    price_element = soup.select_one(css_selector)
    if not price_element:
        raise ValueError(f"Price not found at {url}")

    # Clean the price text: remove currency symbols, commas
    price_text = price_element.get_text(strip=True)
    price_text = price_text.replace("$", "").replace(",", "").replace("£", "").replace("€", "")

    return {
        "url": url,
        "price": float(price_text),
        "timestamp": datetime.now().isoformat(),
    }
```
This works for static pages. But here's the reality check: many e-commerce sites in 2026 render prices with JavaScript. If your target site uses React, Next.js, or any SPA framework, requests gets back HTML in which the price container is empty — the price only appears after the page's JavaScript runs.
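One more wrinkle: the `.replace()` chain above assumes clean US-style prices. Real sites emit things like `"1.299,00 €"` or `"Now: £49.99"`. Here's a more defensive parser — `parse_price` is my own sketch, not part of the code above, and its separator heuristic is a best guess, not a standard:

```python
import re

def parse_price(text):
    """Extract a float from price text like '$1,299.00', '1.299,00 €', or '£49.99'."""
    # Grab the first number-like token, allowing spaces, commas, and dots inside
    match = re.search(r"\d[\d\s.,]*", text)
    if not match:
        raise ValueError(f"No price found in {text!r}")
    token = match.group().strip().replace(" ", "")
    last_comma, last_dot = token.rfind(","), token.rfind(".")
    if last_comma > last_dot and len(token) - last_comma <= 3:
        # Comma followed by 1-2 digits at the end: treat it as the decimal
        # mark (European style, e.g. "1.299,00")
        token = token.replace(".", "").replace(",", ".")
    else:
        # Otherwise commas are thousands separators (e.g. "1,299.00" or "1,299")
        token = token.replace(",", "")
    return float(token)
```

Swap this in for the replace chain once you monitor sites in more than one locale.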
## When You Need a Browser
For JS-rendered pages, use Playwright:
```python
from playwright.sync_api import sync_playwright

def scrape_price_js(url, css_selector):
    """Scrape price from a JS-rendered page."""
    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url, wait_until="networkidle")
            price_element = page.query_selector(css_selector)
            if not price_element:
                raise ValueError(f"Price not found at {url}")
            price_text = price_element.inner_text().strip()
            price_text = price_text.replace("$", "").replace(",", "")
            return float(price_text)
        finally:
            # Always close the browser, even when the scrape raises
            browser.close()
```
Playwright is heavier — slower and uses more RAM. For monitoring 10 products, it's fine. For 500+, you need a different approach.
## When to Use a Scraping API
If you're monitoring prices across major e-commerce sites (Amazon, Walmart, Best Buy), you'll hit anti-bot systems fast. CAPTCHAs, IP bans, fingerprint detection — these sites invest millions in blocking scrapers.
This is where scraping APIs save you time:
- ScraperAPI — handles proxy rotation and CAPTCHA solving. Good for e-commerce sites. 5,000 free API calls to start.
- Scrape.do — similar approach with competitive pricing and JS rendering support.
- ScrapeOps — proxy aggregator that lets you compare providers. Useful if you want to optimize cost per request.
Here's how price monitoring looks with ScraperAPI:
```python
import requests
from bs4 import BeautifulSoup

SCRAPERAPI_KEY = "your_api_key"

def scrape_price_api(url, css_selector):
    """Scrape price using ScraperAPI to handle anti-bot measures."""
    # Pass the target URL via params so it gets URL-encoded properly —
    # product URLs often contain ? and & that would break a raw f-string
    response = requests.get(
        "http://api.scraperapi.com",
        params={"api_key": SCRAPERAPI_KEY, "url": url, "render": "true"},
        timeout=60,
    )
    response.raise_for_status()

    soup = BeautifulSoup(response.text, "html.parser")
    price_element = soup.select_one(css_selector)
    if not price_element:
        raise ValueError(f"Price not found at {url}")

    price_text = price_element.get_text(strip=True)
    price_text = price_text.replace("$", "").replace(",", "")
    return float(price_text)
```
My recommendation: start free with requests. When you start getting blocked, switch to a scraping API. Don't over-engineer it upfront.
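That escalation strategy can be captured in code: try the cheap scraper first and only fall back to heavier ones when it fails. This helper is my own sketch — `scrapers` is any list of callables with the `(url, css_selector)` signature used above, ordered cheapest first:

```python
def scrape_with_fallback(url, css_selector, scrapers):
    """Try each scraper in order; return the first price that comes back.

    `scrapers` is a list of callables like scrape_price, scrape_price_js,
    scrape_price_api — cheapest first.
    """
    errors = []
    for scraper in scrapers:
        try:
            return scraper(url, css_selector)
        except Exception as e:  # blocked, timeout, selector miss, ...
            errors.append(f"{scraper.__name__}: {e}")
    raise RuntimeError(f"All scrapers failed for {url}: {'; '.join(errors)}")
```

Usage: `scrape_with_fallback(url, sel, [scrape_price, scrape_price_api])` — you only pay for API calls when the free path gets blocked.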
## Part 2: Storage
SQLite is perfect for this. No server to manage, and it handles thousands of price records easily:
```python
import sqlite3
from datetime import datetime

def init_db(db_path="prices.db"):
    conn = sqlite3.connect(db_path)
    conn.execute("""
        CREATE TABLE IF NOT EXISTS prices (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            product_name TEXT NOT NULL,
            url TEXT NOT NULL,
            price REAL NOT NULL,
            timestamp TEXT NOT NULL
        )
    """)
    conn.commit()
    return conn

def save_price(conn, product_name, url, price):
    # Store timestamps as "YYYY-MM-DD HH:MM:SS" so they compare correctly
    # against SQLite's datetime() output below (isoformat's default "T"
    # separator would not)
    conn.execute(
        "INSERT INTO prices (product_name, url, price, timestamp) VALUES (?, ?, ?, ?)",
        (product_name, url, price, datetime.now().isoformat(sep=" ", timespec="seconds"))
    )
    conn.commit()

def get_price_history(conn, product_name, days=30):
    """Get price history for a product over the last N days."""
    # 'localtime' keeps the comparison in the same timezone as datetime.now()
    cursor = conn.execute(
        """SELECT price, timestamp FROM prices
           WHERE product_name = ?
             AND timestamp > datetime('now', 'localtime', ?)
           ORDER BY timestamp""",
        (product_name, f"-{days} days")
    )
    return cursor.fetchall()
```
## Part 3: Scheduling
Use the `schedule` library for simplicity or cron for reliability:
```python
import schedule
import time

# Define your products to monitor
PRODUCTS = [
    {
        "name": "Sony WH-1000XM5",
        "url": "https://example.com/product/sony-xm5",
        "selector": "span.price",
    },
    {
        "name": "MacBook Air M3",
        "url": "https://example.com/product/macbook-air-m3",
        "selector": "div.product-price",
    },
]

def monitor_all():
    conn = init_db()
    for product in PRODUCTS:
        try:
            price = scrape_price(product["url"], product["selector"])
            save_price(conn, product["name"], product["url"], price["price"])
            print(f"[OK] {product['name']}: ${price['price']}")
            check_price_alert(conn, product["name"], price["price"])
        except Exception as e:
            print(f"[ERROR] {product['name']}: {e}")
    conn.close()

# Run every 6 hours
schedule.every(6).hours.do(monitor_all)

if __name__ == "__main__":
    monitor_all()  # Run once immediately
    while True:
        schedule.run_pending()
        time.sleep(60)
```
For production, run this as a systemd service or a cron job. The `schedule` library is fine for development, but cron is more robust for long-running monitors.
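For the cron route, have cron call `monitor_all()` once per invocation instead of running the `schedule` loop. A sketch of the crontab entry — the paths and module name (`monitor`) are examples, adjust to your setup:

```shell
# crontab -e — run the monitor every 6 hours and append output to a log.
# Assumes the code above lives in /home/you/price-monitor/monitor.py.
0 */6 * * * cd /home/you/price-monitor && /usr/bin/python3 -c "import monitor; monitor.monitor_all()" >> monitor.log 2>&1
```

With cron doing the timing, a crash in one run can't take down the whole monitor — the next run starts fresh.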
## Part 4: Alerting
Send yourself an email or Slack message when prices drop:
```python
import os
import smtplib
from email.mime.text import MIMEText

def check_price_alert(conn, product_name, current_price, threshold_pct=5):
    """Alert if price dropped more than threshold_pct from the last recorded price."""
    # OFFSET 1 skips the row we just saved and fetches the previous price
    cursor = conn.execute(
        """SELECT price FROM prices
           WHERE product_name = ?
           ORDER BY timestamp DESC LIMIT 1 OFFSET 1""",
        (product_name,)
    )
    row = cursor.fetchone()
    if not row:
        return  # No previous price to compare

    previous_price = row[0]
    change_pct = ((current_price - previous_price) / previous_price) * 100

    if change_pct <= -threshold_pct:
        send_alert(
            subject=f"Price Drop: {product_name}",
            body=f"{product_name} dropped {abs(change_pct):.1f}%!\n"
                 f"Was: ${previous_price:.2f}\n"
                 f"Now: ${current_price:.2f}"
        )

def send_alert(subject, body):
    """Send email alert. Replace with Slack webhook if preferred."""
    msg = MIMEText(body)
    msg["Subject"] = subject
    msg["From"] = "monitor@yourdomain.com"
    msg["To"] = "you@yourdomain.com"

    with smtplib.SMTP("smtp.yourdomain.com", 587) as server:
        server.starttls()
        # Keep the password in an environment variable, not in source
        server.login("monitor@yourdomain.com", os.environ["SMTP_PASSWORD"])
        server.send_message(msg)
```
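If you'd rather get alerts in Slack, an incoming webhook avoids SMTP entirely. This is a sketch using only the standard library — the webhook URL is a placeholder you get from Slack's "Incoming Webhooks" app config:

```python
import json
import urllib.request

SLACK_WEBHOOK_URL = "https://hooks.slack.com/services/XXX/YYY/ZZZ"  # your webhook URL

def build_slack_payload(subject, body):
    """Format an alert as a Slack incoming-webhook JSON payload."""
    return json.dumps({"text": f"*{subject}*\n{body}"}).encode("utf-8")

def send_slack_alert(subject, body):
    """POST the alert to the Slack webhook; returns True on HTTP 200."""
    req = urllib.request.Request(
        SLACK_WEBHOOK_URL,
        data=build_slack_payload(subject, body),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.status == 200
```

Swap `send_slack_alert` in for `send_alert` in `check_price_alert` and you're done — the calling signature is the same.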
## Putting It All Together
Here's the complete flow:
1. Define your products with URLs and CSS selectors
2. The scheduler runs every 6 hours (adjust based on how often prices change)
3. Each run scrapes all products and saves results to SQLite
4. If a price drops more than 5%, you get an alert
Scaling tips:
- Add random delays (2-5 seconds) between requests to avoid detection
- Rotate User-Agent strings
- For 50+ products across major retailers, use ScraperAPI or Scrape.do — the time you save debugging blocks is worth the cost
- Use ScrapeOps to monitor your scraper's success rates over time
## Common Pitfalls
Selectors break. Sites redesign. Your CSS selector that worked yesterday returns nothing today. Build in error handling and alerts for failed scrapes, not just price drops.
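One way to catch broken selectors: count consecutive failures per product and alert once they cross a threshold. A sketch — `alert_fn` stands in for whatever alert sender you use, and the class name is my own:

```python
class FailureTracker:
    """Alert when a product fails to scrape several runs in a row."""

    def __init__(self, alert_fn, max_failures=3):
        self.alert_fn = alert_fn
        self.max_failures = max_failures
        self.failures = {}  # product_name -> consecutive failure count

    def record_success(self, product_name):
        # Any successful scrape resets the streak
        self.failures[product_name] = 0

    def record_failure(self, product_name, error):
        count = self.failures.get(product_name, 0) + 1
        self.failures[product_name] = count
        if count == self.max_failures:  # fire exactly once per streak
            self.alert_fn(
                subject=f"Scraper broken: {product_name}",
                body=f"{product_name} failed {count} runs in a row. Last error: {error}",
            )
```

Wire `record_success` / `record_failure` into the try/except in your monitoring loop, and a dead selector pages you instead of silently flatlining your price history.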
Prices vary by location. Some sites show different prices based on your IP's geolocation. If you're using proxies, be consistent with the region.
Dynamic pricing is real. Some sites change prices based on visit frequency. Don't scrape the same product from the same IP every 5 minutes.
## Want the Full System?
I wrote a complete guide covering advanced patterns — handling login-protected prices, building price comparison dashboards, and deploying monitors on cheap VPS instances.
Get the complete Web Scraping Playbook — $9 on Gumroad
It includes ready-to-deploy code for price monitoring, SERP tracking, and more.
Questions or need help adapting this to your use case? Reach me at hustler@curlship.com.