Hacker News "Who is Hiring?" threads are goldmines for job market intelligence. Posted monthly, they contain hundreds of real job listings from startups and tech companies. Here's how to build a monitoring system that extracts hiring signals.
The Data Opportunity
Each monthly thread gets 500-1000+ comments, each one a real company posting real roles. This data reveals which technologies are in demand, salary trends, remote work patterns, and which companies are scaling.
Setup
pip install requests beautifulsoup4 pandas
Fetching HN Job Threads
Hacker News has a free API — no authentication needed:
import requests
import re
from datetime import datetime
def get_monthly_hiring_threads(months=6):
    """Return metadata for the most recent monthly "Who is hiring?" threads.

    Queries the Algolia HN Search API (no authentication needed) for
    stories posted by the `whoishiring` account.

    Args:
        months: Number of monthly threads to fetch (one per month).

    Returns:
        A list of dicts with keys "id", "title", and "date" (ISO timestamp).

    Raises:
        requests.HTTPError: If the API responds with an error status.
        requests.Timeout: If the API does not respond within 30 seconds.
    """
    search_url = "https://hn.algolia.com/api/v1/search"
    params = {
        "query": "Ask HN: Who is hiring?",
        "tags": "story,author_whoishiring",
        "hitsPerPage": months,
    }
    # A timeout keeps the monitor from hanging forever on a stalled
    # connection; raise_for_status surfaces HTTP errors before we try
    # to decode the body as JSON.
    response = requests.get(search_url, params=params, timeout=30)
    response.raise_for_status()
    threads = response.json()["hits"]
    return [
        {"id": t["objectID"], "title": t["title"], "date": t["created_at"]}
        for t in threads
    ]
# Fetch the six most recent monthly threads and print a one-line summary
# of each: date, title, and the story ID used for later extraction.
threads = get_monthly_hiring_threads(6)
for thread in threads:
    print(f"{thread['date'][:10]}: {thread['title']} (ID: {thread['id']})")
Extracting Job Listings
Each top-level comment is a job posting:
def extract_job_listings(thread_id):
    """Parse every top-level comment of a hiring thread into a job listing.

    Args:
        thread_id: The HN story ID of a monthly hiring thread.

    Returns:
        A list of listing dicts (the fields from parse_job_posting plus
        "thread_id", "comment_id", and "posted_at").

    Raises:
        requests.HTTPError: If the API responds with an error status.
        requests.Timeout: If the API does not respond within 30 seconds.
    """
    url = f"https://hn.algolia.com/api/v1/items/{thread_id}"
    # Timeout + status check: fail fast instead of hanging or decoding
    # an HTML error page as JSON.
    response = requests.get(url, timeout=30)
    response.raise_for_status()
    thread = response.json()
    listings = []
    for comment in thread.get("children", []):
        # Deleted/empty comments have no usable "text" field — skip them.
        # Single .get() avoids the original's double dict lookup.
        text = comment.get("text")
        if not text:
            continue
        listing = parse_job_posting(text)
        listing["thread_id"] = thread_id
        listing["comment_id"] = comment["id"]
        listing["posted_at"] = comment.get("created_at", "")
        listings.append(listing)
    return listings
def parse_job_posting(html_text):
    """Extract structured fields from a single HN job-posting comment.

    Args:
        html_text: Raw HTML of the comment body.

    Returns:
        A dict with keys "company", "remote" (bool), "technologies"
        (list of matched tech names), "salary_range" (matched string or
        None), and "full_text" (first 500 chars of the plain text).
    """
    from bs4 import BeautifulSoup
    text = BeautifulSoup(html_text, "html.parser").get_text()
    # Convention in these threads: "Company | Location | Role | ..." on
    # the first line, so the company is everything before the first pipe.
    lines = text.strip().split("\n")
    company = lines[0].split("|")[0].strip() if lines else ""
    # Detect remote work
    remote = bool(re.search(r"\b(remote|distributed|anywhere)\b", text, re.I))
    # Extract technologies
    tech_patterns = [
        "Python", "JavaScript", "TypeScript", "React", "Node", "Go", "Rust",
        "Java", "Kotlin", "Swift", "Ruby", "Rails", "Django", "FastAPI",
        "AWS", "GCP", "Azure", "Kubernetes", "Docker", "PostgreSQL"
    ]
    techs = []
    for tech in tech_patterns:
        # "Go" is also a common English word ("go to our site"), so match
        # it case-sensitively; everything else stays case-insensitive.
        # re.escape keeps future entries like "C++" from breaking the regex.
        flags = 0 if tech == "Go" else re.I
        if re.search(rf"\b{re.escape(tech)}\b", text, flags):
            techs.append(tech)
    # Extract salary ranges like "$150k - $180k" or "$150,000-$180,000"
    salary_match = re.search(r"\$([\d,]+)k?\s*[-–]\s*\$?([\d,]+)k?", text)
    salary_range = salary_match.group(0) if salary_match else None
    return {
        "company": company,
        "remote": remote,
        "technologies": techs,
        "salary_range": salary_range,
        "full_text": text[:500]
    }
Building the Monitor
import pandas as pd
from collections import Counter
def analyze_hiring_trends(months=3):
    """Aggregate listings across recent threads and print trend summaries.

    Fetches the last `months` hiring threads, parses every listing, and
    prints: the top technologies by listing count, the share of
    remote-friendly posts, and how many listings include salary info.

    Args:
        months: Number of monthly threads to include.

    Returns:
        A pandas DataFrame with one row per listing (empty if no
        listings were found).
    """
    threads = get_monthly_hiring_threads(months)
    all_listings = []
    for thread in threads:
        listings = extract_job_listings(thread["id"])
        # Tag each listing with its thread's date so month-over-month
        # trends can be computed from the combined frame.
        for listing in listings:
            listing["thread_date"] = thread["date"]
        all_listings.extend(listings)
    # Guard the empty case: with no rows, the percentage math below would
    # divide by zero and the column lookups would raise KeyError.
    if not all_listings:
        print("No listings found.")
        return pd.DataFrame()
    df = pd.DataFrame(all_listings)
    # Technology demand ranking: flatten the per-listing tech lists and
    # count occurrences across all listings.
    all_techs = [t for techs in df["technologies"] for t in techs]
    tech_counts = Counter(all_techs).most_common(15)
    print("\nTop Technologies in Demand:")
    for tech, count in tech_counts:
        pct = count / len(df) * 100
        print(f"  {tech}: {count} listings ({pct:.1f}%)")
    # Remote work trend: mean of a boolean column is the True fraction.
    remote_pct = df["remote"].mean() * 100
    print(f"\nRemote-friendly: {remote_pct:.1f}% of listings")
    # Salary analysis
    salary_df = df[df["salary_range"].notna()]
    print(f"Listings with salary info: {len(salary_df)} ({len(salary_df)/len(df)*100:.1f}%)")
    return df
# Run the analysis over the three most recent monthly threads.
df = analyze_hiring_trends(3)
Setting Up Alerts
Get notified when specific conditions appear:
def check_hiring_alerts(df, alerts):
    """Scan parsed listings for user-defined alert conditions.

    Args:
        df: DataFrame of listings with "technologies" and "company" columns.
        alerts: List of dicts, each with "type" ("technology" or
            "company") and "value".

    Returns:
        A list of {"alert": <alert dict>, "listing": <matching row>}
        entries, one per (row, alert) match.
    """
    hits = []
    for _, listing in df.iterrows():
        for rule in alerts:
            kind = rule["type"]
            if kind == "technology":
                # Exact (case-sensitive) membership in the tech list.
                matched = rule["value"] in listing["technologies"]
            elif kind == "company":
                # Case-insensitive substring match on the company name.
                matched = rule["value"].lower() in listing["company"].lower()
            else:
                # Unknown alert types are silently skipped.
                matched = False
            if matched:
                hits.append({"alert": rule, "listing": listing})
    return hits
# Watch for two hot technologies plus one target company.
my_alerts = [
    {"type": "technology", "value": tech} for tech in ("Rust", "FastAPI")
] + [
    {"type": "company", "value": "Stripe"}
]

matches = check_hiring_alerts(df, my_alerts)
print(f"Found {len(matches)} matching listings")
Recommended Tools
- ScraperAPI for handling rate limits when scraping at scale
- ThorData for reliable proxy rotation
- ScrapeOps for monitoring your scraper's performance
Conclusion
Monitoring HN hiring threads gives you a real-time pulse on the tech job market. The data is public, structured enough to parse, and refreshed monthly. Build the monitor once, and you'll have an ongoing feed of hiring intelligence that most people overlook.
Top comments (0)