I've been building Scrapamoja — a production-grade Python web scraping
framework that's evolving into a hybrid browser-API extraction platform.
🔗 GitHub: https://github.com/TisoneK/scrapamoja
What it does
Scrapamoja is designed for serious scraping workloads. Current highlights:
- Async-first HTTP client built on
httpx - Chainable builder interface:
HttpClient → RequestBuilder → .execute() - Custom per-domain token bucket rate limiting
- Direct API mode (SCR-001) — bypass browser overhead for API-backed sites
- Clean modular architecture with strict type checking
Status
Actively in development. Phase 1 (Direct API Mode) is underway with 51
passing tests. Phase 2 will add response caching with TTL.
Feedback, issues, and contributions welcome!
Top comments (0)