Discussion on: Scraping Facebook groups using Python? Avoid getting blocked with Crawlbase

View post

Replies for: You could have tried with selenium (+ chromedriver) with beautifulsoup and requests.

yellow1912 • Nov 13 '20

The problem is that most services like Facebook will try to block you if you go over rate limit. In the end you may still have to pay for proxy service like these to ensure you don't get yourself blocked. There are also free proxy out there but in my experience they are unreliable.

AlphaSierra • Nov 13 '20 • Edited

That's why I said to use selenium. It fools servers to think that an actual user is browsing. Although this will not be feasable if your internet connection is too slow. Here checkout this project of mine where I have used selenium to scrape amazon: github.com/Shetty073/amazon-top-de...

Edit: Also instead of scraping please checkout facebook's API, you might get what you want easily without scraping.

yellow1912 • Nov 13 '20

I'm not sure. Perhaps my use case is different. I scrap Instagram images for my users (scrap their own accounts). Since there are so many users, so many accounts, I always end up going over rate limit.

Pushkar Kathayat • Mar 7 '23

selenium is slow af