<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Estick Silva</title>
    <description>The latest articles on DEV Community by Estick Silva (@esticksb).</description>
    <link>https://dev.to/esticksb</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3880911%2F70e33ef4-b966-4686-84f6-1ad4d6e5802f.jpg</url>
      <title>DEV Community: Estick Silva</title>
      <link>https://dev.to/esticksb</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/esticksb"/>
    <language>en</language>
    <item>
      <title>How I Use Apify Scrapers to Feed Real Projects With Real Data</title>
      <dc:creator>Estick Silva</dc:creator>
      <pubDate>Wed, 15 Apr 2026 22:28:15 +0000</pubDate>
      <link>https://dev.to/esticksb/how-i-use-apify-scrapers-to-feed-real-projects-with-real-data-2ena</link>
      <guid>https://dev.to/esticksb/how-i-use-apify-scrapers-to-feed-real-projects-with-real-data-2ena</guid>
      <description>&lt;p&gt;Most developers know web scraping exists. Few actually build it into their workflow in a way that scales.&lt;br&gt;
This is how I do it, across three different projects, using actors from ParseForge on Apify.&lt;/p&gt;




&lt;h2&gt;
  
  
  The setup
&lt;/h2&gt;

&lt;p&gt;Apify works like an app store for scrapers. You find an actor, configure your inputs, run it, and get structured data back in JSON or CSV. No infrastructure to manage, no proxies to rotate manually.&lt;br&gt;
The part most tutorials skip is actor quality. Not every actor on the platform is maintained or reliable. After testing several options, ParseForge became my default because the output is consistent and the data structure is clean enough to pipe directly into the next step without heavy transformation.&lt;/p&gt;




&lt;h2&gt;
  
  
  Project 1: Investor research with PitchBook data
&lt;/h2&gt;

&lt;p&gt;The goal was simple: build targeted outreach lists for early-stage startups by filtering investors by industry, stage, and geography.&lt;br&gt;
The flow looks like this:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Run the ParseForge PitchBook scraper with your target filters&lt;/li&gt;
&lt;li&gt;Export the results as JSON&lt;/li&gt;
&lt;li&gt;Clean and deduplicate with a simple script&lt;/li&gt;
&lt;li&gt;Load into a spreadsheet or CRM for outreach&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;What used to take a full day of manual research now runs in under an hour. The data includes investor profiles, portfolio companies, and contact signals that would otherwise require a paid PitchBook subscription to access at scale.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkkaqmzn12qfq4ljvoxbw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fkkaqmzn12qfq4ljvoxbw.png" alt="PitchBook Investor Data - ParseForge" width="800" height="424"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Project 2: Amazon product tracking
&lt;/h2&gt;

&lt;p&gt;For a client project I needed to monitor competitor listings across hundreds of products: prices, ratings, review counts, and availability changes over time.&lt;br&gt;
The flow:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Feed a list of ASINs into the ParseForge Amazon scraper&lt;/li&gt;
&lt;li&gt;Schedule it to run daily via Apify's built-in scheduler&lt;/li&gt;
&lt;li&gt;Store results in a simple database&lt;/li&gt;
&lt;li&gt;Diff the output against the previous day to detect changes&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The result is a lightweight competitive intelligence system that runs automatically and flags anything worth looking at. No manual checking, no missed price drops.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqpeniyvndyave8qvk37r.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqpeniyvndyave8qvk37r.png" alt="AWS Marketplace Scraper - ParseForge" width="800" height="414"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Project 3: A Reddit-aware AI agent
&lt;/h2&gt;

&lt;p&gt;This one is more experimental but probably the most interesting.&lt;br&gt;
The idea was to build an AI that can participate in Reddit communities in a way that feels native, not spammy. The problem with most bots is that they generate text without understanding context. They get downvoted into oblivion because they sound like bots.&lt;br&gt;
To fix that, I needed behavioral data from the actual community.&lt;br&gt;
The flow:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Run the ParseForge Reddit scraper targeting specific subreddits&lt;/li&gt;
&lt;li&gt;Pull posts, comment threads, upvote patterns, and post timing&lt;/li&gt;
&lt;li&gt;Feed that data into a language model as context and training signal&lt;/li&gt;
&lt;li&gt;The model learns vocabulary, tone, which topics get engagement, and when to post&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The scraper gives me structured thread data that the model can actually learn from. It understands that a question phrased one way gets engagement while the same question phrased differently gets ignored. It learns the difference between communities that tolerate self-promotion and ones that do not.&lt;br&gt;
The output is not a bot that blasts content. It is an agent that understands the environment it is operating in.&lt;br&gt;
None of that works without clean, reliable input data. If the scraper gives inconsistent output, the model learns the wrong patterns.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3t6aj1v7y1b30oh28urw.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3t6aj1v7y1b30oh28urw.png" alt="Reddit Posts Scraper - ParseForge" width="800" height="394"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;It wouldn't hurt to take a look at &lt;a href="https://apify.com/parseforge" rel="noopener noreferrer"&gt;Scraper Apify&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>saas</category>
      <category>ai</category>
      <category>programming</category>
      <category>datascience</category>
    </item>
    <item>
      <title>Most Founders Look for Investors the Wrong Way</title>
      <dc:creator>Estick Silva</dc:creator>
      <pubDate>Wed, 15 Apr 2026 17:30:22 +0000</pubDate>
      <link>https://dev.to/esticksb/most-founders-look-for-investors-the-wrong-way-48en</link>
      <guid>https://dev.to/esticksb/most-founders-look-for-investors-the-wrong-way-48en</guid>
      <description>&lt;h2&gt;
  
  
  Most Founders Look for Investors the Wrong Way
&lt;/h2&gt;

&lt;p&gt;Most founders think the hard part is finding investors.&lt;br&gt;
It is not.&lt;/p&gt;

&lt;p&gt;What actually slows people down is not access, but direction.&lt;/p&gt;

&lt;p&gt;You can spend weeks searching, building lists, sending emails, and still get no real traction. Not because there are no investors out there, but because you are talking to the wrong ones.&lt;/p&gt;

&lt;p&gt;This is where most founders get stuck.&lt;/p&gt;

&lt;p&gt;They assume the process is about volume. More emails, more contacts, more effort. But in reality, it is about precision.&lt;/p&gt;

&lt;p&gt;Reaching out to 20 highly relevant investors will always outperform contacting 200 random ones.&lt;/p&gt;

&lt;p&gt;The problem is that most people do not have a clear way to identify who is actually relevant. So they default to manual search, scattered data, and guesswork.&lt;/p&gt;

&lt;p&gt;That approach feels productive, but it rarely leads to results.&lt;/p&gt;




&lt;h2&gt;
  
  
  The usual approach
&lt;/h2&gt;

&lt;p&gt;When people start looking for investors, they typically do this:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Search random lists online&lt;/li&gt;
&lt;li&gt;Send cold emails without clear targeting&lt;/li&gt;
&lt;li&gt;Try platforms like PitchBook&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This feels like progress, but it is inefficient.&lt;/p&gt;

&lt;p&gt;You spend hours researching, switching between tabs, collecting fragmented information, and trying to connect the dots manually.&lt;/p&gt;

&lt;p&gt;Platforms like PitchBook solve part of the problem, but they are expensive and not always accessible early on.&lt;/p&gt;

&lt;p&gt;So most founders end up somewhere in the middle. Too much manual work, not enough structured insight.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fivfcd5zo7vpyxd6h01l1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fivfcd5zo7vpyxd6h01l1.png" alt="PitchBook" width="800" height="352"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  He real problem
&lt;/h2&gt;

&lt;p&gt;The problem is not access to investors. The problem is relevance.&lt;br&gt;
The only investors that matter are the ones who:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;invest in your industry&lt;/li&gt;
&lt;li&gt;operate in your geography&lt;/li&gt;
&lt;li&gt;match your stage&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Everything else is noise.&lt;/p&gt;




&lt;h2&gt;
  
  
  What actually works
&lt;/h2&gt;

&lt;p&gt;The biggest shift is moving from manual search to structured data. Instead of guessing, you define what you need and filter for it.&lt;/p&gt;

&lt;p&gt;With the right dataset, you can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;identify investors that actually fit your startup&lt;/li&gt;
&lt;li&gt;build clean, targeted lists&lt;/li&gt;
&lt;li&gt;focus only on high-quality outreach&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This changes the process completely.&lt;/p&gt;

&lt;p&gt;You stop chasing investors and start approaching the right ones.&lt;/p&gt;




&lt;h2&gt;
  
  
  A practical way to do it
&lt;/h2&gt;

&lt;p&gt;There are different ways to get structured investor data.&lt;/p&gt;

&lt;p&gt;One approach that works well is using scraping-based tools that aggregate this information and let you filter it easily.&lt;/p&gt;

&lt;p&gt;For example, I have been using a PitchBook investors scraper available on Apify, built by ParseForge.&lt;/p&gt;

&lt;p&gt;It allows you to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;access a large pool of investor profiles&lt;/li&gt;
&lt;li&gt;filter by industry, location, and investor type&lt;/li&gt;
&lt;li&gt;export the data for outreach or analysis&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It is not a replacement for due diligence, but it removes most of the manual work at the beginning.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Femfl9xsfla5vn713uxgx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Femfl9xsfla5vn713uxgx.png" alt="PitchBook Scraper - ParseForge" width="800" height="470"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Why this matters
&lt;/h2&gt;

&lt;p&gt;Most founders waste time trying to “find investors”.&lt;/p&gt;

&lt;p&gt;The real advantage comes from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;targeting correctly&lt;/li&gt;
&lt;li&gt;working with structured data&lt;/li&gt;
&lt;li&gt;moving faster than others&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Once you have that, outreach becomes more predictable.&lt;/p&gt;




&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Finding investors is not about having access to exclusive platforms.&lt;/p&gt;

&lt;p&gt;It is about using data in a smarter way. When you stop searching randomly and start filtering intentionally, the process becomes faster, clearer, and much more effective.&lt;/p&gt;




&lt;p&gt;If you'd like to try this tool, here it is: &lt;a href="https://apify.com/parseforge/pitchbook-investors-scraper" rel="noopener noreferrer"&gt;Pitchbook Scraper | Investor Data&lt;/a&gt;&lt;/p&gt;

</description>
      <category>webscraping</category>
      <category>startup</category>
      <category>data</category>
      <category>productivity</category>
    </item>
  </channel>
</rss>
