<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Philippe Greenleaf</title>
    <description>The latest articles on DEV Community by Philippe Greenleaf (@philgreenleaf).</description>
    <link>https://dev.to/philgreenleaf</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2716381%2Fa0d1584d-c05d-4cf8-bea1-12820892d265.png</url>
      <title>DEV Community: Philippe Greenleaf</title>
      <link>https://dev.to/philgreenleaf</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/philgreenleaf"/>
    <language>en</language>
    <item>
      <title>Scraping real estate data with Python to find opportunities</title>
      <dc:creator>Philippe Greenleaf</dc:creator>
      <pubDate>Wed, 15 Jan 2025 15:19:41 +0000</pubDate>
      <link>https://dev.to/philgreenleaf/scraping-real-estate-data-with-python-to-find-opportunities-j1d</link>
      <guid>https://dev.to/philgreenleaf/scraping-real-estate-data-with-python-to-find-opportunities-j1d</guid>
      <description>&lt;p&gt;In this tutorial, we'll explore how to &lt;strong&gt;scrape real estate data&lt;/strong&gt; from an API using Python's &lt;code&gt;requests&lt;/code&gt; library. We'll also learn how to apply filters to retrieve &lt;strong&gt;potential bargain properties&lt;/strong&gt; whose prices have recently dropped.&lt;/p&gt;




&lt;h2&gt;
  
  
  Introduction
&lt;/h2&gt;

&lt;p&gt;When hunting for &lt;strong&gt;great real estate opportunities&lt;/strong&gt;, one of the best indicators can be a &lt;strong&gt;recent price drop&lt;/strong&gt;. Having a tool that quickly shows you &lt;strong&gt;only&lt;/strong&gt; these properties can save you tons of time—and might help you scoop up a deal before everyone else notices!&lt;/p&gt;

&lt;p&gt;In this post, we’ll:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Discuss the basics of using &lt;code&gt;requests&lt;/code&gt; to interact with a real estate API.
&lt;/li&gt;
&lt;li&gt;Learn how to &lt;strong&gt;filter results&lt;/strong&gt; using query parameters—particularly focusing on &lt;strong&gt;price variation&lt;/strong&gt; queries.
&lt;/li&gt;
&lt;li&gt;Parse and display the returned data in a concise format.&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  Requirements
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Python 3&lt;/strong&gt; installed
&lt;/li&gt;
&lt;li&gt;A terminal or command-line prompt
&lt;/li&gt;
&lt;li&gt;Basic familiarity with the Python &lt;code&gt;requests&lt;/code&gt; library
&lt;/li&gt;
&lt;li&gt;An API key (if required by the API)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Step 1: Understanding the API
&lt;/h2&gt;

&lt;p&gt;The API we use might respond with data such as:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Property ID
&lt;/li&gt;
&lt;li&gt;Title or Address
&lt;/li&gt;
&lt;li&gt;Price
&lt;/li&gt;
&lt;li&gt;Location
&lt;/li&gt;
&lt;li&gt;Historical price changes
&lt;/li&gt;
&lt;li&gt;Other relevant information
&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Key Query Parameters
&lt;/h3&gt;

&lt;p&gt;This API supports several &lt;strong&gt;query parameters&lt;/strong&gt; that help us &lt;strong&gt;filter the results&lt;/strong&gt;:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Parameter&lt;/th&gt;
&lt;th&gt;Type&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;includedDepartments[]&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;array&lt;/td&gt;
&lt;td&gt;Filter by department(s). Example: &lt;code&gt;departments/77&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;fromDate&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;date&lt;/td&gt;
&lt;td&gt;Only retrieve properties listed (or updated) after this date.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;propertyTypes[]&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;array&lt;/td&gt;
&lt;td&gt;Filter by property type. Example: &lt;code&gt;0&lt;/code&gt; for apartments, &lt;code&gt;1&lt;/code&gt; for houses, etc.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;transactionType&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;string&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;0&lt;/code&gt; for sale, &lt;code&gt;1&lt;/code&gt; for rent, etc.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;withCoherentPrice&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;bool&lt;/td&gt;
&lt;td&gt;Only retrieve properties whose price is coherent with the market.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;budgetMin&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;number&lt;/td&gt;
&lt;td&gt;Minimum budget threshold.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;budgetMax&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;number&lt;/td&gt;
&lt;td&gt;Maximum budget threshold.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;eventPriceVariationFromCreatedAt&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;date&lt;/td&gt;
&lt;td&gt;Only consider price-change events created on or after this date (inclusive).&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;eventPriceVariationMin&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;number&lt;/td&gt;
&lt;td&gt;Minimum percentage of price variation (negative or positive).&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;eventPriceVariationMax&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;number&lt;/td&gt;
&lt;td&gt;Maximum percentage of price variation (negative or positive).&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;We’ll especially focus on the &lt;strong&gt;eventPriceVariation&lt;/strong&gt; parameters to &lt;strong&gt;find properties that have decreased in price&lt;/strong&gt;.&lt;/p&gt;
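
&lt;p&gt;The &lt;code&gt;[]&lt;/code&gt; suffix marks parameters that can carry several values. As a rough sketch of how such a filter ends up in the query string, the snippet below encodes two departments with the standard library. The second department code, &lt;code&gt;departments/75&lt;/code&gt;, is only an illustrative assumption, and it is worth confirming in the API docs that repeated keys are the expected format.&lt;/p&gt;

```python
from urllib.parse import urlencode, parse_qs

# Hypothetical filter: two departments and one property type.
params = {
    "includedDepartments[]": ["departments/77", "departments/75"],
    "propertyTypes[]": ["1"],
    "transactionType": "0",
}

# doseq=True repeats the key once per list element.
query = urlencode(params, doseq=True)
print(query)

# Decoding the string again recovers the original lists.
decoded = parse_qs(query)
print(decoded["includedDepartments[]"])
```

&lt;p&gt;&lt;code&gt;requests&lt;/code&gt; accepts the same dictionary through &lt;code&gt;params=&lt;/code&gt; and likewise repeats the key for each list element.&lt;/p&gt;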




&lt;h2&gt;
  
  
  Step 2: Crafting the Request
&lt;/h2&gt;

&lt;p&gt;Below is a sample script using Python's &lt;code&gt;requests&lt;/code&gt; library to query the endpoint. Adjust the parameters and headers as needed, especially if an &lt;code&gt;X-API-KEY&lt;/code&gt; is required.&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;requests&lt;/span&gt;
&lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;json&lt;/span&gt;

&lt;span class="c1"&gt;# 1. Define the endpoint URL
&lt;/span&gt;&lt;span class="n"&gt;url&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.stream.estate/documents/properties&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;

&lt;span class="c1"&gt;# 2. Create the parameters
&lt;/span&gt;&lt;span class="n"&gt;params&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;includedDepartments[]&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;departments/77&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;fromDate&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;2025-01-10&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;propertyTypes[]&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;1&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;    &lt;span class="c1"&gt;# 1 might represent 'apartment'
&lt;/span&gt;    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;transactionType&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;0&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;    &lt;span class="c1"&gt;# 0 might represent 'sale'
&lt;/span&gt;    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;withCoherentPrice&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;true&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;budgetMin&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;100000&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;budgetMax&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;500000&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="c1"&gt;# Focusing on price variation
&lt;/span&gt;    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;eventPriceVariationFromCreatedAt&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;2025-01-01&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;  &lt;span class="c1"&gt;# since the beginning of the year    
&lt;/span&gt;    &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;eventPriceVariationMin&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;10&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;  &lt;span class="c1"&gt;# at least a 10% drop    
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;

&lt;span class="c1"&gt;# 3. Define headers with the API key
&lt;/span&gt;&lt;span class="n"&gt;headers&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
  &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;Content-Type&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;application/json&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;X-API-KEY&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;&amp;lt;your_api_key_here&amp;gt;&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;

&lt;span class="c1"&gt;# 4. Make the GET request
&lt;/span&gt;&lt;span class="n"&gt;response&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;requests&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;get&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;url&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;headers&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;headers&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;params&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="n"&gt;params&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="c1"&gt;# 5. Handle the response
&lt;/span&gt;&lt;span class="k"&gt;if&lt;/span&gt; &lt;span class="n"&gt;response&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;status_code&lt;/span&gt; &lt;span class="o"&gt;==&lt;/span&gt; &lt;span class="mi"&gt;200&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
    &lt;span class="n"&gt;data&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;response&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;json&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
    &lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;json&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;dumps&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;data&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;indent&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="p"&gt;))&lt;/span&gt;
&lt;span class="k"&gt;else&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
    &lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sa"&gt;f&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Request failed with status code &lt;/span&gt;&lt;span class="si"&gt;{&lt;/span&gt;&lt;span class="n"&gt;response&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;status_code&lt;/span&gt;&lt;span class="si"&gt;}&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;
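
&lt;p&gt;The script above prints the raw JSON. To display results in a more concise format, something along these lines could work. This is only a sketch: the response schema is not shown in this post, so the sample payload and every key name (&lt;code&gt;id&lt;/code&gt;, &lt;code&gt;title&lt;/code&gt;, &lt;code&gt;price&lt;/code&gt;, &lt;code&gt;city&lt;/code&gt;) are assumptions to replace with the fields the API actually returns.&lt;/p&gt;

```python
# Hypothetical sample payload: the real response schema may differ,
# so every key name below is an assumption for illustration only.
sample_listings = [
    {"id": "a1", "title": "House in Meaux", "price": 250000, "city": "Meaux"},
    {"id": "b2", "title": "House in Melun", "price": 310000},
]

def summarize(listing):
    """Return a one-line summary, tolerating missing fields."""
    title = listing.get("title", "untitled")
    city = listing.get("city", "unknown city")
    price = listing.get("price")
    price_text = f"{price:,} EUR" if price is not None else "price n/a"
    return f"{listing.get('id', '?')}: {title} ({city}), {price_text}"

for listing in sample_listings:
    print(summarize(listing))
```

&lt;p&gt;Using &lt;code&gt;dict.get&lt;/code&gt; with a default keeps the loop from crashing when a listing lacks a field.&lt;/p&gt;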



&lt;h2&gt;
  
  
  Explanation of Important Parameters
&lt;/h2&gt;

&lt;p&gt;&lt;code&gt;eventPriceVariationMin = '-10'&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;Going by the parameter description, this is the lower bound of the variation range: listings whose price moved by less than -10% (that is, dropped by more than 10%) are left out.&lt;/p&gt;

&lt;p&gt;&lt;code&gt;eventPriceVariationMax = '0'&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;This is the upper bound. Setting it to 0 excludes any price increase, so together the two values capture price changes between -10% and 0%. To keep only drops of 10% or more, lower the maximum to -10 instead.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;💡 &lt;strong&gt;Tip:&lt;/strong&gt; Adjust the min/max values to suit your strategy. For instance, -5 and 5 would include price changes within a ±5% range.&lt;/p&gt;
&lt;/blockquote&gt;
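
&lt;p&gt;To make the range idea concrete, here is a small sketch. Both helper names are hypothetical, and the check assumes the API treats the two bounds as inclusive, which this post does not confirm.&lt;/p&gt;

```python
def price_variation_params(min_pct, max_pct):
    """Build the two price-variation filters as query-string values."""
    if min_pct != min(min_pct, max_pct):
        raise ValueError("min_pct must not exceed max_pct")
    return {
        "eventPriceVariationMin": str(min_pct),
        "eventPriceVariationMax": str(max_pct),
    }

def in_range(variation_pct, min_pct, max_pct):
    """True when a variation lies between both bounds (assumed inclusive)."""
    clamped = min(max(variation_pct, min_pct), max_pct)
    return clamped == variation_pct

print(price_variation_params(-10, 0))
for variation in (-15, -4, 0, 3):
    print(variation, in_range(variation, -10, 0))
```

&lt;p&gt;With bounds of -10 and 0, a 4% drop falls inside the range, while a 15% drop and a 3% increase fall outside it.&lt;/p&gt;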

&lt;h2&gt;
  
  
  Potential Pitfalls &amp;amp; Considerations
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Authentication&lt;/strong&gt;: Always ensure you’re using valid API keys. Some APIs also have rate limits or usage quotas.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Error Handling&lt;/strong&gt;: Handle cases where the API is down or parameters are invalid.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Data Validation&lt;/strong&gt;: The API might return incomplete data for some listings. Always check for missing fields.
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Date Formats&lt;/strong&gt;: Make sure date parameters such as &lt;code&gt;fromDate&lt;/code&gt; and &lt;code&gt;eventPriceVariationFromCreatedAt&lt;/code&gt; are in a format the API recognizes (e.g., &lt;code&gt;YYYY-MM-DD&lt;/code&gt;).
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Large Datasets&lt;/strong&gt;: If the API returns hundreds or thousands of listings, you might need pagination. Check the API docs for pagination parameters like &lt;code&gt;page&lt;/code&gt; or &lt;code&gt;limit&lt;/code&gt;.&lt;/li&gt;
&lt;/ol&gt;
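
&lt;p&gt;For the pagination point, one possible shape for the loop is sketched below. The helper &lt;code&gt;fetch_all&lt;/code&gt; is hypothetical, and it takes the page-fetching function as an argument so it can be tried offline. In a real script that function would wrap &lt;code&gt;requests.get&lt;/code&gt; with a page parameter (its actual name is an assumption to check in the API docs), a &lt;code&gt;timeout&lt;/code&gt;, and a call to &lt;code&gt;response.raise_for_status()&lt;/code&gt; so HTTP errors are not silently ignored.&lt;/p&gt;

```python
def fetch_all(fetch_page, max_pages=50):
    """Collect items page by page until an empty page is returned.

    fetch_page is any callable taking a 1-based page number and
    returning a list of items; max_pages caps the number of calls.
    """
    items = []
    for page in range(1, max_pages + 1):
        batch = fetch_page(page)
        if not batch:
            break
        items.extend(batch)
    return items

# Stand-in for the real API call, so the loop can be tried offline.
fake_pages = {1: ["a", "b"], 2: ["c"]}
print(fetch_all(lambda page: fake_pages.get(page, [])))
```

&lt;p&gt;The &lt;code&gt;max_pages&lt;/code&gt; cap is a safety net against looping forever and against burning through a rate limit.&lt;/p&gt;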




&lt;h2&gt;
  
  
  Wrap-Up
&lt;/h2&gt;

&lt;p&gt;Now you have a &lt;strong&gt;basic Python script&lt;/strong&gt; to &lt;strong&gt;scrape real estate data&lt;/strong&gt;, focusing on &lt;strong&gt;properties that have seen a drop in price&lt;/strong&gt;. This approach can be extremely powerful if you’re looking to &lt;strong&gt;invest&lt;/strong&gt; in real estate or if you simply want to &lt;strong&gt;track market trends&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;As always, tailor the &lt;strong&gt;parameters&lt;/strong&gt; to your specific needs. You can &lt;strong&gt;expand&lt;/strong&gt; this script to sort results by price, integrate advanced analytics, or even plug the data into a &lt;strong&gt;machine learning&lt;/strong&gt; model for deeper insights.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Happy scraping, and may you find that hidden gem!&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Further Reading
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;&lt;a href="https://docs.python-requests.org/" rel="noopener noreferrer"&gt;Python Requests Documentation&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://real-estate-api.net" rel="noopener noreferrer"&gt;Real estate data API comparison&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://stream.estate" rel="noopener noreferrer"&gt;Stream Estate API&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a href="https://gist.github.com/patpohler/36c731113fd113418c0806f62cbb9e30" rel="noopener noreferrer"&gt;Gist of real estate data APIs&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>python</category>
      <category>scraping</category>
      <category>realestate</category>
    </item>
  </channel>
</rss>
