<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: XIANGWEIXIAO</title>
    <description>The latest articles on DEV Community by XIANGWEIXIAO (@_f751e50d019c73e72371f).</description>
    <link>https://dev.to/_f751e50d019c73e72371f</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3882908%2F4603e291-3e08-4173-8dc0-05a543089b4a.png</url>
      <title>DEV Community: XIANGWEIXIAO</title>
      <link>https://dev.to/_f751e50d019c73e72371f</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/_f751e50d019c73e72371f"/>
    <language>en</language>
    <item>
      <title>DuckDB in the Wild: What 6 Minutes of Benchmarking Across 4 Machines Taught Me About Real-World Performance</title>
      <dc:creator>XIANGWEIXIAO</dc:creator>
      <pubDate>Thu, 16 Apr 2026 17:04:49 +0000</pubDate>
      <link>https://dev.to/_f751e50d019c73e72371f/duckdb-in-the-wild-what-6-minutes-of-benchmarking-across-4-machines-taught-me-about-real-world-3gff</link>
      <guid>https://dev.to/_f751e50d019c73e72371f/duckdb-in-the-wild-what-6-minutes-of-benchmarking-across-4-machines-taught-me-about-real-world-3gff</guid>
      <description>&lt;h2&gt;
  
  
  Character Design
&lt;/h2&gt;

&lt;p&gt;I care more about "can we ship this?" than "is this theoretically optimal?"&lt;/p&gt;

&lt;p&gt;When I pick data tools, I usually ask three questions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Will it run fast enough on the hardware we actually have?&lt;/li&gt;
&lt;li&gt;How much does object storage overhead really cost us?&lt;/li&gt;
&lt;li&gt;Can I estimate these things without building a full test lab?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;With that mindset, I ran the same DuckDB workload across four different machines, comparing local disk against S3-compatible object storage. The goal was not to crown a winner. It was to calibrate my intuition for what "fast enough" looks like in practice.&lt;/p&gt;




&lt;h3&gt;
  
  
  Why I ran these tests
&lt;/h3&gt;

&lt;p&gt;I had two recurring questions while working with DuckDB:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Same SQL, different machine&lt;/strong&gt; — why does it feel so much slower on some boxes? Is it CPU, disk, or something else entirely?&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Local folder vs object storage&lt;/strong&gt; — where does the extra time actually go?&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;So I ran the same job on a MacBook, a Ubuntu server, a Windows workstation, and a small cloud instance. I kept the query identical, the data identical, and the DuckDB version identical. Everything else — CPU, memory, filesystem, network — I let vary.&lt;/p&gt;

&lt;p&gt;This is not science. It is reconnaissance.&lt;/p&gt;




&lt;h3&gt;
  
  
  What I held constant (and what I did not)
&lt;/h3&gt;

&lt;blockquote&gt;
&lt;p&gt;This is not a lab benchmark. It is an engineering gut-check: same query, different machines, different storage. The numbers are messy, the comparisons are unfair, and yet they are surprisingly useful.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Same across all runs:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DuckDB version: &lt;strong&gt;v1.5.2 (Variegata)&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Data scale:

&lt;ul&gt;
&lt;li&gt;Full dataset: ~64.5M rows, ~13.7 GB source, ~3.5 GB output&lt;/li&gt;
&lt;li&gt;Sample dataset: ~18M rows, ~4.0 GB source, ~1.0 GB output&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;li&gt;Job semantics: identical ELT pattern (read → transform → write)&lt;/li&gt;

&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Storage setups:&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Storage&lt;/th&gt;
&lt;th&gt;Deployment&lt;/th&gt;
&lt;th&gt;What this actually tests&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Local disk&lt;/td&gt;
&lt;td&gt;Native filesystem&lt;/td&gt;
&lt;td&gt;Baseline: no network, no protocol overhead&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;MinIO&lt;/td&gt;
&lt;td&gt;Single-node, same machine as DuckDB&lt;/td&gt;
&lt;td&gt;S3 protocol overhead without network latency&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Alibaba Cloud OSS&lt;/td&gt;
&lt;td&gt;Cloud VM via internal network&lt;/td&gt;
&lt;td&gt;Cloud CPU + memory + network combined&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;blockquote&gt;
&lt;p&gt;Important caveat: I did not test "MinIO on a different machine." From experience, cross-machine object storage introduces bandwidth and latency costs that would push the overhead well beyond what I measured here.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h3&gt;
  
  
  Full dataset results: four machines, two storage types
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;How to read the numbers:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Wall time:&lt;/strong&gt; total elapsed time from start to finish&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Rough read MB/s:&lt;/strong&gt; (source GB × 1024) ÷ wall time&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is not raw disk throughput. It includes Parquet parsing, SQL execution, and write operations. Think of it as "end-to-end productivity" rather than hardware specs.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Environment&lt;/th&gt;
&lt;th&gt;Data source&lt;/th&gt;
&lt;th&gt;Wall time&lt;/th&gt;
&lt;th&gt;Rows processed&lt;/th&gt;
&lt;th&gt;Rough MB/s&lt;/th&gt;
&lt;th&gt;Source size&lt;/th&gt;
&lt;th&gt;Output size&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;macOS&lt;/td&gt;
&lt;td&gt;Local disk&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;43s&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;64.5M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~326&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;13.7 GB&lt;/td&gt;
&lt;td&gt;3.5 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ubuntu&lt;/td&gt;
&lt;td&gt;Local disk&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;58s&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;64.5M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~242&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;13.7 GB&lt;/td&gt;
&lt;td&gt;3.5 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Ubuntu&lt;/td&gt;
&lt;td&gt;MinIO (same machine)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;62s&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;64.5M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~226&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;13.7 GB&lt;/td&gt;
&lt;td&gt;3.5 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Windows&lt;/td&gt;
&lt;td&gt;Local disk&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;331s&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;64.5M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~42&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;13.7 GB&lt;/td&gt;
&lt;td&gt;3.5 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h4&gt;
  
  
  How I interpret these
&lt;/h4&gt;

&lt;p&gt;&lt;strong&gt;macOS vs Ubuntu (both local disk)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The 15-second gap (43s vs 58s) likely comes from CPU microarchitecture and memory bandwidth. The Ubuntu box runs an older Xeon; the MacBook has a newer M-series chip with better vectorization. Same disk type, different compute muscle.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Ubuntu local vs MinIO (same machine)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;58s → 62s is about a &lt;strong&gt;7% overhead&lt;/strong&gt;. That is the cost of S3 protocol parsing, HTTP client work, and MinIO's request handling — without any network latency. If MinIO were on a separate machine, expect significantly more.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Windows local disk (331s)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;This number is an outlier, and I treat it as such. Anti-malware scanning, background services, power management policies, and NTFS characteristics can all drag down I/O on Windows. I would not generalize from this single data point without testing in a clean environment first.&lt;/p&gt;




&lt;h3&gt;
  
  
  Sample dataset results: cloud + OSS vs MacBook
&lt;/h3&gt;

&lt;p&gt;This comparison is intentionally unfair. I am putting it here anyway because unfair comparisons happen in the real world all the time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Cloud VM specs:&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Attribute&lt;/th&gt;
&lt;th&gt;Value&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;vCPU&lt;/td&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Threads&lt;/td&gt;
&lt;td&gt;4 (2 cores × hyperthreading)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Memory&lt;/td&gt;
&lt;td&gt;~7.3 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Disk&lt;/td&gt;
&lt;td&gt;Cloud SSD (ESSD class)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Network&lt;/td&gt;
&lt;td&gt;Internal VPC to OSS&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Comparison baseline:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;MacBook — Apple M5, 10 cores, 32 GB RAM, local NVMe (APFS)&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Environment&lt;/th&gt;
&lt;th&gt;Data source&lt;/th&gt;
&lt;th&gt;Wall time&lt;/th&gt;
&lt;th&gt;Rows processed&lt;/th&gt;
&lt;th&gt;Rough MB/s&lt;/th&gt;
&lt;th&gt;Source size&lt;/th&gt;
&lt;th&gt;Output size&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Debian (2 vCPU cloud)&lt;/td&gt;
&lt;td&gt;OSS (internal)&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;61s&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;18.0M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~67&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;4.0 GB&lt;/td&gt;
&lt;td&gt;1.0 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;macOS&lt;/td&gt;
&lt;td&gt;Local disk&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;23s&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;18.0M&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~178&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;4.0 GB&lt;/td&gt;
&lt;td&gt;1.0 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h4&gt;
  
  
  How I interpret this
&lt;/h4&gt;

&lt;p&gt;The cloud instance is &lt;strong&gt;~2.7× slower&lt;/strong&gt; — but that is not a story about "cloud is slow." It is a story about &lt;strong&gt;2 vCPU + 7 GB RAM + network&lt;/strong&gt; versus &lt;strong&gt;10 cores + 32 GB RAM + local NVMe&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;I did not profile deeply enough to separate CPU time from I/O wait from network latency. So I cannot tell you which factor matters most. What I can tell you: if someone hands you a 2-core cloud VM and asks for a performance estimate, 2–3× slower than a modern laptop is a reasonable working assumption.&lt;/p&gt;




&lt;h3&gt;
  
  
  Hardware inventory
&lt;/h3&gt;

&lt;p&gt;For those who want to reproduce or sanity-check:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Machine&lt;/th&gt;
&lt;th&gt;OS&lt;/th&gt;
&lt;th&gt;Kernel&lt;/th&gt;
&lt;th&gt;CPU&lt;/th&gt;
&lt;th&gt;Cores / Threads&lt;/th&gt;
&lt;th&gt;RAM&lt;/th&gt;
&lt;th&gt;Primary storage&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;MacBook&lt;/td&gt;
&lt;td&gt;macOS 26.4&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Apple M5&lt;/td&gt;
&lt;td&gt;10 / 10&lt;/td&gt;
&lt;td&gt;32 GB&lt;/td&gt;
&lt;td&gt;Apple NVMe SSD (APFS)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Workstation&lt;/td&gt;
&lt;td&gt;Windows 10 Pro&lt;/td&gt;
&lt;td&gt;—&lt;/td&gt;
&lt;td&gt;Intel i7-10750H&lt;/td&gt;
&lt;td&gt;6 / 12&lt;/td&gt;
&lt;td&gt;~64 GB&lt;/td&gt;
&lt;td&gt;SK Hynix SSD (NTFS)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Server&lt;/td&gt;
&lt;td&gt;Ubuntu 24.04.3&lt;/td&gt;
&lt;td&gt;6.17.0-19&lt;/td&gt;
&lt;td&gt;Xeon E5-2698 v3&lt;/td&gt;
&lt;td&gt;16 / 32&lt;/td&gt;
&lt;td&gt;~188 GB&lt;/td&gt;
&lt;td&gt;Samsung NVMe (ext4)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cloud VM&lt;/td&gt;
&lt;td&gt;Debian 11&lt;/td&gt;
&lt;td&gt;5.10.0-26&lt;/td&gt;
&lt;td&gt;Xeon Platinum (virtual)&lt;/td&gt;
&lt;td&gt;2 / 4&lt;/td&gt;
&lt;td&gt;~7.3 GB&lt;/td&gt;
&lt;td&gt;Cloud SSD (ESSD)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h3&gt;
  
  
  Minimal reproduction code
&lt;/h3&gt;

&lt;p&gt;Here are the exact patterns I used, in case you want to run your own gut-check.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;From MinIO (S3-compatible, local):&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="n"&gt;INSTALL&lt;/span&gt; &lt;span class="n"&gt;httpfs&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="k"&gt;LOAD&lt;/span&gt; &lt;span class="n"&gt;httpfs&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;

&lt;span class="k"&gt;CREATE&lt;/span&gt; &lt;span class="k"&gt;OR&lt;/span&gt; &lt;span class="k"&gt;REPLACE&lt;/span&gt; &lt;span class="n"&gt;SECRET&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;TYPE&lt;/span&gt; &lt;span class="n"&gt;s3&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;KEY_ID&lt;/span&gt; &lt;span class="s1"&gt;'your_access_key'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;SECRET&lt;/span&gt; &lt;span class="s1"&gt;'your_secret_key'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;REGION&lt;/span&gt; &lt;span class="s1"&gt;'us-east-1'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;ENDPOINT&lt;/span&gt; &lt;span class="s1"&gt;'127.0.0.1:9000'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;URL_STYLE&lt;/span&gt; &lt;span class="s1"&gt;'path'&lt;/span&gt;
&lt;span class="p"&gt;);&lt;/span&gt;

&lt;span class="k"&gt;COPY&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;read_csv&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
        &lt;span class="s1"&gt;'s3://bucket/raw/**/*.csv'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;ignore_errors&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;filename&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;union_by_name&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;sample_size&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;200&lt;/span&gt;
    &lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;TO&lt;/span&gt; &lt;span class="s1"&gt;'s3://bucket/output/result.parquet'&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;FORMAT&lt;/span&gt; &lt;span class="n"&gt;PARQUET&lt;/span&gt;&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;From local filesystem:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="k"&gt;COPY&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;read_csv_auto&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
        &lt;span class="s1"&gt;'/path/to/data/**/*.csv'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;ignore_errors&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;filename&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;union_by_name&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;sample_size&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;200&lt;/span&gt;
    &lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;TO&lt;/span&gt; &lt;span class="s1"&gt;'/path/to/output/result.parquet'&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;FORMAT&lt;/span&gt; &lt;span class="n"&gt;PARQUET&lt;/span&gt;&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;From Alibaba Cloud OSS:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight sql"&gt;&lt;code&gt;&lt;span class="n"&gt;INSTALL&lt;/span&gt; &lt;span class="n"&gt;httpfs&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;
&lt;span class="k"&gt;LOAD&lt;/span&gt; &lt;span class="n"&gt;httpfs&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;

&lt;span class="k"&gt;CREATE&lt;/span&gt; &lt;span class="k"&gt;OR&lt;/span&gt; &lt;span class="k"&gt;REPLACE&lt;/span&gt; &lt;span class="n"&gt;SECRET&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;TYPE&lt;/span&gt; &lt;span class="n"&gt;s3&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;KEY_ID&lt;/span&gt; &lt;span class="s1"&gt;'your_oss_key'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;SECRET&lt;/span&gt; &lt;span class="s1"&gt;'your_oss_secret'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;REGION&lt;/span&gt; &lt;span class="s1"&gt;'cn-hangzhou'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;ENDPOINT&lt;/span&gt; &lt;span class="s1"&gt;'oss-cn-hangzhou.aliyuncs.com'&lt;/span&gt;
&lt;span class="p"&gt;);&lt;/span&gt;

&lt;span class="k"&gt;COPY&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="k"&gt;SELECT&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt;
    &lt;span class="k"&gt;FROM&lt;/span&gt; &lt;span class="n"&gt;read_csv&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
        &lt;span class="s1"&gt;'s3://bucket/raw/**/*.csv'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;ignore_errors&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;filename&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;union_by_name&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;true&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
        &lt;span class="n"&gt;sample_size&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;200&lt;/span&gt;
    &lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="k"&gt;TO&lt;/span&gt; &lt;span class="s1"&gt;'s3://bucket/output/result.parquet'&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;FORMAT&lt;/span&gt; &lt;span class="n"&gt;PARQUET&lt;/span&gt;&lt;span class="p"&gt;);&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h3&gt;
  
  
  What I actually learned
&lt;/h3&gt;

&lt;p&gt;Three things stick with me:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Same query, same data, 8× wall-time spread&lt;/strong&gt; — from 43 seconds to 331 seconds. Hardware and environment matter more than I sometimes remember.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Local MinIO adds ~7% overhead&lt;/strong&gt; when co-located. That is low enough that protocol compatibility is worth it for my use cases. Cross-machine MinIO would be a different story.&lt;/p&gt;&lt;/li&gt;
&lt;li&gt;&lt;p&gt;&lt;strong&gt;Unfair comparisons are still useful&lt;/strong&gt; — as long as you label them honestly. A 2-core cloud VM being 2–3× slower than a modern laptop is not surprising, but having a concrete number helps with capacity conversations.&lt;/p&gt;&lt;/li&gt;
&lt;/ol&gt;




&lt;h3&gt;
  
  
  What this means for picking your setup
&lt;/h3&gt;

&lt;p&gt;I would not choose hardware based on these numbers alone. But I would use them to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Set expectations:&lt;/strong&gt; If someone proposes a 2-core cloud instance for heavy batch work, 60+ seconds for 4 GB of data is a realistic starting assumption.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Justify local object storage:&lt;/strong&gt; A same-machine MinIO instance is cheap enough to run and compatible enough to integrate that the 7% tax is usually acceptable.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Flag outliers early:&lt;/strong&gt; That 331-second Windows run tells me "something else is going on here" — whether I investigate further depends on whether that machine actually matters for production.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>database</category>
      <category>dataengineering</category>
      <category>performance</category>
      <category>sql</category>
    </item>
  </channel>
</rss>
