<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: trantanhau</title>
    <description>The latest articles on DEV Community by trantanhau (@trantanhau).</description>
    <link>https://dev.to/trantanhau</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3979126%2F60edddd9-3158-46ba-bfae-31d9d613e321.jpeg</url>
      <title>DEV Community: trantanhau</title>
      <link>https://dev.to/trantanhau</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/trantanhau"/>
    <language>en</language>
    <item>
      <title>How I built privacy-first file tools that run AI models directly in your browser</title>
      <dc:creator>trantanhau</dc:creator>
      <pubDate>Thu, 11 Jun 2026 08:56:32 +0000</pubDate>
      <link>https://dev.to/trantanhau/how-i-built-privacy-first-file-tools-that-run-ai-models-directly-in-your-browser-1557</link>
      <guid>https://dev.to/trantanhau/how-i-built-privacy-first-file-tools-that-run-ai-models-directly-in-your-browser-1557</guid>
      <description>&lt;p&gt;Most online file tools work the same way: you upload your file, their server processes it, they send it back. Simple — but every upload is a privacy risk.&lt;/p&gt;

&lt;p&gt;I wanted to build something different. The result is &lt;a href="https://localmedia-kit.com" rel="noopener noreferrer"&gt;LocalMediaKit&lt;/a&gt;: 25+ file processing tools, most of which process your files without them ever leaving your browser.&lt;/p&gt;

&lt;p&gt;Here's what I learned building it.&lt;/p&gt;

&lt;h2&gt;Running AI models in the browser&lt;/h2&gt;

&lt;p&gt;The background removal tool uses U2-Net-P (a lightweight variant of U2-Net) via ONNX Runtime Web. The model runs entirely in the browser — no GPU server, no API calls.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="k"&gt;import&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="k"&gt;as&lt;/span&gt; &lt;span class="nx"&gt;ort&lt;/span&gt; &lt;span class="k"&gt;from&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;onnxruntime-web&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;

&lt;span class="c1"&gt;// Point to custom WASM paths&lt;/span&gt;
&lt;span class="nx"&gt;ort&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;env&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;wasm&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;wasmPaths&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
  &lt;span class="na"&gt;wasm&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;/models/ort-wasm-simd-threaded.wasm&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="na"&gt;mjs&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;/models/ort-wasm-simd-threaded.mjs&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
&lt;span class="p"&gt;};&lt;/span&gt;

&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;session&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;ort&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;InferenceSession&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;create&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;modelBuffer&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
  &lt;span class="na"&gt;executionProviders&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;wasm&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;
&lt;span class="p"&gt;});&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The model is downloaded once and cached in &lt;strong&gt;IndexedDB&lt;/strong&gt; rather than the Service Worker cache, since IndexedDB is better suited to large binary files. Later visits read the model from IndexedDB instead of downloading it again.&lt;/p&gt;
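
&lt;p&gt;As a rough illustration of the caching step above, here is a minimal cache-first loader built on the standard IndexedDB API. It is a sketch, not LocalMediaKit's actual code: the database name &lt;code&gt;model-cache&lt;/code&gt;, the store name &lt;code&gt;models&lt;/code&gt;, and the &lt;code&gt;loadModel&lt;/code&gt; helper are all assumptions made for the example.&lt;/p&gt;

```javascript
// Hypothetical names chosen for this sketch.
const DB_NAME = 'model-cache';
const STORE = 'models';

// Wrap an IDBRequest in a Promise.
function requestToPromise(request) {
  return new Promise((resolve, reject) => {
    request.onsuccess = () => resolve(request.result);
    request.onerror = () => reject(request.error);
  });
}

// Open the database, creating the object store on first use.
function openDb(idb) {
  const request = idb.open(DB_NAME, 1);
  request.onupgradeneeded = () => request.result.createObjectStore(STORE);
  return requestToPromise(request);
}

// Return the cached model bytes, or download and cache them on a miss.
// idb and fetchFn are injectable so the function can run outside a browser.
async function loadModel(url, { idb = globalThis.indexedDB, fetchFn = globalThis.fetch } = {}) {
  const db = await openDb(idb);
  const cached = await requestToPromise(
    db.transaction(STORE, 'readonly').objectStore(STORE).get(url)
  );
  if (cached) return cached;

  const response = await fetchFn(url);
  if (!response.ok) throw new Error(`Model download failed: ${response.status}`);
  const buffer = await response.arrayBuffer();

  // Use a fresh transaction: the read transaction has already finished.
  await requestToPromise(
    db.transaction(STORE, 'readwrite').objectStore(STORE).put(buffer, url)
  );
  return buffer;
}
```

&lt;p&gt;The returned &lt;code&gt;ArrayBuffer&lt;/code&gt; could then be passed to &lt;code&gt;ort.InferenceSession.create&lt;/code&gt; as &lt;code&gt;modelBuffer&lt;/code&gt;. Keying the entry by URL means a renamed model file is downloaded again, which is one simple way to handle model updates.&lt;/p&gt;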

&lt;p&gt;For large images, tile-based processing avoids out-of-memory crashes: the image is split into overlapping tiles, each tile is processed independently, and the results are stitched back together.&lt;/p&gt;
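
&lt;p&gt;The splitting step can be sketched as a small pure function that returns the tile rectangles. This is only one possible layout, under assumed values: the 512-pixel tile size, the 32-pixel overlap, and the name &lt;code&gt;computeTiles&lt;/code&gt; are illustrative and not taken from the tool itself.&lt;/p&gt;

```javascript
// Compute overlapping tile rectangles that cover a width x height image.
// tileSize and overlap are illustrative defaults, not values from the article.
function computeTiles(width, height, tileSize = 512, overlap = 32) {
  if (overlap >= tileSize) throw new Error('overlap must be smaller than tileSize');
  const step = tileSize - overlap;

  // Start offsets along one axis. The final tile is clamped so that it
  // ends exactly at the image edge instead of running past it.
  const starts = (length) => {
    if (tileSize >= length) return [0];
    const offsets = [];
    for (let pos = 0; length > pos + tileSize; pos += step) offsets.push(pos);
    offsets.push(length - tileSize);
    return offsets;
  };

  const tiles = [];
  for (const y of starts(height)) {
    for (const x of starts(width)) {
      tiles.push({
        x,
        y,
        width: Math.min(tileSize, width),
        height: Math.min(tileSize, height),
      });
    }
  }
  return tiles;
}
```

&lt;p&gt;Each rectangle could then be read from a canvas with &lt;code&gt;getImageData(x, y, width, height)&lt;/code&gt;, run through the model, and written back. How the overlapping regions are blended during stitching is a separate design choice that this sketch leaves open.&lt;/p&gt;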

&lt;h2&gt;PDF processing with pdf-lib&lt;/h2&gt;

&lt;p&gt;Merge, split, compress, and sign are all handled client-side with pdf-lib, a pure JavaScript library, so the PDF never leaves the browser. Merging looks like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="k"&gt;import&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt; &lt;span class="nx"&gt;PDFDocument&lt;/span&gt; &lt;span class="p"&gt;}&lt;/span&gt; &lt;span class="k"&gt;from&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;pdf-lib&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt;

&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;merged&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;PDFDocument&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;create&lt;/span&gt;&lt;span class="p"&gt;();&lt;/span&gt;
&lt;span class="k"&gt;for &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;file&lt;/span&gt; &lt;span class="k"&gt;of&lt;/span&gt; &lt;span class="nx"&gt;files&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
  &lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;doc&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;PDFDocument&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;load&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;file&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;arrayBuffer&lt;/span&gt;&lt;span class="p"&gt;());&lt;/span&gt;
  &lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;pages&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;merged&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;copyPages&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;doc&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nx"&gt;doc&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;getPageIndices&lt;/span&gt;&lt;span class="p"&gt;());&lt;/span&gt;
  &lt;span class="nx"&gt;pages&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;forEach&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;page&lt;/span&gt; &lt;span class="o"&gt;=&amp;gt;&lt;/span&gt; &lt;span class="nx"&gt;merged&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;addPage&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;page&lt;/span&gt;&lt;span class="p"&gt;));&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;
&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;output&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;merged&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;save&lt;/span&gt;&lt;span class="p"&gt;();&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;Where I had to compromise&lt;/h2&gt;

&lt;p&gt;Office→PDF and PDF→Word require LibreOffice, which isn't practical to run in the browser, so there is no fully client-side path for these two conversions.&lt;/p&gt;

&lt;p&gt;For these tools, files go through a server I control. Files are processed and immediately discarded, never stored. I'm upfront about this in the UI so users can make an informed choice.&lt;/p&gt;

&lt;h2&gt;Three things I learned&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;1. WASM multithreading headers are tricky.&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;SharedArrayBuffer&lt;/code&gt; (needed for multi-threaded WASM) is only available on cross-origin isolated pages, which typically means sending &lt;code&gt;Cross-Origin-Opener-Policy: same-origin&lt;/code&gt; and &lt;code&gt;Cross-Origin-Embedder-Policy: require-corp&lt;/code&gt;. The catch: these headers can break OAuth popups and third-party embeds. Scope them to the routes that actually need WASM threading rather than applying them globally.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. IndexedDB beats Cache API for large models.&lt;/strong&gt;&lt;br&gt;
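The Cache API is subject to per-origin storage quotas and eviction policies that can remove large model files unexpectedly. IndexedDB gives more control over persistence for large binary assets. Both count against the same origin quota, so requesting persistent storage with &lt;code&gt;navigator.storage.persist()&lt;/code&gt; is still worth doing.&lt;/p&gt;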

&lt;p&gt;&lt;strong&gt;3. Vanilla JS scales better than expected.&lt;/strong&gt;&lt;br&gt;
No hydration overhead, no virtual DOM, no bundle size fighting. The whole app stays lean and fast without a framework.&lt;/p&gt;
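
&lt;p&gt;To make the header scoping from point 1 concrete, here is a minimal, framework-agnostic sketch. The route prefix &lt;code&gt;/tools/remove-background&lt;/code&gt; and the function name &lt;code&gt;isolationHeaders&lt;/code&gt; are hypothetical; the idea is only that the two headers are returned for routes that need threaded WASM and for nothing else.&lt;/p&gt;

```javascript
// Routes that need multi-threaded WASM (hypothetical path for illustration).
const ISOLATED_PREFIXES = ['/tools/remove-background'];

// Return the extra response headers for a given request path.
// Routes outside the list get no isolation headers, so OAuth popups
// and third-party embeds on those pages are unaffected.
function isolationHeaders(pathname) {
  const needsIsolation = ISOLATED_PREFIXES.some(
    (prefix) => pathname === prefix || pathname.startsWith(prefix + '/')
  );
  if (!needsIsolation) return {};
  return {
    'Cross-Origin-Opener-Policy': 'same-origin',
    'Cross-Origin-Embedder-Policy': 'require-corp',
  };
}
```

&lt;p&gt;A server or edge function would merge the returned object into its response headers. On the page itself, the global &lt;code&gt;crossOriginIsolated&lt;/code&gt; flag reports whether isolation actually took effect, which is a useful check before enabling threads.&lt;/p&gt;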




&lt;p&gt;If you're curious about the client-side AI pipeline, the tile-based processing approach, or the tradeoffs of running LibreOffice on a controlled server versus fully client-side alternatives, I'm happy to go deeper in the comments.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://localmedia-kit.com" rel="noopener noreferrer"&gt;LocalMediaKit&lt;/a&gt; is free to try.&lt;/p&gt;

</description>
      <category>webassembly</category>
      <category>ai</category>
      <category>javascript</category>
      <category>privacy</category>
    </item>
  </channel>
</rss>
