<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Oleksandr </title>
    <description>The latest articles on DEV Community by Oleksandr  (@__a570829a).</description>
    <link>https://dev.to/__a570829a</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3961576%2Faee232fc-d62d-492f-b3f7-cafeb3638d92.png</url>
      <title>DEV Community: Oleksandr </title>
      <link>https://dev.to/__a570829a</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/__a570829a"/>
    <language>en</language>
    <item>
      <title>I built an AI video clip finder that runs 100% in your browser — no uploads, no API, no GPU costs</title>
      <dc:creator>Oleksandr </dc:creator>
      <pubDate>Tue, 16 Jun 2026 06:00:00 +0000</pubDate>
      <link>https://dev.to/__a570829a/i-built-an-ai-video-clip-finder-that-runs-100-in-your-browser-no-uploads-no-api-no-gpu-costs-101o</link>
      <guid>https://dev.to/__a570829a/i-built-an-ai-video-clip-finder-that-runs-100-in-your-browser-no-uploads-no-api-no-gpu-costs-101o</guid>
      <description>&lt;p&gt;Every time I used Opus Clip or Vidyo.ai, the same thought hit me:&lt;br&gt;
I’m paying $20/month to upload my video to someone else’s server,&lt;br&gt;
wait in a queue, and hope their AI finds something useful.&lt;/p&gt;

&lt;p&gt;So I built an alternative that runs entirely in the browser.&lt;br&gt;
No file uploads. No subscriptions. No server costs on my end.&lt;br&gt;
The result is ClipGG’s AI Video Highlights tool —&lt;br&gt;
and in this post I’ll walk through exactly how it works technically.&lt;/p&gt;


&lt;h2&gt;
  
  
  The core problem I was solving
&lt;/h2&gt;

&lt;p&gt;Finding highlights in a long video is genuinely hard to automate well.&lt;br&gt;
The expensive approach: transcribe with Whisper, feed text to GPT-4,&lt;br&gt;
profit. But that requires a backend, API costs, and user uploads.&lt;/p&gt;

&lt;p&gt;I wanted zero server involvement.&lt;br&gt;
That meant doing everything with browser APIs.&lt;/p&gt;


&lt;h2&gt;
  
  
  What actually runs in the browser
&lt;/h2&gt;

&lt;p&gt;The pipeline has four stages:&lt;/p&gt;
&lt;h3&gt;
  
  
  1. File reading — no upload needed
&lt;/h3&gt;


&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;arrayBuffer&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;file&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;arrayBuffer&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;span class="c1"&gt;// The file never leaves the device.&lt;/span&gt;
&lt;span class="c1"&gt;// ArrayBuffer is passed directly to Web Audio API.&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;h3&gt;
  
  
  2. Audio analysis — Web Audio API + Web Worker
&lt;/h3&gt;

&lt;p&gt;I use &lt;code&gt;OfflineAudioContext&lt;/code&gt; to decode audio faster than real-time,&lt;br&gt;
then downsample to 8000–11025 Hz before analysis.&lt;br&gt;
This reduces RAM usage from ~115MB to ~19MB for a 10-minute video.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="c1"&gt;// Decode in a Web Worker so the UI never freezes&lt;/span&gt;
&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;tempCtx&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;OfflineAudioContext&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;44100&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;44100&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;audioBuffer&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;tempCtx&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;decodeAudioData&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;arrayBuffer&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="c1"&gt;// Downsample manually — OfflineAudioContext does NOT resample automatically&lt;/span&gt;
&lt;span class="kd"&gt;function&lt;/span&gt; &lt;span class="nf"&gt;downsample&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;channelData&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nx"&gt;originalRate&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nx"&gt;targetRate&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
  &lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;ratio&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nx"&gt;originalRate&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="nx"&gt;targetRate&lt;/span&gt;
  &lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;output&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;new&lt;/span&gt; &lt;span class="nc"&gt;Float32Array&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nb"&gt;Math&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;floor&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;channelData&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;length&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="nx"&gt;ratio&lt;/span&gt;&lt;span class="p"&gt;))&lt;/span&gt;
  &lt;span class="k"&gt;for &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="kd"&gt;let&lt;/span&gt; &lt;span class="nx"&gt;i&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt; &lt;span class="nx"&gt;i&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt; &lt;span class="nx"&gt;output&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;length&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt; &lt;span class="nx"&gt;i&lt;/span&gt;&lt;span class="o"&gt;++&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;start&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nb"&gt;Math&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;floor&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;i&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="nx"&gt;ratio&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;end&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nb"&gt;Math&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;min&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nb"&gt;Math&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;floor&lt;/span&gt;&lt;span class="p"&gt;((&lt;/span&gt;&lt;span class="nx"&gt;i&lt;/span&gt; &lt;span class="o"&gt;+&lt;/span&gt; &lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="nx"&gt;ratio&lt;/span&gt;&lt;span class="p"&gt;),&lt;/span&gt; &lt;span class="nx"&gt;channelData&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;length&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
    &lt;span class="kd"&gt;let&lt;/span&gt; &lt;span class="nx"&gt;sum&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;0&lt;/span&gt;
    &lt;span class="k"&gt;for &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="kd"&gt;let&lt;/span&gt; &lt;span class="nx"&gt;j&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nx"&gt;start&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt; &lt;span class="nx"&gt;j&lt;/span&gt; &lt;span class="o"&gt;&amp;lt;&lt;/span&gt; &lt;span class="nx"&gt;end&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt; &lt;span class="nx"&gt;j&lt;/span&gt;&lt;span class="o"&gt;++&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="nx"&gt;sum&lt;/span&gt; &lt;span class="o"&gt;+=&lt;/span&gt; &lt;span class="nx"&gt;channelData&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="nx"&gt;j&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;
    &lt;span class="nx"&gt;output&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="nx"&gt;i&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nx"&gt;sum&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;end&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="nx"&gt;start&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
  &lt;span class="p"&gt;}&lt;/span&gt;
  &lt;span class="k"&gt;return&lt;/span&gt; &lt;span class="nx"&gt;output&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  3. Scoring — three audio signals
&lt;/h3&gt;

&lt;p&gt;For each 500ms window I compute:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;RMS&lt;/strong&gt; (Root Mean Square) — average energy/loudness&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;ZCR&lt;/strong&gt; (Zero Crossing Rate) — distinguishes speech from noise&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Volume Peak&lt;/strong&gt; — catches sudden loud moments&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Then I do relative normalization so a quiet podcast&lt;br&gt;
and a loud gaming stream are scored fairly against themselves:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="c1"&gt;// Relative normalization — key insight&lt;/span&gt;
&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;normalizedRms&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;seg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;rms&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="nx"&gt;globalMinRms&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;/&lt;/span&gt; &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;globalMaxRms&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="nx"&gt;globalMinRms&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Different content types use different weights:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Mode&lt;/th&gt;
&lt;th&gt;RMS&lt;/th&gt;
&lt;th&gt;ZCR&lt;/th&gt;
&lt;th&gt;Peak&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Gaming&lt;/td&gt;
&lt;td&gt;0.20&lt;/td&gt;
&lt;td&gt;0.35&lt;/td&gt;
&lt;td&gt;0.20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Podcast&lt;/td&gt;
&lt;td&gt;0.50&lt;/td&gt;
&lt;td&gt;0.05&lt;/td&gt;
&lt;td&gt;0.20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Funny&lt;/td&gt;
&lt;td&gt;0.15&lt;/td&gt;
&lt;td&gt;0.20&lt;/td&gt;
&lt;td&gt;0.35&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;General&lt;/td&gt;
&lt;td&gt;0.30&lt;/td&gt;
&lt;td&gt;0.20&lt;/td&gt;
&lt;td&gt;0.25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  4. Clip selection — diversity + peak centering
&lt;/h3&gt;

&lt;p&gt;The selector groups high-scoring segments into zones,&lt;br&gt;
finds the peak moment in each zone, and centers a 30–90 second&lt;br&gt;
clip around it. A diversity radius of 12 seconds prevents&lt;br&gt;
three clips from covering the same moment.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;combinedSignal&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt;
  &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;seg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;score&lt;/span&gt; &lt;span class="o"&gt;??&lt;/span&gt; &lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;+&lt;/span&gt;
  &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;seg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;energyChange&lt;/span&gt; &lt;span class="o"&gt;??&lt;/span&gt; &lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="mf"&gt;2.0&lt;/span&gt; &lt;span class="o"&gt;+&lt;/span&gt;
  &lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;seg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;volumePeak&lt;/span&gt; &lt;span class="o"&gt;??&lt;/span&gt; &lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="o"&gt;*&lt;/span&gt; &lt;span class="mf"&gt;1.5&lt;/span&gt;

&lt;span class="c1"&gt;// Center the clip around the strongest combined signal,&lt;/span&gt;
&lt;span class="c1"&gt;// not just the loudest sustained section&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  The Safari problem I didn’t expect
&lt;/h2&gt;

&lt;p&gt;Safari on iOS can’t decode video containers&lt;br&gt;
via &lt;code&gt;AudioContext.decodeAudioData()&lt;/code&gt;.&lt;br&gt;
It only accepts clean audio files.&lt;/p&gt;

&lt;p&gt;The fix: detect iOS and pre-extract audio with FFmpeg.wasm&lt;br&gt;
before passing it to the Web Audio API:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;isIOS&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="sr"&gt;/iPhone|iPad|iPod/i&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;test&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nb"&gt;navigator&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;userAgent&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;

&lt;span class="k"&gt;if &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;isIOS&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
  &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;ffmpeg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;exec&lt;/span&gt;&lt;span class="p"&gt;([&lt;/span&gt;
    &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-i&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;input_video&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-vn&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-acodec&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;pcm_s16le&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;  &lt;span class="c1"&gt;// WAV — guaranteed to work on all iOS versions&lt;/span&gt;
    &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-ar&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;44100&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-ac&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;1&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;audio.wav&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;
  &lt;span class="p"&gt;])&lt;/span&gt;
  &lt;span class="c1"&gt;// Pass audio.wav to Web Audio instead of the original video&lt;/span&gt;
&lt;span class="p"&gt;}&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;WAV/PCM is uncompressed and works reliably on every iOS version.&lt;br&gt;
AAC containers are not.&lt;/p&gt;


&lt;h2&gt;
  
  
  Export — FFmpeg.wasm with stream copy
&lt;/h2&gt;

&lt;p&gt;Once highlights are found, FFmpeg.wasm cuts the clips:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="c1"&gt;// Fast path: H.264 + AAC + MP4 = stream copy, no re-encoding&lt;/span&gt;
&lt;span class="c1"&gt;// A 90-second clip exports in ~2–3 seconds&lt;/span&gt;
&lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;ffmpeg&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;exec&lt;/span&gt;&lt;span class="p"&gt;([&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-ss&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;String&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;clip&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;start&lt;/span&gt;&lt;span class="p"&gt;),&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-i&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;input&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-t&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="nc"&gt;String&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="nx"&gt;clip&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;end&lt;/span&gt; &lt;span class="o"&gt;-&lt;/span&gt; &lt;span class="nx"&gt;clip&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nx"&gt;start&lt;/span&gt;&lt;span class="p"&gt;),&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-c&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;copy&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;               &lt;span class="c1"&gt;// copy bytes, don't re-encode&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-avoid_negative_ts&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;make_zero&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;-movflags&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;+faststart&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="nx"&gt;outputName&lt;/span&gt;
&lt;span class="p"&gt;])&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Non-standard formats (MOV, MKV, AV1) get converted to MP4 first&lt;br&gt;
before the analysis pipeline runs. This also fixed all the&lt;br&gt;
“file won’t export” bugs from iPhone footage.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I learned
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;OfflineAudioContext doesn’t resample.&lt;/strong&gt;&lt;br&gt;
I assumed &lt;code&gt;new OfflineAudioContext(1, length, 8000)&lt;/code&gt;&lt;br&gt;
would give me 8kHz audio. It doesn’t.&lt;br&gt;
You get whatever sample rate the source file has.&lt;br&gt;
Downsampling has to be manual.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Transfer, don’t copy ArrayBuffers.&lt;/strong&gt;&lt;br&gt;
&lt;code&gt;worker.postMessage({ arrayBuffer }, [arrayBuffer])&lt;/code&gt;&lt;br&gt;
transfers ownership with zero memory copy.&lt;br&gt;
Without the second argument you’re doubling RAM usage.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;-ss before -i for stream copy, after for re-encode.&lt;/strong&gt;&lt;br&gt;
This one cost me an hour. For &lt;code&gt;-c copy&lt;/code&gt;, seek before input&lt;br&gt;
for speed. For re-encoding, seek after input for frame accuracy.&lt;/p&gt;




&lt;h2&gt;
  
  
  Try it
&lt;/h2&gt;

&lt;p&gt;The tool is live and free at:&lt;br&gt;
👉 &lt;a href="https://clipgg.uk/en/ai-video-highlights" rel="noopener noreferrer"&gt;https://clipgg.uk/en/ai-video-highlights&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Drop a video, pick a mode (Gaming / Podcast / Funny / General),&lt;br&gt;
and get three highlight clips with timestamps in about 30 seconds.&lt;/p&gt;

&lt;p&gt;No account. No upload. Works on desktop Chrome, Firefox,&lt;br&gt;
and now iOS Safari too.&lt;/p&gt;

&lt;p&gt;Curious what others think about the audio scoring approach —&lt;br&gt;
would love feedback on the algorithm in the comments.&lt;/p&gt;

</description>
      <category>javascript</category>
      <category>webdev</category>
      <category>showdev</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>I built 12 free browser-based tools for creators — here's what I learned</title>
      <dc:creator>Oleksandr </dc:creator>
      <pubDate>Sun, 31 May 2026 19:12:10 +0000</pubDate>
      <link>https://dev.to/__a570829a/i-built-12-free-browser-based-tools-for-creators-heres-what-i-learned-5148</link>
      <guid>https://dev.to/__a570829a/i-built-12-free-browser-based-tools-for-creators-heres-what-i-learned-5148</guid>
      <description>&lt;p&gt;A few weeks ago I launched ClipGG — a collection of 12 free &lt;br&gt;
browser-based tools for content creators, video editors, &lt;br&gt;
and writers. No signups, no file uploads, no subscriptions. &lt;br&gt;
Everything runs locally in the browser.&lt;/p&gt;

&lt;p&gt;Here is what I learned building it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why browser-based?
&lt;/h2&gt;

&lt;p&gt;The obvious reason is privacy. When you upload a video to &lt;br&gt;
an online tool, you have no idea where that file goes or &lt;br&gt;
how long it sits on someone's server. With browser-based &lt;br&gt;
processing, the file never leaves your device.&lt;/p&gt;

&lt;p&gt;The less obvious reason is speed. No upload queue, no &lt;br&gt;
server processing time, no waiting. The Web Audio API, &lt;br&gt;
Canvas API, and MediaRecorder handle surprisingly heavy &lt;br&gt;
tasks directly in the tab.&lt;/p&gt;

&lt;h2&gt;
  
  
  What the tools do
&lt;/h2&gt;

&lt;p&gt;The suite covers the small repetitive tasks that slow &lt;br&gt;
down a creator's workflow:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Word &amp;amp; Character Counter&lt;/strong&gt; — real-time word count, 
reading time, speaking time, and keyword density&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SRT Subtitle Cleaner&lt;/strong&gt; — strips timecodes and tags 
from subtitle files, converts to plain text or article format&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SRT ↔ VTT Converter&lt;/strong&gt; — converts between subtitle 
formats for HTML5 video and YouTube&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;YouTube Title Validator&lt;/strong&gt; — previews how your title 
looks in desktop and mobile search before publishing&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Audio Extractor&lt;/strong&gt; — pulls audio from MP4, WebM, 
MKV using the Web Audio API, no upload&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Video Aspect Ratio Resizer&lt;/strong&gt; — crops horizontal video 
to 9:16 for TikTok and Shorts with blur background fill&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AI Video Hook Generator&lt;/strong&gt; — generates scroll-stopping 
opening lines for short-form video&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AI Freelance Email Generator&lt;/strong&gt; — writes cold emails 
and Upwork proposals based on job description&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;AI Content Repurpose Machine&lt;/strong&gt; — turns one piece of 
content into Twitter threads, LinkedIn posts, and more&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Bulk Image Compressor&lt;/strong&gt; — batch compresses JPEG, 
PNG, WebP locally using Canvas&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;YouTube Thumbnail Downloader&lt;/strong&gt; — grabs HD thumbnails 
and extracts dominant color palettes&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Video Teleprompter&lt;/strong&gt; — smooth scrolling prompter with 
mirror mode and webcam overlay&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  One technical thing worth sharing
&lt;/h2&gt;

&lt;p&gt;The Audio Extractor was the hardest to get right. True MP3 &lt;br&gt;
encoding requires a licensed codec that browsers don't &lt;br&gt;
include natively. The output is WebM/Opus — smaller than &lt;br&gt;
WAV, excellent quality, plays in every modern browser and &lt;br&gt;
media player. Renaming to .mp3 works in most players too.&lt;/p&gt;

&lt;p&gt;For the Video Aspect Ratio Resizer, the blur background &lt;br&gt;
effect uses a second canvas layer running the same video &lt;br&gt;
at low resolution with a CSS blur filter, composited &lt;br&gt;
behind the main cropped layer using MediaRecorder. It &lt;br&gt;
runs at full speed on any modern laptop.&lt;/p&gt;

&lt;h2&gt;
  
  
  The stack
&lt;/h2&gt;

&lt;p&gt;Next.js App Router, 16 languages via i18n, all processing &lt;br&gt;
in the browser using native Web APIs. No backend for the &lt;br&gt;
tools themselves.&lt;/p&gt;

&lt;h2&gt;
  
  
  Try it
&lt;/h2&gt;

&lt;p&gt;Everything is free at &lt;a href="https://clipgg.uk" rel="noopener noreferrer"&gt;clipgg.uk&lt;/a&gt; — &lt;br&gt;
no account, no limits.&lt;/p&gt;

&lt;p&gt;Happy to answer questions about any of the browser API &lt;br&gt;
implementations.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frgizfcg3f4lnnqumyrky.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frgizfcg3f4lnnqumyrky.png" alt=" " width="800" height="336"&gt;&lt;/a&gt;&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>javascript</category>
      <category>opensource</category>
      <category>ai</category>
    </item>
  </channel>
</rss>
