How to process 1.5GB+ JSON/CSV files in the browser without crashing the RAM

#performance #architecture #javascript #webdev

Handling massive files in web applications usually leads to the dreaded Out of Memory browser crash. Traditional methods try to read the entire file into a string or buffer, exploding the system RAM.

When building my platform, Big File Tools, I needed a way to format and minify giant enterprise logs and databases (1.5GB+) 100% client-side. Here is the architecture I used to solve memory bottlenecks.

1. Bypassing Memory Buffers with Web Streams API

Instead of using FileReader.readAsText(), which loads the entire file into memory, you must read the file sequentially using binary chunks.

const stream = file.stream();
const reader = stream.getReader();
let { value: chunk, done } = await reader.read();

while (!done) {
  // Process each Uint8Array chunk sequentially here
  const next = await reader.read();
  chunk = next.value;
  done = next.done;
}

2. Offloading Logic to Web Workers

Parsing chunks and applying formatting or minification logic on the main thread will freeze the UI. To maintain 60 FPS, pass the binary chunks directly to a background Web Worker using Transferable Objects to avoid cloning overhead.

// Main Thread
worker.postMessage({ chunk }, [chunk.buffer]);

3. Persistent Storage via IndexedDB

Since you cannot keep gigabytes of formatted strings in JavaScript memory, you need to stream the output into a local storage layer. IndexedDB allows you to store temporary binary blocks or chunks efficiently without crashing the tab. Once the processing is complete, you can generate a local Object URL for the user to download the final file.

Conclusion

Building serverless, 100% client-side tools for heavy data processing is entirely possible today. It keeps infrastructure costs at absolute zero while mathematically guaranteeing user privacy.

If you want to see this architecture in action, check out the live implementation here: Big File Tools