Toon: A Lightweight Data Format That Helps Cut LLM Token Costs

Shish Singh

When working with LLMs, even small details—like how you format your data—can add up to noticeable differences in cost and performance. One of the newer formats people have been experimenting with is called Toon (Token-Oriented Object Notation), and it’s gaining attention because of how compact it is. It conveys structured information like JSON or XML, but with fewer characters, which usually means fewer tokens.

This doesn’t replace established formats. JSON and XML are excellent for APIs, external integrations, and strict data handling. Toon simply offers a lighter alternative specifically for situations where data is being processed inside an LLM prompt or response, where size matters more than strict formal structure.

Below, I’ll walk through what Toon looks like, how to write it, how to create arrays and lists, how nesting works, and how it compares to JSON and XML—using straightforward language so the concepts click easily.


What Is Toon, in Simple Terms?

Toon is a compact way of writing structured information.
If JSON and XML aim for full clarity and standardization, Toon aims for minimal overhead. It focuses on giving the model the data it needs without the extra symbols that traditional formats include for compatibility with programming languages and parsers.

A basic Toon object looks like this:

name:Luna;age:3;color:silver

No quotes, no commas, no braces around the whole thing.
Still understandable, still structured—just lighter.


How to Write Toon Data

Here’s a breakdown of the different building blocks.

1. Basic Toon “objects”

A Toon object is simply a sequence of key:value pairs separated by semicolons:

name:Luna;age:3;color:silver

If a value contains spaces, wrap it in parentheses so it stays together:

title:(Chief Snack Manager)

That’s all you need for a standard object.
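Going the other way is just as mechanical. Here’s a minimal sketch of a parser for flat Toon objects, assuming only the syntax described above (the `parseToonObject` name is my own, not part of any official library):

```typescript
// Parse a flat Toon object like "name:Luna;age:3" into a plain record.
// A sketch based on the syntax shown above, not an official parser.
function parseToonObject(toon: string): Record<string, string> {
  const result: Record<string, string> = {};
  for (const pair of toon.split(";")) {
    const idx = pair.indexOf(":");
    if (idx === -1) continue; // skip malformed pairs
    const key = pair.slice(0, idx).trim();
    let value = pair.slice(idx + 1).trim();
    // Strip the parentheses used to group values that contain spaces.
    if (value.startsWith("(") && value.endsWith(")")) {
      value = value.slice(1, -1);
    }
    result[key] = value;
  }
  return result;
}

console.log(parseToonObject("name:Luna;age:3;color:silver"));
// Every value comes back as a string; numbers are not coerced.
```

Note that Toon carries no type information, so `age` comes back as the string `"3"`; the consumer decides how to interpret it.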


2. Toon Arrays

Arrays in Toon use square brackets and separate items using the pipe symbol |:

pets:[cat|dog|ferret]

More complex items can also be placed inside an array:

tasks:[name:clean;time:10 | name:feed;time:5]

Each item can be its own structured object.
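One wrinkle when reading arrays back: a pipe can also appear inside a nested value, so a naive `split("|")` would break an item like `title:(a | b)` apart. A small depth-aware splitter handles that (a sketch under the syntax assumptions above; `splitToonArray` is my own name):

```typescript
// Split a Toon array body on top-level pipes, ignoring separators that
// sit inside nested (), [], {} or <> groups.
function splitToonArray(body: string): string[] {
  const items: string[] = [];
  let depth = 0;
  let current = "";
  for (const ch of body) {
    if ("([{<".includes(ch)) depth++;
    else if (")]}>".includes(ch)) depth--;
    if (ch === "|" && depth === 0) {
      items.push(current.trim());
      current = "";
    } else {
      current += ch;
    }
  }
  items.push(current.trim());
  return items;
}

console.log(splitToonArray("name:clean;time:10 | name:feed;time:5"));
// Two items: "name:clean;time:10" and "name:feed;time:5"
```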


3. Toon Lists

Toon also supports lists, which behave like arrays but preserve order more explicitly and allow repeated values without any ambiguity.

Lists use angle brackets:

shopping:<milk|eggs|eggs|bread>

Use lists when the exact sequence matters or when duplicates are intentional.


4. Nested Toon Structures

Toon allows nesting using curly braces {}:

user:{name:Luna;stats:{speed:9;stealth:10}}

This keeps nested relationships clear while still avoiding most of the bulk found in JSON or XML.


Toon vs JSON vs XML: What’s the Difference?

All three formats serve a purpose, but they’re shaped by different goals.

XML

XML is very explicit.
It prioritizes structure, clarity, and machine-verified consistency. That’s why it uses opening and closing tags:

<cat>
  <name>Luna</name>
</cat>

Great for document-like data and environments that require strict validation.


JSON

JSON is lighter than XML and is widely used in web APIs:

{ "name": "Luna", "age": 3 }

It’s familiar, readable, and supported everywhere—but it still includes quotes, commas, and braces that add up in token-based contexts.


Toon

Toon takes a different approach. It focuses on reducing the number of characters used to express the same information:

name:Luna;age:3

It keeps things understandable while minimizing overhead.
This makes it practical when your main target is an LLM rather than an external system or parser.


Simple Comparison

| Feature | XML | JSON | Toon |
| --- | --- | --- | --- |
| Typical size | Largest | Medium | Smallest |
| Human-readable | Yes, but verbose | Yes | Yes |
| Best use case | Document standards, external systems | Web APIs, broad app support | LLM prompts and responses |
| Token usage | High | Medium | Low |

Each has strengths; Toon is simply optimized for a different environment.


Why Toon Reduces Token Costs (Clear Example)

Let’s compare the same data in JSON and Toon.

JSON version:

{
  "name": "Luna",
  "age": 3,
  "color": "silver"
}

This includes:

  • 2 curly braces
  • 6 quotation marks
  • 2 commas
  • Extra whitespace
  • Repeated keys in quotes

These all become individual tokens.
A short object like this often lands around 24–28 tokens.


Toon version:

name:Luna;age:3;color:silver

Far fewer symbols: no quotes, no commas, no braces.
This usually ends up around 10–12 tokens.


Scaling the Example

If you had 100 objects of this shape:

  • JSON: ~25 tokens × 100 = 2500 tokens
  • Toon: ~11 tokens × 100 = 1100 tokens

You save about 1400 tokens just by changing the format.
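To put that in money terms, here is a tiny sketch that converts saved tokens into dollars. The $0.15 per million input tokens rate is purely an assumed example for illustration, not any provider's actual pricing:

```typescript
// Convert token savings into an estimated dollar figure.
// ASSUMPTION: $0.15 per 1M input tokens is an illustrative rate only.
const PRICE_PER_MILLION_TOKENS = 0.15;

function costUSD(tokens: number): number {
  return (tokens / 1_000_000) * PRICE_PER_MILLION_TOKENS;
}

// ~1400 tokens saved per 100 objects (from the example above),
// repeated across 10,000 requests:
const savedPerRequest = 1400;
const requests = 10_000;
console.log(costUSD(savedPerRequest * requests).toFixed(2)); // "2.10"
```

Per-object savings are tiny on their own; they only matter at volume, which is exactly the regime these prompt pipelines run in.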

For large prompt-based systems, tool outputs, or inline metadata inside LLM workflows, this can noticeably reduce costs over time.


When Toon Makes Sense (and When It Doesn’t)

Use Toon When:

  • You’re passing structured info into an LLM via a prompt.
  • You need consistent data with minimal token count.
  • You’re building classification, extraction, or reasoning workflows where structure matters but full syntactic formality doesn't.
  • You want to shrink big chunks of repeated data.

Avoid Toon When:

  • The data is part of a public API.
  • You need schema validation or strict typing.
  • You’re sharing the data with external systems that expect JSON or XML.
  • Programmers need long-term maintainability outside LLM-based tools.

Toon isn’t trying to replace the established formats—it’s just optimised for a different environment.


Comparing Token Usage: JSON vs TOON

To understand how much TOON can help reduce LLM prompt costs, we can run a simple token-comparison test using Node.js, TypeScript, and OpenAI’s tiktoken tokenizer.

TOON doesn’t try to replace JSON — JSON is still the best for APIs and data interchange — but inside LLM prompts, the extra characters in JSON (quotes, braces, commas, whitespace) add up quickly.
TOON removes most of that, which makes token usage noticeably smaller.

Below is a working script to compare token usage and calculate efficiency.


Token Comparison Script (Node.js + TypeScript)

This script:

  • Converts JSON → TOON
  • Counts tokens for both
  • Prints the percentage savings
  • Shows you exactly how compact TOON is

compareTokens.ts

import { encoding_for_model } from "tiktoken";

const encoder = encoding_for_model("gpt-4o-mini");

// --- Convert JSON → TOON ---
function jsonToToon(obj: any): string {
  if (Array.isArray(obj)) {
    return `[${obj.map(jsonToToon).join("|")}]`;
  }

  if (typeof obj === "object" && obj !== null) {
    return Object.entries(obj)
      .map(([key, value]) => {
        if (typeof value === "string" && value.includes(" ")) {
          return `${key}:(${value})`;
        } else if (Array.isArray(value)) {
          // Arrays already bring their own [] brackets
          return `${key}:${jsonToToon(value)}`;
        } else if (typeof value === "object" && value !== null) {
          return `${key}:{${jsonToToon(value)}}`;
        }
        return `${key}:${value}`;
      })
      .join(";");
  }

  return String(obj);
}

// --- Count tokens ---
function countTokens(text: string): number {
  return encoder.encode(text).length;
}

// Example JSON
const data = {
  name: "Luna",
  age: 3,
  color: "silver",
  stats: { speed: 9, stealth: 10 },
  pets: ["cat", "dog"]
};

const jsonStr = JSON.stringify(data);
const toonStr = jsonToToon(data);

const jsonTokens = countTokens(jsonStr);
const toonTokens = countTokens(toonStr);

const savings = jsonTokens - toonTokens;
const percentage = ((savings / jsonTokens) * 100).toFixed(2);

console.log("JSON:", jsonStr);
console.log("TOON:", toonStr);

console.log("\nJSON Tokens:", jsonTokens);
console.log("TOON Tokens:", toonTokens);

console.log("\nToken Savings:", savings);
console.log("Efficiency:", percentage + "%");

// Release the WASM-backed encoder once we're done with it.
encoder.free();

Example Output (Based on the Script)

Here’s what results typically look like when comparing the same dataset:

| Format | Token Count | Notes |
| --- | --- | --- |
| JSON | 26 tokens | Includes braces, commas, quotes |
| TOON | 11 tokens | Much smaller, minimal syntax |
| Savings | 15 tokens | Fewer characters used |
| Efficiency | 57.69% reduction | More than half the tokens saved |

Interpretation

This means TOON uses ~58% fewer tokens than JSON for the same information.
Depending on your LLM pricing, savings like this accumulate dramatically when you’re working with:

  • RAG datasets
  • Repeating metadata
  • Tool outputs
  • Multi-step reasoning prompts
  • Bulk classification tasks

Even a difference of ~15 tokens per object becomes thousands of saved tokens across large inputs.


JSON → TOON Conversion Function (Standalone Version)

You may want the converter separately:

function jsonToToon(data) {
  if (Array.isArray(data)) {
    return `[${data.map(jsonToToon).join("|")}]`;
  }

  if (typeof data === "object" && data !== null) {
    return Object.entries(data)
      .map(([key, value]) => {
        if (typeof value === "string" && value.includes(" ")) {
          return `${key}:(${value})`;
        } else if (Array.isArray(value)) {
          // Arrays already bring their own [] brackets
          return `${key}:${jsonToToon(value)}`;
        } else if (typeof value === "object" && value !== null) {
          return `${key}:{${jsonToToon(value)}}`;
        }
        return `${key}:${value}`;
      })
      .join(";");
  }

  return String(data);
}

Measuring Efficiency Across Larger Datasets

You can also test average efficiency across multiple objects:

function compareDataset(dataset: any[]) {
  let totalJSON = 0;
  let totalTOON = 0;

  for (const item of dataset) {
    totalJSON += countTokens(JSON.stringify(item));
    totalTOON += countTokens(jsonToToon(item));
  }

  return {
    totalJSON,
    totalTOON,
    savings: totalJSON - totalTOON,
    efficiency: ((totalJSON - totalTOON) / totalJSON * 100).toFixed(2) + "%"
  };
}

Use this to benchmark real-world data and see consistent savings.
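If you want to try `compareDataset` without installing tiktoken, here is a self-contained variant. It swaps in a rough chars-per-token heuristic (~4 characters per token) as a stand-in for the real tokenizer, plus a simplified flat-object converter, so its numbers are approximate by construction:

```typescript
// Self-contained benchmark sketch. countTokens here is a crude chars/4
// approximation standing in for tiktoken; swap in the real encoder for
// accurate counts.
function countTokens(text: string): number {
  return Math.ceil(text.length / 4);
}

// Simplified converter for flat objects only (see the full version above).
function jsonToToon(data: Record<string, unknown>): string {
  return Object.entries(data)
    .map(([key, value]) => `${key}:${value}`)
    .join(";");
}

function compareDataset(dataset: Record<string, unknown>[]) {
  let totalJSON = 0;
  let totalTOON = 0;
  for (const item of dataset) {
    totalJSON += countTokens(JSON.stringify(item));
    totalTOON += countTokens(jsonToToon(item));
  }
  return {
    totalJSON,
    totalTOON,
    savings: totalJSON - totalTOON,
    efficiency: ((totalJSON - totalTOON) / totalJSON * 100).toFixed(2) + "%"
  };
}

const cats = [
  { name: "Luna", age: 3, color: "silver" },
  { name: "Milo", age: 5, color: "black" }
];
console.log(compareDataset(cats));
```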


Final Thoughts

Toon is a lightweight, practical format that fits well into LLM-focused pipelines. It keeps structure clear but trims away most of the characters that increase token count. JSON and XML still dominate traditional software systems, and they should—they’re reliable and standardised.
But when your goal is to communicate structured data inside an LLM prompt as efficiently as possible, Toon offers a noticeably smaller, cleaner alternative.
