Today we're publicly announcing our open source Logs Explorer template: a free Next.js app + Tinybird backend that you can deploy in less than 5 minutes and use as a simple Datadog alternative, or as a starting point for user-facing logs analytics features (like Vercel Observability or Supabase Query Logs) in your app or service.

You can deploy the template here, check out a live demo with 1.5 billion logs, or fork the repo and modify it locally to fit your use case.
If you want to learn how I built it, keep reading. We recently wrote about how to scale log analytics to trillions of rows and thousands of queries per second. This is a hard problem to solve, so when I build such systems, I like to start with the most basic version. It allows me to iterate faster, test the system end-to-end under real conditions, and then optimize for scale when and where it's needed.
If you want to build a performant logs explorer for yourself or your users, this template is the way to get started and skip all the work I did to get here. Still, if you're interested in my process so you can build a log explorer that scales to trillions of rows and thousands of queries per second, from local to cloud, it's all below.
Who doesn't love a hype video?
The basic process
Here's my basic process distilled into bullet points. I'll expand more below.
The stack
- Tinybird and Vercel
- Next.js
- Cursor.ai and Claude 3.5 Sonnet
The development workflow
- Develop, build and test locally
- Bootstrap with `tb create`
- Use LLMs when it makes sense
- Use version control and CI/CD
Instrumentation
- Use Tinybird’s Events API to stream logs directly via HTTP, and/or
- Use vector.dev as a log collector
The data
- Use one table to store logs
- Sort it by the columns users need to filter
- Support semi-structured metadata with JSON or String columns
The APIs
- Start with basic APIs to validate end to end
- Allow filtering by dimensions and date ranges
- Aggregate metrics
- Implement basic text search and pagination
- Test your APIs with fixtures to maximize coverage
The local dev environment
This is my local setup from scratch.
I use Tinybird local for the data layer:
curl -LsSf https://tbrd.co/fwd | sh
tb local start
tb login
tb dev
Next.js for the webapp layer:
npx create-next-app@latest log-analyzer --typescript --tailwind --eslint
touch .env.local # add the TB_TOKEN and http://localhost:7181
npm run dev
The rest of my tech stack consists of:
- Cursor to assist me with the code, with Claude 3.5 Sonnet as the LLM
- git for version control
- GitHub Actions for CI/CD
- Tinybird and Vercel for cloud deployments
`tb dev` and `npm run dev` are the equivalent commands for each layer: `tb dev` builds the Tinybird project and `npm run dev` builds the Next.js project, both locally. Both builds hot-reload on detected changes to their respective files, allowing for faster feedback loops.
The bootstrap
I bootstrapped the whole project with a single command and this is what it generated:
tb create --prompt "create a logs explorer, include a single data source to store logs and three pipes to have counters, a time series and an API to filter logs and build a table"
» Creating new project structure...
✓ /datasources
✓ /endpoints
✓ /materializations
✓ /copies
✓ /pipes
✓ /fixtures
✓ /tests
✓ Scaffolding completed!
» Creating resources...
✓ /datasources/logs.datasource
✓ /endpoints/generic_counter.pipe
✓ /endpoints/endpoint_monitoring.pipe
✓ /endpoints/log_analysis.pipe
✓ Done!
» Creating CI/CD files for GitHub and GitLab...
✓ /.gitignore
✓ .github/workflows/tinybird-ci.yml
✓ ./.gitlab-ci.yml
✓ .gitlab/tinybird/tinybird-ci.yml
✓ Done!
» Creating .cursorrules...
✓ Done!
» Generating fixtures...
✓ /fixtures/logs
Alternatively, I could have created each resource one by one:
tb create --prompt "create a schema for application logs"
» Creating resources...
✓ /datasources/logs.datasource
✓ Done!
» Generating fixtures...
✓ /fixtures/logs
This, for example, is the logs data source generated:
DESCRIPTION >
Raw logs datasource to store application logs
SCHEMA >
`timestamp` DateTime64(3) `json:$.timestamp`,
`level` LowCardinality(String) `json:$.level`,
`service` LowCardinality(String) `json:$.service`,
`message` String `json:$.message`,
`request_id` String `json:$.request_id`,
`environment` LowCardinality(String) `json:$.environment`,
`status_code` UInt16 `json:$.status_code`,
`response_time` UInt32 `json:$.response_time`,
`request_method` LowCardinality(String) `json:$.request_method`,
`request_path` String `json:$.request_path`,
`host` LowCardinality(String) `json:$.host`,
`user_agent` LowCardinality(String) `json:$.user_agent`,
`metadata` String `json:$.metadata`
ENGINE "MergeTree"
ENGINE_PARTITION_KEY "toYYYYMM(timestamp)"
ENGINE_SORTING_KEY "timestamp, environment, service, level"
ENGINE_TTL "toDateTime(timestamp) + INTERVAL 180 DAY"
I then instructed the LLM to fine tune the schemas. Some examples:
- “choose the right precision for timestamps and numbers and LowCardinality for repeated values”
- “add the most common dimensions to the sorting key to make the queries more efficient: timestamp, environment, service and level”
- “partition by month” (year or day could work as well depending on my volume)
- “add a TTL for data retention”
(you could also do this on your own in the generated files, but I was vibe coding)
Here's my expert advice:
- Tinybird makes it easy to evolve schemas in a declarative way, so don't worry if you get something wrong; you can just change it later.
- Don't over-engineer unless you have a good reason. For instance, I keep all my logs in one table.
- If you need to report on unstructured data, add a JSON or JSON String column to store it.
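For example, if you keep `metadata` as a String column holding raw JSON (as in the generated schema above), you can still pull fields out of it at query time with ClickHouse's JSON functions. A minimal sketch, assuming a hypothetical `user_id` key inside the metadata:

```sql
SELECT
    timestamp,
    message,
    JSONExtractString(metadata, 'user_id') AS user_id
FROM logs
WHERE level = 'ERROR'
  AND JSONExtractString(metadata, 'user_id') != ''
ORDER BY timestamp DESC
LIMIT 50
```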
Once the project was bootstrapped, I built it, grabbed a token, and added the token to my `.env.local` to integrate it with my local Next.js app build:
tb dev
tb token ls
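For reference, my `.env.local` ends up looking something like this (the variable names match the ones used in the Next.js integration below; the values are placeholders):

```
NEXT_PUBLIC_TINYBIRD_API_URL=http://localhost:7181
NEXT_PUBLIC_TINYBIRD_API_KEY=<token copied from tb token ls>
```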
The mock data and fixtures
I used mock data to test my build.
Generating mock data takes a little bit of time, but it's part of the development workflow.
I wanted mock data to:
- Write tests that I could run locally to validate quickly as I changed resources
- Create data fixtures to run tests as part of CI to catch regressions early on
- Have a local data source to test the end-to-end Next.js app integration
I don't have a preferred approach here. `tb create` already generated a logs.ndjson fixture that I could fine-tune and use.
Other options include:
- Use `tb mock --rows 1000 logs` to generate a bigger sample (you can supply a prompt to give it some direction; I've done this with up to a million rows locally with no problem, since it's just an NDJSON file)
- Use Mockingbird for streaming ingestion to a cloud deployment (there's an example schema in the template repo)
- Create a custom data generator (see the sketch after this list). LLMs are perfect for building "throwaway code" like this. You can simulate traffic patterns, seasonality, and other stuff without having to code it yourself.
- Export random data from production. Keep in mind, this can be a bad idea if it means downloading PII to your machine.
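As a sketch of that custom-generator option, the snippet below is the kind of throwaway TypeScript I'd ask an LLM for. The field values are made up to roughly match the logs schema above, and the output is plain NDJSON, just like the generated fixtures.

```typescript
import { writeFileSync } from "node:fs";
import { randomUUID } from "node:crypto";

// Hypothetical throwaway generator: emits NDJSON rows roughly matching the logs schema above.
const services = ["api", "web", "worker"];
const levels = ["INFO", "INFO", "INFO", "WARN", "ERROR"]; // skewed toward INFO
const environments = ["production", "staging"];
const methods = ["GET", "GET", "POST", "PUT", "DELETE"];

const pick = <T>(xs: T[]): T => xs[Math.floor(Math.random() * xs.length)];

const rows = Array.from({ length: 10_000 }, (_, i) => {
  const level = pick(levels);
  return {
    timestamp: new Date(Date.now() - i * 1_000).toISOString(),
    level,
    service: pick(services),
    message: level === "ERROR" ? "Unhandled exception" : "Request completed",
    request_id: randomUUID(),
    environment: pick(environments),
    status_code: level === "ERROR" ? 500 : 200,
    response_time: Math.floor(Math.random() * 800),
    request_method: pick(methods),
    request_path: "/api/items",
    host: "app-01",
    user_agent: "Mozilla/5.0",
    metadata: JSON.stringify({ region: "eu-west-1" }),
  };
});

// One JSON object per line = NDJSON, the same format as the generated fixtures.
writeFileSync("fixtures/logs.ndjson", rows.map((r) => JSON.stringify(r)).join("\n"));
```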
The APIs
The logs explorer UI consists of these components:
- The sidebar (with counts) for easy filtering and aggregation
- The search bar to filter by an arbitrary text match
- The time series chart to show the evolution of the logs over time
- The date range selector to filter by time range
- The table view to show all the logs in a table
- The details drawer to show the details of a specific log
I fed all of these components with just three APIs. Again, I didn't over-engineer at the beginning. I just needed APIs to support the components, and I could optimize them later.
Here's a look at each API that I built:
Sidebar counts
One nice thing about the template is that it dynamically populates the sidebar with dimensions, sorted by the log count for each one.

I created a Tinybird API to add counts for each of the filterable columns in the logs schema, going top-down from more general to more specific filters: environment > service > level > others.
A single pipe can support multiple counters using Tinybird's templating, for example:
TOKEN read_only READ
DESCRIPTION >
'This pipe is used to have a counter for each category of each dimension'
NODE count_attributes
SQL >
%
SELECT
toString({{column(column_name, 'level')}}) as category,
COUNT() as count
FROM
logs
WHERE 1=1
{% if defined(start_date) %}
AND timestamp >= {{DateTime(start_date, '2024-01-01 00:00:00')}}
{% end %}
{% if defined(end_date) %}
AND timestamp <= {{DateTime(end_date, '2024-12-31 23:59:59')}}
{% end %}
GROUP BY {{column(column_name, 'level')}}
ORDER BY count DESC
LIMIT 5
TYPE endpoint
This pipe uses Tinybird's template functions to dynamically request the specific column I want to filter on.
As a reminder, every Tinybird pipe is automatically a REST endpoint. I can query it directly like this:
curl -G '$TB_REGION_HOST/v0/pipes/generic_counter.json' \
-H 'Authorization: Bearer $TB_TOKEN' \
-d 'start_date=2025-01-01 00:00:00' \
-d 'end_date=2025-01-31 23:59:59' \
-d 'column_name=level'
This is just the basics; I'm not supporting all possible filters just yet.
Logs search and exploration
The search bar, table, and details view are all based on the same API that just allows filtering the raw logs on all the various supplied dimensions:
TOKEN read_only READ
DESCRIPTION >
'Analyze logs with filtering capabilities by time range, attributes and message'
NODE log_analysis_node
SQL >
%
SELECT
timestamp,
request_id,
request_method,
status_code,
service,
request_path,
level,
message,
*
FROM logs
WHERE 1=1
{% if defined(start_date) and start_date != '' %}
AND timestamp >= {{DateTime(start_date, '2024-01-01 00:00:00')}}
{% end %}
{% if defined(end_date) and end_date != '' %}
AND timestamp <= {{DateTime(end_date, '2024-12-31 23:59:59')}}
{% end %}
{% if defined(service) and service != '' %}
AND service in {{Array(service)}}
{% end %}
{% if defined(level) and level != '' %}
AND level in {{Array(level)}}
{% end %}
{% if defined(environment) and environment != '' %}
AND environment in {{Array(environment)}}
{% end %}
{% if defined(request_method) and request_method != '' %}
AND request_method in {{Array(request_method)}}
{% end %}
{% if defined(status_code) and status_code != '' %}
AND status_code in {{Array(status_code)}}
{% end %}
{% if defined(request_path) and request_path != '' %}
AND request_path in {{Array(request_path)}}
{% end %}
{% if defined(user_agent) and user_agent != '' %}
AND user_agent in {{Array(user_agent)}}
{% end %}
{% if defined(message) and message != '' %}
AND message like '%{{String(message)}}%'
{% end %}
ORDER BY timestamp DESC
LIMIT {{Int32(page_size, 100)}}
OFFSET {{Int32(page, 0)}} * {{Int32(page_size, 100)}}
TYPE endpoint
I use `Array` parameters to support multiple values for each filter:
{% if defined(environment) and environment != '' %}
AND environment in {{Array(environment)}}
{% end %}
Whenever I make a selection in the sidebar, the date range selector, or the search bar, the API is called with the corresponding parameters (via a React hook) to list the matching rows.
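That hook is essentially a fetch against the pipe endpoint with the active filters as query parameters. A minimal sketch (the hook name and shape are mine, not the template's; debouncing, error handling, and request cancellation are omitted):

```typescript
import { useEffect, useState } from "react";

// Minimal sketch: fetch the log_analysis pipe with the selected filters.
export function useLogs(filters: Record<string, string>) {
  const [rows, setRows] = useState<unknown[]>([]);

  useEffect(() => {
    // e.g. { level: "error", service: "app,web", page: "0", page_size: "100" }
    const params = new URLSearchParams(filters);
    const url = `${process.env.NEXT_PUBLIC_TINYBIRD_API_URL}/v0/pipes/log_analysis.json?${params}`;

    fetch(url, {
      headers: { Authorization: `Bearer ${process.env.NEXT_PUBLIC_TINYBIRD_API_KEY}` },
    })
      .then((res) => res.json())
      .then((json) => setRows(json.data)); // Tinybird pipe responses return rows under `data`
  }, [filters]);

  return rows;
}
```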
Note that this API is not optimized at all. For instance, `LIMIT`, `OFFSET`, and filtering with a `LIKE` expression will generally not be performant at scale, but that's just fine for now. This is just a simple way to query the logs to build the end-to-end logs explorer. I can (and did) optimize it later.
An example of a query with all the filters:
curl -G '$TB_REGION_HOST/v0/pipes/log_analysis.json' \
-H 'Authorization: Bearer $TB_TOKEN' \
-d 'start_date=2025-01-01 00:00:00' \
-d 'end_date=2025-01-31 23:59:59' \
-d 'environment=production,staging' \
-d 'service=app,web' \
-d 'level=error' \
-d 'message=my_message' \
-d 'page=0' \
-d 'page_size=100'
Time series chart
The time series chart is based on a pipe that aggregates the data by time period. I can have multiple metrics and dimensions depending on my needs.

The most basic is a counter of events and errors over time, grouped by hour:
TOKEN read_only READ
DESCRIPTION >
'Monitor error rates and response times by service'
NODE error_monitoring_node
SQL >
%
SELECT
service,
toStartOfHour(timestamp) as hour,
countIf(level = 'ERROR') as error_count,
count() as total_requests,
round(countIf(level = 'ERROR') * 100.0 / count(), 2) as error_rate,
avg(response_time) as avg_response_time,
max(response_time) as max_response_time
FROM logs
WHERE 1=1
{% if defined(start_date) %}
AND timestamp >= {{DateTime(start_date, '2024-01-01 00:00:00')}}
{% end %}
{% if defined(end_date) %}
AND timestamp <= {{DateTime(end_date, '2024-12-31 23:59:59')}}
{% end %}
{% if defined(service) %}
AND service = {{String(service, 'all')}}
{% end %}
{% if defined(environment) %}
AND environment = {{String(environment, 'production')}}
{% end %}
GROUP BY service, hour
ORDER BY hour DESC, error_rate DESC
TYPE endpoint
This is aggregating the raw logs. I know 100% this is not going to scale, but for now it's just fine. Remember, I'm building a working end-to-end logs explorer, not optimizing for scale… yet.
As with my other APIs, I can query it like this:
curl -G '$TB_REGION_HOST/v0/pipes/error_monitoring.json' \
-H 'Authorization: Bearer $TB_TOKEN' \
-d 'start_date=2025-01-01 00:00:00' \
-d 'end_date=2025-01-31 23:59:59'
That's all the APIs I needed for my logs explorer.
On to the Next.js application…
Next.js integration
I used zod-bird to integrate Tinybird APIs (pipes and events) with Next.js. It's a wrapper around the Tinybird APIs that validates pipe results using zod schemas.
This is the generated zod-bird client.
I defined Tinybird tokens using environment variables:
NEXT_PUBLIC_TINYBIRD_API_URL=my_host
NEXT_PUBLIC_TINYBIRD_API_KEY=my_token
This way, I could change them in Vercel without changing the code. (By the way, you can learn more about Tinybird authorization mechanisms for Next.js apps in the Next.js integration guide.)
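I won't paste the whole generated client here, but its shape is roughly the following. This is a minimal sketch based on my understanding of the zod-bird API, with the parameter and row schemas trimmed down; check the client in the repo for the real thing.

```typescript
import { Tinybird } from "@chronark/zod-bird";
import { z } from "zod";

// Rough sketch of a zod-bird client for the log_analysis pipe (schemas abbreviated).
const tb = new Tinybird({
  token: process.env.NEXT_PUBLIC_TINYBIRD_API_KEY!,
  baseUrl: process.env.NEXT_PUBLIC_TINYBIRD_API_URL,
});

export const getLogs = tb.buildPipe({
  pipe: "log_analysis",
  parameters: z.object({
    start_date: z.string().optional(),
    end_date: z.string().optional(),
    level: z.string().optional(),
    page: z.number().optional(),
    page_size: z.number().optional(),
  }),
  data: z.object({
    timestamp: z.string(),
    level: z.string(),
    service: z.string(),
    message: z.string(),
  }),
});

// Usage (roughly): const res = await getLogs({ level: "error", page_size: 100 });
```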
For charts I used @tinybirdco/charts.
Testing the data layer
I wanted full testing coverage of my APIs so that when I eventually optimized them I could be confident that the interface between my data and app layer wouldn't break.
Tinybird simplifies this a lot: I just asked the integrated LLM to create a test and then update the assertions with `tb test create && tb test update`. This generates a test `.yaml` file that can be run locally or during CI.
I supplied a prompt to create more specific tests, for example:
tb test create --prompt "test the error_monitoring endpoint with environment production and day granularity"
For full coverage, I instructed the LLM to create specific tests or fine-tune the test params to match my mock data, and I ran `tb test update` to update the assertions.
See how Tinybird tests are defined here.
CI/CD
I used the GitHub Actions workflows generated by `tb create`.
The process is simple:
- CI: build the project and run tests
- CD: deploy to Tinybird Cloud and Vercel
I usually deploy the Tinybird project first and then the Next.js application, since they have different life cycles, making sure each change is backwards compatible.
Instrumentation
You can obviously use mock data to test, but eventually you want real logs from your app. Whether you generate your own logs or use third-party libraries or services, you need a way to get your logs into the Tinybird `logs` data source.
Here are some strategies to instrument your application logs.
The most basic approach is to have your application send logs directly to the Events API, a piece of data infrastructure that can ingest data at scale over HTTP. Optionally, you can add some buffering and retries, or use a fire-and-forget strategy for logs that are not critical.
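As an illustration, here's a minimal fire-and-forget sketch in TypeScript that posts a single event to the Events API for the `logs` data source (the `TINYBIRD_HOST` and `TINYBIRD_TOKEN` env var names are placeholders; swap in your own):

```typescript
// Minimal fire-and-forget logger: posts one event to Tinybird's Events API.
// TINYBIRD_HOST and TINYBIRD_TOKEN are placeholder env var names.
export function logEvent(event: Record<string, unknown>): void {
  const url = `${process.env.TINYBIRD_HOST}/v0/events?name=logs`;

  fetch(url, {
    method: "POST",
    headers: { Authorization: `Bearer ${process.env.TINYBIRD_TOKEN}` },
    body: JSON.stringify({ timestamp: new Date().toISOString(), ...event }),
  }).catch(() => {
    // Fire and forget: swallow errors so logging never breaks the request path.
  });
}

// Usage:
// logEvent({ level: "ERROR", service: "api", message: "Something went wrong" });
```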
If you prefer a Sidecar or Gateway strategy, you can use Vector.dev as a log collector. Vector.dev integrates with all the popular services and the Tinybird Events API.
Here is an example of using Vector.dev to send logs to Tinybird.
Conclusion
I built a Datadog alternative for logs in less than a day using modern dev tools that play well with LLMs.
The initial end-to-end product wasn't perfect or optimized for scale, but it proved out the project and gave me a good foundation to iterate on and deploy changes as I improved them.
The Logs Explorer template builds on what I shared here, but it is optimized to handle billions of logs and thousands of concurrent user requests. Fork the template and modify it for your use case. (Demo here).
You can dig into the code if you want to see the specific optimizations I applied (hint, they're based on this). Or, check in tomorrow for Part II of this post, where I'll share my specific optimization techniques to scale the template to support billions or even trillions of logs.