DEV Community

mehmet akar
mehmet akar

Posted on • Edited on

13 1 1

Open-Source Deep Research Initiatives

An Open-Source Alternative to OpenAI's Deep Research: Open Deep Research

After OpenAI introduced Deep Research, many open source versions comes up. Actually, there was already some built agents similar to deep research.

Huggingface published a blog post about that and they also run some agents for it.

Other great open implementations of Deep Research emerged from the community, specifically from

Each of these implementations use different libraries for indexing data, browsing the web and querying LLMs.

In this article, I want to mention Nicolas Silberstein Camara for his great action against Openai Deep Research Premium. Here is the details and tutorial.


Open-Source Deep Research: Overview

Open Deep Research is an open-source clone of OpenAI's Deep Research experiment. Unlike OpenAI's proprietary model, this project leverages Firecrawl’s extract + search technology combined with a reasoning model to conduct deep research across the web.

Open-Source Deep Research: Key Features

  • Firecrawl Extract + Search

    • Feeds real-time data to the AI via search.
    • Extracts structured data from multiple websites.
  • Next.js App Router

    • Uses React Server Components (RSCs) for efficient rendering.
    • Supports server-side rendering for performance optimization.
  • AI SDK Integration

    • Supports multiple LLM providers:
    • OpenAI (default: gpt-4o)
    • Anthropic, Cohere, DeepSeek, and more.
  • Advanced UI Components

    • Styled with Tailwind CSS.
    • Uses shadcn/ui with Radix UI for flexible component handling.
  • Data Persistence

    • Uses Vercel Postgres (Neon) for chat history and user data.
    • Stores files efficiently with Vercel Blob.
  • Authentication System

    • Implemented using NextAuth.js for secure user login.

Open Deep Research: Installation Guide (Run Locally)

1. Clone the Repository

Open your terminal and run:

git clone https://github.com/nickscamara/open-deep-research.git
cd open-deep-research
Enter fullscreen mode Exit fullscreen mode

2. Install Dependencies

Install pnpm if not installed:

npm install -g pnpm
Enter fullscreen mode Exit fullscreen mode

Then install all dependencies:

pnpm install
Enter fullscreen mode Exit fullscreen mode

3. Set Up Environment Variables

You'll need to define environment variables in .env using the .env.example file as a reference.

To automatically configure environment variables:

vercel env pull
Enter fullscreen mode Exit fullscreen mode

Or manually create a .env file and include:

OPENAI_API_KEY=your_openai_api_key
FIRECRAWL_API_KEY=your_firecrawl_api_key
AUTH_SECRET=your_auth_secret
Enter fullscreen mode Exit fullscreen mode

⚠️ Do not commit the .env file to avoid exposing sensitive API keys.

4. Run Database Migrations

pnpm db:migrate
Enter fullscreen mode Exit fullscreen mode

5. Start the Application

pnpm dev
Enter fullscreen mode Exit fullscreen mode

Your app should now be running at:

🔗 http://localhost:3000


Alternative Deployment: One-Click Deploy to Vercel

If you prefer not to run it locally, you can deploy to Vercel in one click:

Deploy with Vercel


Model Providers

By default, the project uses OpenAI's GPT-4o.

However, it supports multiple LLM providers via Vercel's AI SDK, including:

  • Anthropic (Claude)
  • Cohere
  • DeepSeek
  • TogetherAI
  • OpenRouter

Switching Models

Modify the .env file:

REASONING_MODEL=deepseek-reasoner
BYPASS_JSON_VALIDATION=true
Enter fullscreen mode Exit fullscreen mode

Adding Model Dependencies

If you want to use a model other than GPT-4o, install the respective dependency.

DeepSeek AI Model

pnpm add @ai-sdk/deepseek
Enter fullscreen mode Exit fullscreen mode

TogetherAI Model

pnpm add @ai-sdk/togetherai
Enter fullscreen mode Exit fullscreen mode

🚨 Check TogetherAI rate limits:

🔗 Rate Limit Info


Reasoning Model Configuration

This project includes a reasoning model for structured outputs such as research analysis, data extraction, and document summarization.

Provider Models Supported Notes
OpenAI gpt-4o, o1, o3-mini Native JSON support
TogetherAI deepseek-ai/DeepSeek-R1 Requires BYPASS_JSON_VALIDATION=true
DeepSeek deepseek-reasoner Requires BYPASS_JSON_VALIDATION=true

Key Notes

  • GPT-4o, o1, o3-mini → Natively support structured JSON outputs.
  • DeepSeek & TogetherAI → Need BYPASS_JSON_VALIDATION=true.
  • If no model is set, it defaults to o1-mini.
  • If an invalid model is chosen, it falls back to o1-mini.

To use DeepSeek as the reasoning model, add this to .env:

REASONING_MODEL=deepseek-reasoner
BYPASS_JSON_VALIDATION=true
Enter fullscreen mode Exit fullscreen mode

Open-Source Deep Research: Conclusion

Open Deep Research by Nicolas Silberstein Camara is a powerful, open-source alternative to OpenAI’s Deep Research.

It allows users to autonomously research the web, retrieve structured data, and leverage multiple AI models.

By following the steps above, you can:
Run it locally

Deploy it to Vercel

Customize model providers

For the latest updates, visit the GitHub Repository.

Developer: Nicolas Silberstein Camara

GitHub Repository: Github

Demo: Live Demo

API Trace View

Struggling with slow API calls?

Dan Mindru walks through how he used Sentry's new Trace View feature to shave off 22.3 seconds from an API call.

Get a practical walkthrough of how to identify bottlenecks, split tasks into multiple parallel tasks, identify slow AI model calls, and more.

Read more →

Top comments (0)

Sentry image

See why 4M developers consider Sentry, “not bad.”

Fixing code doesn’t have to be the worst part of your day. Learn how Sentry can help.

Learn more