DEV Community

eternalsix
eternalsix

Posted on • Originally published at eternalsix.com

AI for note-taking and second brain

Your Second Brain Is Lying to You (And AI Is Making It Worse)

Last Tuesday I had 47 browser tabs open, three Notion databases that hadn't been touched in six weeks, an Obsidian vault with 800 notes I couldn't search effectively, and a voice memo on my phone that said "follow up on the thing." I am supposedly a person who thinks seriously about AI-assisted knowledge work. The irony was not lost on me. The problem isn't that I don't have enough tools — I have too many, each one promising to be the last tool I'll ever need. The real problem is that "AI for note-taking" has become a category stuffed with features that feel powerful in demos and evaporate in daily use. Here's what's actually happening, what's actually broken, and what a real solution architecture looks like.

The Capture Illusion

Every second-brain system starts with capture. Write it down. Voice memo it. Clip it. Screenshot it. The promise of AI is that capture becomes frictionless — just dump everything in and the machine will sort it out.

Except it doesn't. What actually happens is that your capture surface area explodes while your retrieval stays broken. AI transcribes your voice memos instantly, sure. But now you have 200 transcribed voice memos that are semantically searchable but contextually orphaned. They don't connect to the project they were for. They don't connect to the decision that followed. They are just... indexed text floating in a database.

The capture illusion is that more frictionless input equals more useful output. It doesn't. A warehouse full of unlabeled boxes is not a knowledge base. AI has made it easier than ever to fill the warehouse. It has done almost nothing to help you know what's actually in there or why any of it matters.

The builders who have cracked this are the ones who treat capture as a typed event, not a dump operation. When you capture, you're not just recording — you're making a claim about context: this belongs to project X, this is a decision under consideration, this is a reference I will need when doing Y. AI can assist that context-tagging in real time, but only if your tool is designed to prompt for it, not just accept raw input.

Retrieval Is a UX Problem Disguised as an AI Problem

Vector search changed everything and changed nothing. Yes, you can now find notes by semantic meaning rather than exact keywords. That's real and it matters. But most implementations have shipped the technology without solving the actual UX problem: you don't know what you're looking for until you find it.

This is the rediscovery problem. The most valuable thing a second brain can do is surface something you forgot you knew, at the moment it becomes relevant. That's not a search problem. That's a proactive inference problem. And it requires your system to have a model of what you're currently working on, not just a corpus of everything you've ever written.

I've tested this with four major AI note tools over the past eight months. All of them have semantic search. None of them have proactive relevance. They wait for you to ask. But the whole point of a second brain is that it knows things your first brain has dropped. If it waits for you to remember what to ask, you've lost the loop.

The developers building in this space who are ahead are the ones building ambient context engines — something that watches your current work surface and asks "have you considered X from three months ago?" without being asked. It's technically harder than shipping a search bar. It's also the only thing that actually delivers on the second-brain promise.

The Model Context Problem Nobody Talks About

Here's a thing that almost nobody in the note-taking AI discourse addresses honestly: your notes are not a good prompt context by default.

LLMs have context windows. Even with 200k tokens, you cannot stuff your entire knowledge base into every query. So every AI note tool has to make a decision about retrieval-augmented generation — which chunks of your notes does it pull into context when you ask a question? This is a solved problem in the research sense and an unsolved problem in the product sense.

Most tools do naive top-K retrieval: grab the five most semantically similar chunks and jam them in. This works for simple factual recall. It fails completely for synthesis questions like "what do I actually think about microservices based on everything I've written over two years?" because the relevant signal is spread across dozens of notes, none of which individually ranks high enough.

Good RAG for personal knowledge requires a pipeline that understands document type (is this a half-formed thought or a concluded decision?), temporal weighting (how has your thinking evolved?), and project scope filtering (only pull from notes tagged to this initiative). Building that pipeline is real engineering work. Most consumer tools have not done it.

The Workflow Integration Gap

The second brain that lives only in its own app is a hobby, not infrastructure. The reason Roam and Obsidian have cult followings among developers is that they're local-first and API-accessible — they can be wired into actual workflows.

But wiring them in is still manual and brittle. You write a script to sync your standup notes to your task manager. The API changes. Your script breaks. You spend a weekend fixing it instead of doing actual work.

The missing layer is a stable abstraction between your knowledge store and your workflow tools. Something that says: when a note is tagged as a decision, push a summary to the relevant project channel. When a meeting transcript is processed, extract action items and route them to the right assignee. When a research note reaches a conclusion state, archive the source material and surface the conclusion in the appropriate context.

This is not a feature. It's an integration architecture. And it requires someone to build the connective tissue, not just another note-taking surface.

A Framework for Evaluating AI Note Systems

Before you commit to any AI-assisted knowledge tool, run it through these five checks:

  • Context tagging at capture — Does the tool force or strongly encourage you to specify project, type, and intent when capturing? Or does it just accept raw input and promise to figure it out later?
  • Proactive surfacing — Does it surface relevant past notes without you asking, based on what you're currently working on? If the answer is no, it's a search tool, not a second brain.
  • RAG pipeline transparency — Can you understand or configure how it retrieves context for AI queries? Naive top-K is a red flag.
  • Workflow hooks — Does it expose stable webhooks or APIs for routing structured outputs (decisions, action items, conclusions) to downstream tools?
  • Decay and curation model — Does it have any concept of note lifecycle? A system with no archiving or pruning logic will become a landfill.

If a tool fails three or more of these, the AI features are cosmetic. You're paying for a better Evernote.

How AI Handler Approaches This

I've been building AI Handler because I kept running into the same architectural gap across every AI workflow tool I used: they optimize for a single surface (notes, chat, tasks) and treat integration as someone else's problem.

AI Handler is designed around the premise that AI-assisted knowledge work is a pipeline problem, not a feature problem. The architecture starts with structured capture — every input is typed and contextualized on the way in, with AI assistance that prompts for context rather than just accepting the dump. Retrieval is built on a layered RAG pipeline that weights recency, document type, and active project scope before doing semantic similarity ranking. And the integration layer is first-class, not an afterthought — workflow hooks are part of the core schema, not a third-party automation layer you have to build yourself.

The honest build-in-public version of this: it's not done. The proactive surfacing engine is the hardest part and it's still rough. But the core pipeline — typed capture, layered retrieval, workflow routing — is working, and I'm using it daily to manage everything from engineering decisions to content planning.

The goal is a tool that developers and AI power users can actually trust as infrastructure, not just another app they'll abandon in six weeks when the novelty wears off.


AI Handler is the unified AI workflow tool I am building. Launching June 2026. Email ceo@eternalsix.com for beta access.

Top comments (0)