I use NotebookLM as an attack layer before I publish anything.
Not for summaries. Not for research. I upload my draft, let the audio overview run, and listen to two AI hosts interrogate my argument. If the narration exposes a gap — a claim that doesn't land, a section that wanders — I go back and fix it before the article goes live.
It works. Better than re-reading. Better than asking a colleague. The distance of hearing your argument spoken back at you by something that doesn't share your assumptions is genuinely useful.
One thing is missing.
The voice. My voice.
What NotebookLM already does
The hard problems are solved.
NotebookLM ingests PDFs, Google Docs, audio files, YouTube links, and web URLs. It grounds its narration in your sources — it doesn't hallucinate beyond them. It produces coherent two-host audio that actually sounds like a conversation, not a text-to-speech dump. It maintains source fidelity across long documents.
These are not small engineering problems. Source grounding alone eliminates an entire class of failure that plagues every generic AI summarizer. The coherence of the narration — the way the two hosts disagree, interrupt, and redirect — required significant work to build.
Most people use it to summarize research papers. Some use it for meeting notes. I use it to stress-test arguments before they're public.
What nobody is talking about: Google solved the hard part already.
The one feature that changes everything
Personalized voice.
Not a voice clone of a podcast host. Not a generic British narrator. Your voice. Trained on your recordings, matched to your cadence, deployed on your content.
The moment Google ships that, three things happen simultaneously:
Audio overviews stop sounding like someone else reading your work. They start sounding like you, presenting your argument, in your register. The use case expands from "summarize this" to "publish this." And every startup selling AI-powered personal audio becomes a feature comparison in a product Google already owns.
The countdown timers
Let's name them.
ElevenLabs built a genuinely impressive voice cloning product. The quality is real. The API is well-documented. Developers use it. But ElevenLabs' core value proposition — "your voice, anywhere" — is exactly what Google would ship as a NotebookLM toggle. Not a new product. A settings page.
Podcastle sells AI-powered podcast creation with voice cloning and audio cleanup. It's a prosumer tool aimed at creators who want professional audio without a studio. It's also a collection of features that sit inside what NotebookLM already does structurally, minus the voice layer.
Wondercraft is an AI audio platform for turning written content into audio. Good product. Direct overlap with NotebookLM's architecture. One product update away from redundancy.
Descript is the most defensible of the group — it has a video editing layer, a timeline, a collaboration workflow. It isn't purely an audio generation tool. But its AI voice layer, "Overdub," is exactly the feature that would become noise the day Google ships personal voice to NotebookLM.
None of them are bad products. That's not the argument.
The argument is that their moat is a gap Google hasn't prioritized filling. That's a countdown timer, not a competitive advantage.
The Google tax
This pattern has a name. Developers who've been around long enough know it.
Google Workspace has a task manager. It's called Tasks. It's fine. Todoist, TickTick, and Things 3 all exist because Tasks is fine and not great, and "fine" left a gap big enough to build companies in.
Google Calendar handles scheduling. Calendly became a $3 billion company because Calendar doesn't do one specific thing — let other people book time in your calendar without an email thread. One feature. Entire company.
Google Keep exists. Notion exists anyway. The overlap is real; the gap was real enough to matter.
Some of these survive. Calendly survived because the booking workflow is genuinely distinct from what Calendar does natively. Notion survived because it expanded beyond the gap — documents, databases, wikis — before Google closed it.
The AI audio startups don't have that runway. They're not building adjacent to NotebookLM. They're building inside it. Their entire value proposition sits in the gap between what NotebookLM already does and the one feature Google hasn't shipped.
Why Google probably won't ship it next quarter
Let's be honest about the timeline.
Google is not aggressively pursuing this. NotebookLM is a Google Labs product — impressive, genuinely useful, and clearly not the company's primary focus. The team is small relative to the broader Gemini push. Personal voice cloning has real regulatory and ethical baggage — deepfakes, consent, liability — that slows any large company down in ways a startup can ignore.
The AI audio startups might have 18 months. Maybe 24.
But here's the thing about building on a platform gap: the clock isn't running on whether Google ships it. The clock is running on whether the market believes Google will ship it. The moment that belief takes hold — a Google I/O demo, a leak, a product page — the fundraising environment for AI audio personalization startups changes overnight.
They're not being hunted. They're being ignored.
Ignored by Google is its own kind of death sentence.
What this actually means for developers
If you're building in this space, the question isn't "can we build a better voice cloning product than ElevenLabs?" The question is: "What does this product do that would survive a Google I/O keynote?"
The companies that survive the Google tax survive it by expanding beyond the gap before it closes. Calendly survived by becoming a scheduling platform — reminders, routing, integrations — not just a booking link. Notion survived by becoming a workspace, not just a note-taking tool.
AI audio startups that survive will be the ones that embed voice into a workflow Google doesn't own. Video production pipelines. Podcast distribution. Live audio. Language learning. The ones that build deep into a workflow Google has no reason to touch.
The ones building "NotebookLM but with your voice" are the countdown timers.
I still use NotebookLM every time I publish. I still listen to two AI hosts interrogate my arguments while I cook.
I just do it in someone else's voice.
For now.
Top comments (0)