Haripriya Veluchamy

Posted on Apr 4

Everyone Suddenly Said “RAG is Dead”

#ai #machinelearning #algorithms #rag

Lately I keep seeing this everywhere:

“RAG is dead”
“Vector search is outdated”
“Reasoning-based retrieval is the future”

And suddenly… everyone is talking like vector search is useless.

I’m not against the hype. These things happen.

But honestly, this whole idea didn’t just click for me immediately.

Because for me, this problem was already in my head for a long time.

Not because of hype.

Just because of my use case.

What I Was Actually Trying to Figure Out

I read a lot of long tech blogs and architecture posts.

After reading, I always have questions like:

“why did they do this?”
“what’s the tradeoff here?”
“what happens if we change this design?”

So I wanted a system where I can:

paste a document
ask questions
actually get useful answers

At some point I started thinking:

should I just stick with vector RAG?
or should I try something like PageIndex / reasoning-based retrieval?
or even something like Agent-style flow later?

That curiosity is what pushed me to build this.

Not the “RAG is dead” trend.

So I Built a Simple Comparison

Nothing fancy.

Just a small app where:

same document
same question
same model
only retrieval changes

Pipeline 1 — Vector RAG

split document
embed
store in ChromaDB
retrieve top-k
answer

This is what most of us are already doing.

Pipeline 2 — PageIndex

build a tree structure from the document
let the model navigate it
pick relevant sections
answer from that

This felt very different.

Not “searching”.

More like… “reading with guidance”.

What I Noticed

The difference is actually deeper than I expected.

Vector RAG:

find similar chunks

PageIndex:

figure out where the answer should be, then go there

That “where” part is interesting.

One Example

I tested with a Netflix architecture article.

Question:

Why did they use live origin instead of only CDN?

Vector RAG

faster (~7s)
decent answer
but retrieval had some noise

PageIndex

slower (~11s)
answer felt more precise
citations were cleaner

My Honest Take

Vector RAG is not dead.

But…

Blind chunking + embedding + top-k is not enough anymore (at least for some cases).

Where I See the Difference

Vector RAG works well when:

you have multiple documents
you want speed
you just need “good enough” answers

PageIndex works well when:

single long document
structured content
you want cleaner reasoning

What I’m Actually Thinking Now

I don’t think this is:

“one replaces the other”

Feels more like:

both solve different parts of the problem

What I’m more interested in now is:

can I combine them?
use vector search to find documents
then use something like PageIndex inside that?

That feels more practical.

Why I’m Exploring This

For my use case, I’m also thinking about:

can I plug this into an agent flow later?
how does retrieval affect agent decisions?
does better retrieval reduce hallucination in multi-step tasks?

That’s where this is going for me.

Final Thought

Honestly, I didn’t build this to prove anything.

Just to understand.

And one thing became clear very fast:

hype says “X is dead”
reality says “it depends”

If you’re building something similar, I’d really suggest:

Don’t pick a side early.

Test both.

You’ll understand the difference immediately.

Top comments (1)

Rejwan Islam Rizvy • Apr 4

So they serve different purposes. Thanks for testing it out.