DEV Community

Haripriya Veluchamy
Haripriya Veluchamy

Posted on

Everyone Suddenly Said “RAG is Dead”

Lately I keep seeing this everywhere:

“RAG is dead”
“Vector search is outdated”
“Reasoning-based retrieval is the future”

And suddenly… everyone is talking like vector search is useless.

I’m not against the hype. These things happen.

But honestly, this whole idea didn’t just click for me immediately.

Because for me, this problem was already in my head for a long time.

Not because of hype.

Just because of my use case.


What I Was Actually Trying to Figure Out

I read a lot of long tech blogs and architecture posts.

After reading, I always have questions like:

  • “why did they do this?”
  • “what’s the tradeoff here?”
  • “what happens if we change this design?”

So I wanted a system where I can:

  • paste a document
  • ask questions
  • actually get useful answers

At some point I started thinking:

should I just stick with vector RAG?
or should I try something like PageIndex / reasoning-based retrieval?
or even something like Agent-style flow later?

That curiosity is what pushed me to build this.

Not the “RAG is dead” trend.


So I Built a Simple Comparison

Nothing fancy.

Just a small app where:

  • same document
  • same question
  • same model
  • only retrieval changes

Pipeline 1 — Vector RAG

  • split document
  • embed
  • store in ChromaDB
  • retrieve top-k
  • answer

This is what most of us are already doing.


Pipeline 2 — PageIndex

  • build a tree structure from the document
  • let the model navigate it
  • pick relevant sections
  • answer from that

This felt very different.

Not “searching”.

More like… “reading with guidance”.


What I Noticed

The difference is actually deeper than I expected.

Vector RAG:

find similar chunks

PageIndex:

figure out where the answer should be, then go there

That “where” part is interesting.


One Example

I tested with a Netflix architecture article.

Question:

Why did they use live origin instead of only CDN?


Vector RAG

  • faster (~7s)
  • decent answer
  • but retrieval had some noise

PageIndex

  • slower (~11s)
  • answer felt more precise
  • citations were cleaner

My Honest Take

Vector RAG is not dead.

But…

Blind chunking + embedding + top-k is not enough anymore (at least for some cases).


Where I See the Difference

Vector RAG works well when:

  • you have multiple documents
  • you want speed
  • you just need “good enough” answers

PageIndex works well when:

  • single long document
  • structured content
  • you want cleaner reasoning

What I’m Actually Thinking Now

I don’t think this is:

“one replaces the other”

Feels more like:

both solve different parts of the problem

What I’m more interested in now is:

  • can I combine them?
  • use vector search to find documents
  • then use something like PageIndex inside that?

That feels more practical.


Why I’m Exploring This

For my use case, I’m also thinking about:

  • can I plug this into an agent flow later?
  • how does retrieval affect agent decisions?
  • does better retrieval reduce hallucination in multi-step tasks?

That’s where this is going for me.


Final Thought

Honestly, I didn’t build this to prove anything.

Just to understand.

And one thing became clear very fast:

hype says “X is dead”
reality says “it depends”


If you’re building something similar, I’d really suggest:

Don’t pick a side early.

Test both.

You’ll understand the difference immediately.


Top comments (0)