Discussion on: How to Build AI Agents That Actually Remember: Memory Architecture for Production LLM Apps

View post

Good overview. The part I keep coming back to is that "remembering" is only half of the problem.

The harder half is deciding what should be recalled for a specific task.

If the system retrieves too much, the agent gets noisy context.
If it retrieves stale information, the agent becomes confidently wrong.
If it retrieves private information, the agent violates boundaries.
If it retrieves low-authority assumptions, those assumptions may become treated like facts.

So I think production memory needs a curation layer between storage and prompt injection.

The model should not receive memory just because it exists.

It should receive memory because the runtime/context layer decided it is relevant, authorized, current, and useful.