Discussion about this post

User's avatar
Maria Edlenborg Mortensen's avatar

What struck me most is that the hard problem no longer seems to be memory.

It's deciding what deserves to become memory.

A correction is not automatically a lesson.

Some are preferences.

Some are local context.

Some are contradictions.

Some are genuine improvements.

The moment an agent starts learning from feedback, the question becomes:

How does it distinguish between them?

That feels less like a memory problem and more like an epistemic governance problem.

Ikram Rana's avatar

I work with small business owners on AI automation every week and the self-grading trap is real. The agents that actually improve are the ones with an external eval layer. Did you find the agent improved more from structured feedback or from exposure to diverse failure cases?

10 more comments...

No posts

Ready for more?