Vector Memory | designpattern.fyi

Skip to main content

designpattern.fyi

The Blueprint OOP & Design Patterns

The Engine Algorithms & Data Structures

The Guardrails SOLID, DRY, Code Quality

Glossary Agentic AI Terminology

Agent Loop Autonomous AI Patterns

Agent Skills Knowledge Packaging

Agent Memory Persistent Context

Resource Discovery ARD Specification

Explainable AI (xAI) Healthcare XAI Framework

AI Adoption Principles Strategic AI Framework

Healthcare Lakehouse Cloud-Agnostic AI Architecture

Evolving Engineering in AI AI Engineering Disciplines

Ontological Engineering Patterns/anti-patterns for Ontological Engineering

Loop Engineering Engineering Patterns for Agent Loops

Fleet Engineering Agent Orchestration

Agentic Context Engineering Building Self-Improving AI Systems

Prompt Engineering English is a new programming language

Harness Engineering Designing everything around an AI model

Forward Deployed Engineering Shift left to accelerate tangible business impact

Feature Engineering Transforming Raw Data into Predictive Power

Agentic AI Patterns Patterns/anti-patterns for AI Agents

Cloud Architecture AWS, Azure, GCP, K8s

Microservices Distributed Systems

Event-Driven Async & Reactive

Enterprise Integration Message Patterns

Spec-Driven Development Development methodology for AI systems

Total Cost of Ownership Calculate and optimize AI implementation costs

Trade-offs System Decisions

Language Models LLM Patterns

Machine Learning MLOps Architecture

Data Science Data Pipelines

AI Token Economy Cost & Strategy

AI Security Threat Landscape & Risks

OWASP Security Top 10 Security Risks

OWASP LLM LLM Security Top 10

OWASP Agentic AI Agent Security Top 10

OWASP AIVSS AI Vulnerability Scoring System

OWASP Citizen Development Citizen Development Security

Data Protection Privacy & PII

OKF Specification Knowledge Format

Securing AI Agents GDM Safety Framework

Problem Solver Structured Problem Thinking

Statement Builder AI Coding Prompt Generator

Skills Builder Design Agent Skills

Prompt Engineering Interactive Prompt Workspace

Enterprise Pattern Cognitive Agent Patterns

Trip Planner Multi-Agent AI Pipeline

designpattern.fyi

Software Design Catalog

Agentic AI

Back to Catalog

Agentic AI Memory

Vector Memory

Store memories as embeddings in a vector index and retrieve the most semantically similar items at query time — so relevance is judged by meaning, not keyword match or recency.

Vector Memory is the standard scalable memory layer for long-running agents: every memory item is embedded and indexed; at query time, embed the current state or query and retrieve the top-k most semantically similar memories, optionally weighted by recency or salience — the agent surfaces contextually relevant past without needing to read everything.

Intent & Description

🎯 Intent

Store memories as embeddings in a vector index and retrieve the most semantically similar items at query time.

📋 Context

A long-running agent accumulates facts and observations over time. On each step it needs to find the small subset of past items most relevant to the current situation. Relevance is best judged by semantic similarity, not exact term match or chronological recency — “find past notes whose meaning is closest to what’s happening now.”

💡 Solution

Embed and index each memory item. At query time, embed the query (or a summary of current state), retrieve the top-k most similar memories, and prepend to context. Optionally apply decay (boost recent, age old) and salience weighting.

Real-world Use Case

A long-running agent accumulates facts whose relevance is best judged by semantic similarity.
An append-only log would otherwise grow unboundedly without selective retrieval.
An embedding model and vector index can be deployed and maintained.

Source

View Original Source →

📌 TL;DR

Embed every memory and retrieve top-k by semantic similarity — relevance by meaning beats keyword matching and chronological recency for most long-running agent tasks.

Advantages

Semantically relevant past surfaces automatically — no explicit query planning needed.
Scales to memory stores far too large to fit in context.

Disadvantages

Misses purely temporal queries (“what did I do yesterday?”) — vector similarity doesn’t capture chronology.
Embedding drift on model or schema changes can silently degrade retrieval quality.

102 of 329

Steer AGI - Your Codes Reflect!

© 2026 designpattern.fyi. Vibe Coded with ❤️ for modern software engineers by Dr. Amit Puri at OpenAGI