Tool-Result Eviction | designpattern.fyi


Agentic AI Tool Use & Environment

Tool-Result Eviction

Once a tool's raw output is consumed, replace it in context with a one-line marker — reclaim tokens without losing the fact that the call happened.

Intent & Description

🎯 Intent

Stop paying context cost for tool outputs the model already extracted and moved on from.

📋 Context

Your agent calls search, file reads, and API queries, each returning bulky JSON or file contents. The model reads the payload, extracts what it needs, acts — and then that raw payload sits in context for the rest of the session, consuming tokens and attention for no reason.

💡 Solution

After a tool result is consumed, replace the raw payload in context with a short marker: 'read config.yaml: 3 services defined', 'searched docs: no rate-limit setting found'. Keep the marker so the agent doesn’t re-issue the call. Offload the full payload to external storage if it might be needed verbatim again. Apply eviction lazily (oldest-consumed first) or eagerly (immediately after extraction) based on window pressure.
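The solution above can be sketched in a few lines. This is a minimal illustration and not a reference implementation: the names `ToolResult`, `AgentContext`, `mark_consumed`, and `evict_lazily` are hypothetical, an in-memory dict stands in for external storage, and character counts stand in for token counts.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class ToolResult:
    call_id: str
    payload: str
    marker: Optional[str] = None  # one-line summary, set once the result is consumed
    evicted: bool = False

    def render(self) -> str:
        # What the model would see for this result on the next call.
        if self.evicted:
            return f"[evicted {self.call_id}] {self.marker}"
        return self.payload


@dataclass
class AgentContext:
    results: Dict[str, ToolResult] = field(default_factory=dict)
    consumed_order: List[str] = field(default_factory=list)
    offload_store: Dict[str, str] = field(default_factory=dict)  # stand-in for external storage

    def add_result(self, call_id: str, payload: str) -> None:
        self.results[call_id] = ToolResult(call_id, payload)

    def mark_consumed(self, call_id: str, marker: str, eager: bool = False) -> None:
        self.results[call_id].marker = marker
        self.consumed_order.append(call_id)
        if eager:  # eager mode: evict immediately after extraction
            self.evict(call_id)

    def evict(self, call_id: str) -> None:
        result = self.results[call_id]
        if result.evicted or result.marker is None:
            return  # never evict a result that has not been consumed
        self.offload_store[call_id] = result.payload  # keep the verbatim copy
        result.payload = ""
        result.evicted = True

    def size(self) -> int:
        # Character count as a rough proxy for tokens (an assumption).
        return sum(len(r.render()) for r in self.results.values())

    def evict_lazily(self, budget: int) -> None:
        # Lazy mode: oldest-consumed first, stopping once under budget.
        for call_id in self.consumed_order:
            if self.size() <= budget:
                break
            self.evict(call_id)


ctx = AgentContext()
ctx.add_result("c1", "x" * 1000)
ctx.add_result("c2", "y" * 1000)
ctx.add_result("c3", "z" * 1000)
ctx.mark_consumed("c1", "read config.yaml: 3 services defined")
ctx.mark_consumed("c2", "searched docs: no rate-limit setting found")
ctx.evict_lazily(budget=2100)  # evicts c1 only; c2 stays, c3 is not yet consumed
```

In this sketch a result can only be evicted after it has been given a marker, which keeps the "don't evict until the model is done with it" rule enforceable in code. How a real agent decides that a result is consumed is left open here.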

Real-world Use Case

The pattern fits best when:
Tool observations are large relative to the context window.
Most results are consumed once and not needed verbatim again.
Window pressure or per-call cost is a binding constraint.
You can write a faithful one-line marker for each consumed result.

📌 TL;DR

Tool result read? Compress it to a one-liner, keep the payload in external storage. Context stays lean. Don’t evict until you’re sure the model is done with it.

Advantages

Window pressure from bulky observations drops sharply.
Cost and latency per call fall — dead payloads stop being re-sent.
The trace of what was called and concluded survives in the marker.
Signal-to-noise in the window improves.
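The cost advantage can be made concrete with a back-of-the-envelope estimate. The sketch below assumes the full context is re-sent on every model call and ignores provider-side prompt caching, which would change the numbers. The function name and the figures are illustrative, not measurements.

```python
def resent_tokens_saved(payload_tokens: int, marker_tokens: int, remaining_calls: int) -> int:
    """Input tokens no longer re-sent after a payload is replaced by its marker.

    Assumes the whole context is sent on each of the remaining model calls.
    """
    if remaining_calls < 0:
        raise ValueError("remaining_calls must be non-negative")
    return (payload_tokens - marker_tokens) * remaining_calls


# Illustrative numbers: a 4,000-token file read replaced by a 15-token marker,
# with 20 model calls left in the session.
saved = resent_tokens_saved(payload_tokens=4000, marker_tokens=15, remaining_calls=20)
print(saved)  # 79700
```

The saving grows with the number of calls remaining, which is why a payload evicted early in a long session matters more than one evicted near the end.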

Disadvantages

Evicting a result that’s still needed forces a redundant re-call.
A marker that loses a key value can mislead later reasoning.
Deciding when an observation is truly ‘consumed’ is error-prone.
Without offload, an evicted payload needed verbatim later is gone.
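The last risk is the easiest to guard against: write the payload somewhere durable before dropping it from context. A minimal sketch, assuming a local directory as the external store and filesystem-safe call IDs. `PayloadStore`, `offload`, and `rehydrate` are hypothetical names.

```python
import tempfile
from pathlib import Path
from typing import Optional


class PayloadStore:
    """File-backed store for evicted tool payloads (illustrative only)."""

    def __init__(self, root: Path) -> None:
        self.root = root
        self.root.mkdir(parents=True, exist_ok=True)

    def _path(self, call_id: str) -> Path:
        # Assumes call_id contains no path separators.
        return self.root / f"{call_id}.txt"

    def offload(self, call_id: str, payload: str) -> None:
        self._path(call_id).write_text(payload, encoding="utf-8")

    def rehydrate(self, call_id: str) -> Optional[str]:
        # Returns None when the payload was never offloaded, so the caller
        # knows it has to re-issue the tool call instead.
        path = self._path(call_id)
        if not path.exists():
            return None
        return path.read_text(encoding="utf-8")


with tempfile.TemporaryDirectory() as tmp:
    store = PayloadStore(Path(tmp) / "payloads")
    original = '{"services": ["api", "worker", "db"]}'
    store.offload("call-7", original)
    restored = store.rehydrate("call-7")   # verbatim copy comes back
    missing = store.rehydrate("call-99")   # never offloaded
```

Returning `None` for a missing payload keeps the fallback explicit: the agent can re-issue the call and does not have to reason from a marker that may have dropped the value it needs.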


© 2026 designpattern.fyi. Vibe Coded with ❤️ for modern software engineers by Dr. Amit Puri at OpenAGI