Tool Loadout Hot-Swap

Tool Loadout Hot-Swap changes which tools are available between turns of a live run, which destroys cache reuse and leaves the model conditioned on tool definitions that contradict its earlier context.

Intent & Description

🎯 Intent

Dynamically adding or removing tool definitions during a running task to keep the tool set lean as the task evolves.

📋 Context

The team interprets “don’t expose all tools” as “add tools as needed, remove them when done.” It sounds like good hygiene. In practice, every mutation invalidates the KV-cache and leaves the model conditioned on tools that no longer exist, which produces calls to removed tools and contradictory reasoning across the run.
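The cache cost can be illustrated with a small sketch. It assumes the tool definitions are serialized at the front of the prompt, ahead of the conversation turns, which is a common layout but not one this entry specifies. The function names, the tool names, and the use of characters as a stand-in for tokens are all hypothetical choices made for illustration.

```python
import json

def render_prompt(tools, turns):
    """Serialize tool definitions first, then the turns (assumed prompt layout)."""
    header = json.dumps(tools, sort_keys=True)
    return header + "\n" + "\n".join(turns)

def shared_prefix_len(a, b):
    """Count leading characters two prompts have in common.

    Characters stand in for tokens here; a prefix cache can only reuse
    the part of the new prompt that matches the old one exactly.
    """
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

turns = ["user: find the bug", "assistant: calling search_code", "tool: 3 matches"]
stable = [{"name": "read_file"}, {"name": "search_code"}]
swapped = [{"name": "read_file"}, {"name": "run_tests"}]  # hot-swapped loadout

before = render_prompt(stable, turns)
next_turn = turns + ["user: now fix it"]

# Same palette: the whole previous prompt is still a prefix of the new one.
reuse_stable = shared_prefix_len(before, render_prompt(stable, next_turn))
# Swapped palette: the prompts diverge inside the tool header itself.
reuse_swap = shared_prefix_len(before, render_prompt(swapped, next_turn))

print(f"reusable with stable palette: {reuse_stable} of {len(before)} chars")
print(f"reusable after hot-swap:      {reuse_swap} of {len(before)} chars")
```

With the stable palette the entire previous prompt remains reusable. After the swap, the divergence point falls inside the tool header, so none of the conversation history that follows it can be served from cache.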

💡 Solution

Define the tool palette once at run start and keep it stable for the entire run. To restrict what the model can call in a given state, mask the tool-name tokens during decoding; don’t remove the definition. See tool-loadout (pick the subset at run start, not mid-run), tool-search-lazy-loading, and prompt-caching.
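A minimal sketch of the masking idea follows, under simplifying assumptions: each tool name is treated as a single decoding decision (real tool names usually span several tokens, so a production decoder would constrain a token prefix instead), and the states, tool names, and scores are hypothetical. It is not tied to any particular inference library.

```python
import math

# Fixed for the whole run: the definitions in the prompt never change.
TOOLS = ["read_file", "search_code", "run_tests", "deploy"]

# Hypothetical per-state allow-lists; only the mask varies between states.
ALLOWED_BY_STATE = {
    "investigate": {"read_file", "search_code"},
    "verify": {"read_file", "run_tests"},
    "release": {"deploy"},
}

def mask_tool_logits(logits, state):
    """Set the score of every disallowed tool to -inf so it cannot be chosen."""
    allowed = ALLOWED_BY_STATE[state]
    return {
        name: (score if name in allowed else -math.inf)
        for name, score in logits.items()
    }

def pick_tool(logits, state):
    """Greedy choice over the masked scores."""
    masked = mask_tool_logits(logits, state)
    return max(masked, key=masked.get)

# Made-up raw scores: the model prefers "deploy", which is not allowed yet.
logits = {"read_file": 0.2, "search_code": 1.1, "run_tests": 0.7, "deploy": 2.5}

choice = pick_tool(logits, "investigate")
print(choice)
```

Because `TOOLS` and the serialized definitions never change, the cached prefix stays valid across states. Only the mask applied at decoding time differs, so the model still sees every tool it was told about earlier in the run.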

Real-world Use Case

Never. The cache invalidation and contradicted conditioning are not worth the apparent flexibility.
Pick the tool loadout at run start (tool-loadout) and hold it stable across the entire run.
Constrain tool availability by masking logits during decoding, not by mutating the registry.

📌 TL;DR

Lock the tool palette at run start — use logit masking to restrict tool availability mid-task, not registry mutations.