Partial-Output Salvage | designpattern.fyi

Skip to main content

designpattern.fyi

The Blueprint OOP & Design Patterns

The Engine Algorithms & Data Structures

The Guardrails SOLID, DRY, Code Quality

Glossary Agentic AI Terminology

Agent Loop Autonomous AI Patterns

Agent Skills Knowledge Packaging

Agent Memory Persistent Context

Resource Discovery ARD Specification

Explainable AI (xAI) Healthcare XAI Framework

AI Adoption Principles Strategic AI Framework

Healthcare Lakehouse Cloud-Agnostic AI Architecture

Evolving Engineering in AI AI Engineering Disciplines

Ontological Engineering Patterns/anti-patterns for Ontological Engineering

Loop Engineering Engineering Patterns for Agent Loops

Fleet Engineering Agent Orchestration

Agentic Context Engineering Building Self-Improving AI Systems

Prompt Engineering English is a new programming language

Harness Engineering Designing everything around an AI model

Forward Deployed Engineering Shift left to accelerate tangible business impact

Feature Engineering Transforming Raw Data into Predictive Power

Agentic AI Patterns Patterns/anti-patterns for AI Agents

Cloud Architecture AWS, Azure, GCP, K8s

Microservices Distributed Systems

Event-Driven Async & Reactive

Enterprise Integration Message Patterns

Spec-Driven Development Development methodology for AI systems

Total Cost of Ownership Calculate and optimize AI implementation costs

Trade-offs System Decisions

Language Models LLM Patterns

Machine Learning MLOps Architecture

Data Science Data Pipelines

AI Token Economy Cost & Strategy

AI Security Threat Landscape & Risks

OWASP Security Top 10 Security Risks

OWASP LLM LLM Security Top 10

OWASP Agentic AI Agent Security Top 10

OWASP AIVSS AI Vulnerability Scoring System

OWASP Citizen Development Citizen Development Security

Data Protection Privacy & PII

OKF Specification Knowledge Format

Securing AI Agents GDM Safety Framework

Problem Solver Structured Problem Thinking

Statement Builder AI Coding Prompt Generator

Skills Builder Design Agent Skills

Prompt Engineering Interactive Prompt Workspace

Enterprise Pattern Cognitive Agent Patterns

Trip Planner Multi-Agent AI Pipeline

designpattern.fyi

Software Design Catalog

Agentic AI

Back to Catalog

Agentic AI Cognition & Introspection

Partial-Output Salvage

Stream every model token to an atomic partial file so mid-stream crashes leave a consistent salvage — then surface the recovery status to the model on the next prompt.

Intent & Description

🎯 Intent

Without a partial-output mechanism, a SIGKILL mid-inference loses all tokens that were streaming — minutes of model time and real context gone with no trace.

📋 Context

The agent runs on hardware that occasionally crashes: OOM killer, watchdog timer, deploy restart mid-stream. Per-call inference is long enough that losing a half-finished stream is meaningful. The existing resumption pattern only restores durably written state — not the tokens that were streaming when the kill signal landed.

💡 Solution

Mechanical finite-state machine. On stream start: open partial.tmp, write a start marker with thought-id, timestamp, model ID. On each chunk: append to tmp, periodically os.rename(tmp, partial) for atomicity. On normal stream end: rename to canonical thought path, delete partial. On startup: scan for orphan partial.* files, finalize each with a typed RecoveryStatus enum (RECOVERED_FROM_PARTIAL for hard kill, TIMEOUT_PARTIAL for watchdog timeout). Include last_partial_recovery: in the next prompt’s system context so the model can adjust.

Real-world Use Case

The runtime can SIGKILL the agent mid-stream and that loses meaningful work.
Inference is long enough per call that a partial stream has real salvage value.
The filesystem supports atomic rename in the working directory.

Source

View Original Source →

Advantages

Mid-stream tokens are not lost on hard crash — minutes of inference are recoverable
Typed recovery marker preserves debuggability — the salvage isn’t hidden from the model
Atomic rename keeps the partial file readable and consistent at every moment

Disadvantages

Rename overhead per N chunks is non-zero; chunk size needs tuning
Partials add filesystem clutter if not periodically cleaned up
Recovery status surfaced in the prompt costs tokens every time it fires

46 of 329

Steer AGI - Your Codes Reflect!

© 2026 designpattern.fyi. Vibe Coded with ❤️ for modern software engineers by Dr. Amit Puri at OpenAGI