Extended Thinking | designpattern.fyi

Skip to main content

designpattern.fyi

The Blueprint OOP & Design Patterns

The Engine Algorithms & Data Structures

The Guardrails SOLID, DRY, Code Quality

Glossary Agentic AI Terminology

Agent Loop Autonomous AI Patterns

Agent Skills Knowledge Packaging

Agent Memory Persistent Context

Resource Discovery ARD Specification

Explainable AI (xAI) Healthcare XAI Framework

AI Adoption Principles Strategic AI Framework

Healthcare Lakehouse Cloud-Agnostic AI Architecture

Evolving Engineering in AI AI Engineering Disciplines

Ontological Engineering Patterns/anti-patterns for Ontological Engineering

Loop Engineering Engineering Patterns for Agent Loops

Fleet Engineering Agent Orchestration

Agentic Context Engineering Building Self-Improving AI Systems

Prompt Engineering English is a new programming language

Harness Engineering Designing everything around an AI model

Forward Deployed Engineering Shift left to accelerate tangible business impact

Feature Engineering Transforming Raw Data into Predictive Power

Agentic AI Patterns Patterns/anti-patterns for AI Agents

Cloud Architecture AWS, Azure, GCP, K8s

Microservices Distributed Systems

Event-Driven Async & Reactive

Enterprise Integration Message Patterns

Spec-Driven Development Development methodology for AI systems

Total Cost of Ownership Calculate and optimize AI implementation costs

Trade-offs System Decisions

Language Models LLM Patterns

Machine Learning MLOps Architecture

Data Science Data Pipelines

AI Token Economy Cost & Strategy

AI Security Threat Landscape & Risks

OWASP Security Top 10 Security Risks

OWASP LLM LLM Security Top 10

OWASP Agentic AI Agent Security Top 10

OWASP AIVSS AI Vulnerability Scoring System

OWASP Citizen Development Citizen Development Security

Data Protection Privacy & PII

OKF Specification Knowledge Format

Securing AI Agents GDM Safety Framework

Problem Solver Structured Problem Thinking

Statement Builder AI Coding Prompt Generator

Skills Builder Design Agent Skills

Prompt Engineering Interactive Prompt Workspace

Enterprise Pattern Cognitive Agent Patterns

Trip Planner Multi-Agent AI Pipeline

designpattern.fyi

Software Design Catalog

Agentic AI

Back to Catalog

Agentic AI Reasoning

Extended Thinking

Give the model a private scratchpad to reason deeply before it responds.

Extended Thinking exposes a model-native reasoning buffer — separate from the user-visible response — where the model can explore, backtrack, and self-correct before committing to an answer.

Intent & Description

🎯 Intent

Unlock deeper, multi-step reasoning by giving the model a first-class internal monologue that doesn’t pollute the final output.

📋 Context

Standard CoT mixes reasoning and response in the same token stream, which creates pressure to produce clean, confident-looking output even mid-reasoning. Native thinking blocks remove that pressure — the model can be uncertain, wrong, and self-correcting in the scratchpad.

💡 Solution

Use models with native extended thinking support (Claude extended thinking, o1/o3 reasoning traces). Set a thinking_budget (token cap) appropriate to task complexity. The thinking block is returned separately or stripped from the user response depending on your UX needs. Pair with adaptive-compute-allocation to avoid paying for extended thinking on simple tasks. See also: chain-of-thought, scratchpad, large-reasoning-model-paradigm.

Real-world Use Case

Hard reasoning tasks: multi-step math, complex code generation, strategic planning.
Cases where you want the reasoning visible for audit but not shown to end users.
Tasks where the model needs to explore multiple approaches before committing.

Source

View Original Source →

📌 TL;DR

Let the model think privately first — the answer it gives after is meaningfully better.

Advantages

Significantly improves accuracy on hard benchmarks vs. standard CoT.
Thinking is isolated — model can be uncertain without undermining response confidence.
Thinking budget is tunable — balance cost vs. reasoning depth per task.

Disadvantages

Expensive — thinking tokens count against your bill.
Not all models support it natively; prompting workarounds are imperfect substitutes.
Thinking content can be verbose and hard to parse for downstream use.

170 of 329

Steer AGI - Your Codes Reflect!

© 2026 designpattern.fyi. Vibe Coded with ❤️ for modern software engineers by Dr. Amit Puri at OpenAGI