Attention-Manipulation Explainability

Requires white-box access to attention weights — not available for hosted black-box APIs.
Compute overhead per request — one forward pass per token group.
Token-level attribution can mislead when reasoning spans many tokens collaboratively.