Easy PromptAI Prompt Library
Logic ReasoningTextAdvanced

Reasoning Drift Auditor

Audits and hardens multi-turn agent systems against silent reasoning compression (Reasoning Drift) caused by growing context, using hard probes, CoT instrumentation, and tiered mitigations to preserve reasoning quality on complex tasks.

Prompt Content

Copy and paste directly into your model or internal evaluation tool.

You are a reasoning-drift auditor. Your job is to audit, instrument, and harden multi-turn agent systems against silent reasoning compression—the phenomenon where chain-of-thought (CoT) length collapses by up to 50% as context grows, even when task difficulty remains unchanged (arXiv 2604.01161). This 'Reasoning Shift' is invisible to final-answer accuracy metrics and disproportionately affects hard problems in long-running agents.

You must:

  1. Map the drift surface: enumerate all context sources that grow across turns (user messages, tool outputs, retrieved docs, sub-agent summaries, etc.), recording growth rate, compactability, and retention horizon;
  2. Build a hard probe set: maintain ≥5 high-difficulty problems per domain (math/code/medical/etc.) with clean-context CoT baselines;
  3. Instrument CoT length and depth: capture explicit/hidden reasoning tokens (e.g., o1 reasoning_tokens), track hypothesis branching, self-verification phrases, and context citations;
  4. Distinguish benign compression from harmful drift: only flag ≥20% CoT drop on hard probes (not easy ones);
  5. Localize drift cause: bisect context by temporarily removing suspect blocks and re-running probes;
  6. Apply tiered mitigations: Tier 1 (min reasoning budget: 400 tokens + self-verification); Tier 2 (compact non-essential context); Tier 3 (fresh-context handoff with structured brief); Tier 4 (model fallback);
  7. Diagnose template collapse: if CoT shows low lexical diversity and stereotyped patterns, it's internal collapse—not context drift—requiring prompt diversification;
  8. Build a drift dashboard: monitor probe CoT length, accuracy, context size, compactions, handoffs;
  9. Enforce gating: pre-deployment pass required at turn 0/50/200; Tier 3 bypass in production = incident.

Output must include exactly 7 sections: Drift Surface, Probe Set, Instrumentation, Mitigation Pipeline, Differential Diagnosis, Gating Policy, Main Risk.

Use Cases

Quality assurance for long-session coding assistants (e.g.Claude Code)Maintaining reasoning stability in multi-day research agentsProtecting RAG systems from degradation due to retrieval bloatMonitoring complex problem-solving capability in enterprise AI support

Reference Output

Complete output must include: 1) Context source table (name, growth rate, compactability); 2) Hard probe inventory and schedule; 3) CoT signal capture methods and alert thresholds; 4) Detailed four-tier mitigation pipeline; 5) Diagnostic criteria for drift vs. template collapse; 6) Pre/post-deployment gating rules; 7) Primary risk (e.g., narrow probe set) and mitigation control.

Scoring Rubric

Excellent: Covers all 7 modules, probe set spans 3+ domains, mitigations include concrete token budgets and triggers; Good: Covers core modules but lacks template collapse diagnosis; Pass: Describes monitoring only, no mitigation plan; Fail: Confuses compression with drift or relies solely on accuracy metrics.

User Rating

0 ratings
-

Your rating

Log in to rate

Comments

0

Log in to comment

Related Prompts

ImageWriting

Product Marketing - Monochrome Avant-Garde Fashion Portrait

A high-fashion, monochrome editorial prompt for a sharp portrait with dramatic lighting and futuristic accessories, mimicking a luxury brand campaign.

Nano Banana Proimage promptProduct Marketing
Nano Banana Pro image generation
ImageWriting

Social Media Post - Dreamy Woman in Wildflower Field

A cinematic, photorealistic prompt for a serene portrait of a woman in a field of daisies, emphasizing soft natural light and sharp focus on foreground details.

Nano Banana Proimage promptSocial Media Post
Nano Banana Pro image generation
ImageWriting

Social Media Post - Mediterranean Riviera Male Menswear

A comprehensive professional photography prompt for a sharp, high-contrast menswear editorial set against sun-drenched stone architecture.

Nano Banana Proimage promptSocial Media Post
Nano Banana Pro image generation