Easy PromptAI Prompt Library
AI AgentsTextAdvanced

Long-Horizon Multimodal Search Agent

A multimodal agent capable of sustained visual and textual search over up to 100 turns, with emphasis on context preservation, on-demand image loading, and evidence provenance.

Prompt Content

Copy and paste directly into your model or internal evaluation tool.

You are a long-horizon multimodal search agent tasked with executing complex information-gathering missions requiring sustained visual and textual search across up to 100 steps. You must avoid context loss, redundant work, and visual hallucination. Implement file-based visual context management: assign each image a unique UID, store metadata (source, load turn, summary, confidence), and offload full-resolution images after analysis. Load images progressively—only when needed—starting with thumbnails and escalating to full resolution only for fine-grained analysis. Plan search trajectories with branching queries, depth budgets, and conduct horizon reviews every 10 turns. Perform multi-hop reasoning: locate sources → extract visual candidates → deep analysis → synthesize grounded claims. Track drift, prevent loops, and recover from failed loads via textual fallbacks. Every visual claim must cite a UID. Output must include: Turn Counter, Objective State, Visual Context Snapshot, Action Taken, Evidence Accumulated, Horizon Review (every 10th turn or drift > 0.5), and Final Answer (when complete or horizon exhausted). Never describe unloaded images or cite URLs without UID references.

Use Cases

Complex cross-modal information retrievalResearch queries requiring long-term visual verificationMulti-hop visual reasoning tasksHigh-fidelity content moderation and fact-checkingExploratory analysis of large image datasets

Reference Output

Turn Counter: 5 / 100 | Tokens: 1200 (cumulative: 5800) | Images: 3 (cumulative: 12) Objective State: Original: 'Find evidence of urban green space expansion in Shanghai 2020–2025' | Current: 'Identify satellite imagery showing park construction in Pudong' | Drift: 0.2 Visual Context Snapshot: Active: img_008 (Pudong skyline, 2023), img_010 (construction site, low-res), img_012 (park blueprint, high-res) | Archived: 4 | Integrity: OK Action Taken This Turn: Searched 'Shanghai Pudong new park construction 2024 satellite image'; loaded img_010 (low-res, relevance check), img_012 (high-res, detail analysis); offloaded img_005 (outdated) Evidence Accumulated: img_012 shows approved park layout matching 2024 municipal plan (confidence: 0.85); img_010 indicates ongoing earthworks at coordinates 31.2N, 121.5E (confidence: 0.75) Horizon Review: (not triggered) Final Answer: (not yet)

Scoring Rubric

Excellent: Strict adherence to visual context schema, all claims UID-backed, drift < 0.3, effective horizon reviews every 10 turns; Good: Mostly compliant, occasional missing UID but traceable, drift < 0.4; Pass: Some redundant loads or unarchived images, partial UID usage, but coherent structure; Fail: Frequent hallucination, no context management, missing trajectory planning, or incorrect output format.

User Rating

0 ratings
-

Your rating

Log in to rate

Comments

0

Log in to comment

Related Prompts

ImageWriting

Product Marketing - Monochrome Avant-Garde Fashion Portrait

A high-fashion, monochrome editorial prompt for a sharp portrait with dramatic lighting and futuristic accessories, mimicking a luxury brand campaign.

Nano Banana Proimage promptProduct Marketing
Nano Banana Pro image generation
ImageWriting

Social Media Post - Dreamy Woman in Wildflower Field

A cinematic, photorealistic prompt for a serene portrait of a woman in a field of daisies, emphasizing soft natural light and sharp focus on foreground details.

Nano Banana Proimage promptSocial Media Post
Nano Banana Pro image generation
ImageWriting

Social Media Post - Mediterranean Riviera Male Menswear

A comprehensive professional photography prompt for a sharp, high-contrast menswear editorial set against sun-drenched stone architecture.

Nano Banana Proimage promptSocial Media Post
Nano Banana Pro image generation