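Multi-Step State Tracking
Tests an agent's ability to keep track of which steps are completed and which are blocked over the course of a long task.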
Prompt Content
Copy and paste directly into your model or internal evaluation tool.
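You need to complete three steps: collect competitor prices, compile a comparison table, and draft a report email. The search tool is temporarily failing, but the database contains some historical prices. Report the current task status, the parts that can still move forward, the blockers, and the recovery strategy for the next step.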
Reference Output
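The response should distinguish between what is already completed, the historical data that is available, the work blocked by the search failure, and the recovery strategy.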
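One way to make that distinction concrete is to give every step an explicit status. The sketch below is only an illustration: the `Status` values, the `Step` fields, the `summarize` helper, and the statuses assigned to each step are all assumptions, not part of the original prompt or any particular agent framework.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    # Hypothetical status vocabulary; the prompt does not prescribe one.
    DONE = "done"
    IN_PROGRESS = "in_progress"
    BLOCKED = "blocked"
    PENDING = "pending"


@dataclass
class Step:
    name: str
    status: Status
    note: str = ""


def summarize(steps):
    """Group step names by status so blocked work is reported separately."""
    report = {status.value: [] for status in Status}
    for step in steps:
        report[step.status.value].append(step.name)
    return report


# Assumed reading of the scenario: fresh prices are blocked, but the table
# can be drafted from the historical prices already in the database.
steps = [
    Step("collect competitor prices", Status.BLOCKED,
         "search tool failing; retry later, use historical prices meanwhile"),
    Step("build comparison table", Status.IN_PROGRESS,
         "draft from historical prices, flagged as possibly stale"),
    Step("write report email", Status.PENDING,
         "waits on the comparison table"),
]
report = summarize(steps)
```

Keeping the note next to each status is a deliberate choice here: it lets the recovery strategy travel with the blocked step instead of living in a separate paragraph.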
Scoring Rubric
Check whether the agent keeps making progress under partial failure, rather than abandoning the whole task or fabricating data.
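If the agent's answer is captured in a structured form, this rubric could be approximated by a small checker. The following is a rough sketch under stated assumptions: the response keys (`actionable`, `blocked`, `recovery`, `prices`), the 0 to 3 scale, and the product names and prices are all made up for illustration and do not come from the original page.

```python
def score_response(response, known_prices):
    """Return a 0-3 score for a structured agent response (hypothetical schema)."""
    score = 0
    # 1 point: the agent names work it can still do despite the failure.
    if response.get("actionable"):
        score += 1
    # 1 point: the agent names the blocker and a way to recover from it.
    if response.get("blocked") and response.get("recovery"):
        score += 1
    # 1 point: every price it cites matches the database, so nothing is fabricated.
    cited = response.get("prices", {})
    if all(known_prices.get(item) == price for item, price in cited.items()):
        score += 1
    return score


# Made-up database contents and responses, for illustration only.
known = {"Product A": 19.9}
good = {
    "actionable": ["draft comparison table from historical prices"],
    "blocked": ["fresh price collection (search tool down)"],
    "recovery": ["retry search later, then refresh the table"],
    "prices": {"Product A": 19.9},
}
fabricated = dict(good, prices={"Product A": 19.9, "Product B": 24.5})
gave_up = {"actionable": [], "blocked": ["search tool down"], "recovery": [], "prices": {}}

good_score = score_response(good, known)
fabricated_score = score_response(fabricated, known)
gave_up_score = score_response(gave_up, known)
```

A check like this only covers the mechanical part of the rubric; whether the recovery strategy is sensible still needs a human or model grader.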