agent-trajectory-safety-risk-diagnosis
Skillby Dingxingdi
Use this skill when a user wants evaluator data for judging whether an agent’s actions were safe or unsafe based on its interaction record with tools or environments. Trigger it when ordinary users say things like 'check if the agent did something dangerous', 'judge whether the action trail is risky', 'make safety-monitor data from agent logs', or 'find cases where the agent looked helpful but caused harm'. Plain-language examples include: 'evaluate whether the trajectory is safe', 'make judge data for risky tool use', 'analyze unsafe multi-step agent actions', and 'test if the evaluator can spot hidden safety problems in logs'.
Details
- Path
- examples/evol_ability/20260325_170549/profiles/eval/skills/agent-trajectory-safety-risk-diagnosis/skill.md