●builderMixed results suggest you cannot rely on current agent frameworks to resist credential phishing without explicit defensive prompting or guardrails.
●researcherThe OpenClaw test lab provides a reproducible framework for evaluating agent phishing resistance — useful for benchmarking prompt injection and social engineering defenses.