●builderYou can use this dataset to add user-sensitive state detection to GUI automation agents, reducing liability from autonomous actions on sensitive screens.
●researcherProvides a structured benchmark for evaluating when GUI agents should defer to users, a currently underexplored safety dimension in agentic systems.