● builder: You can use this dataset to fine-tune or evaluate agents on privacy-sensitive tool calls and message-sharing decisions.
● researcher: Provides a human-annotated benchmark for contextual privacy alignment that avoids proxy-label pitfalls common in prior work.
● policy: Worth watching because it operationalizes contextual integrity norms into a measurable alignment benchmark for agentic systems.