
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation

May 31, 2026

MCP-Persona benchmarks LLM agents on personalized MCP tool use across social and enterprise applications like Reddit and Xiaohongshu, filling a gap left by existing benchmarks that focus on generic information-retrieval tools. The benchmark simulates real user environments with individual accounts and local databases, testing agents on tasks that require interacting with personal data rather than public APIs.

paper

SOURCE

https://huggingface.co/papers/2606.02470
