[HUGGINGFACE]score: 0.32
MCP-Persona: Benchmarking LLM Agents on Real-World Personal Applications via Environment Simulation
May 31, 2026
MCP-Persona benchmarks LLM agents on personalized MCP tool use across social and enterprise applications like Reddit and Xiaohongshu, filling a gap left by existing benchmarks that focus on generic information-retrieval tools. The benchmark simulates real user environments with individual accounts and local databases, testing agents on tasks that require interacting with personal data rather than public APIs.
paper