Comparative One-Shot Benchmarking of Claude Sonnet 5 and Competitors
July 1, 2026
A single-prompt benchmark evaluates Claude Sonnet 5, Fable 5, GPT-5.5, and Gemini by tasking them to generate a Three.js cyberpunk scene. The test measures the ability of each model to produce a complete, functional 3D environment with specific camera movements in a single attempt.
HOW THIS AFFECTS YOU
●
builderYou can use these visual coding benchmarks to select the best model for generative front-end tasks.
●
researcherThis provides qualitative data on zero-shot code generation capabilities across frontier models.