GLM-5.2 (Max) beats the Opus series on Code Arena: Frontend with higher win shares

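Why it was picked

GLM-5.2 beat the Claude Opus series on frontend tasks, with win rates of 61.0% against Kimi-K2.6 and 59.4% against Sonnet 4.6, a strong showing for an open-source model.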

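AI summary

GLM-5.2 (Max) ranks second on Code Arena: Frontend, behind only Fable 5, and ahead of Claude Opus 4.8 (Thinking) and Opus 4.7 (Thinking). Its head-to-head win rates are 61.0% against Kimi-K2.6, 59.4% against Sonnet 4.6, and 55.0% against Opus 4.7 (Thinking). The closest matchups were against GPT-5.5 (xHigh) (41.7% vs 40.0%) and Opus 4.6 (47.0% vs 42.4%). It tied its predecessor, GLM-5.1 (45.5% vs 45.5%). It ranks first in several subcategories, including Brand & Marketing and Data & Analytics.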
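The paired win shares quoted above do not add up to 100%. The short Python sketch below works out the leftover share for the three closest matchups, along with the win rate among decided votes only. The function names are hypothetical, and treating the leftover as ties or votes with no preference is an assumption; the source does not say how Code Arena accounts for it.

```python
# Win shares (percent) as quoted in the summary: (GLM-5.2 (Max), opponent).
PAIRINGS = {
    "GPT-5.5 (xHigh)": (41.7, 40.0),
    "Opus 4.6": (47.0, 42.4),
    "GLM-5.1": (45.5, 45.5),
}


def remainder_share(win: float, loss: float) -> float:
    """Share of votes won by neither side (assumed: ties or no preference)."""
    return round(100.0 - win - loss, 1)


def decisive_win_rate(win: float, loss: float) -> float:
    """Win rate counting only the votes that picked one of the two models."""
    return round(100.0 * win / (win + loss), 1)


if __name__ == "__main__":
    for opponent, (win, loss) in PAIRINGS.items():
        print(
            f"vs {opponent}: remainder {remainder_share(win, loss)}%, "
            f"decisive win rate {decisive_win_rate(win, loss)}%"
        )
```

Under that assumption, the GPT-5.5 (xHigh) matchup leaves 18.3% of votes undecided, so the 1.7-point lead amounts to roughly 51% of decided votes.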

Original post (lmarena.ai)

How did GLM-5.2 (Max) get to the top of Code Arena: Frontend? Looking at matched head-to-head on real-world web dev frontend tasks, @Zai_org 's latest model takes a higher win share than its opponent in every pairing…
