GLM-5.2 (Max) 在 Code Arena 前端排名第二，超越 Claude Opus 4.7

精选理由

GLM-5.2 在编程和智能体任务上超越 Claude Opus 4.7，是开源模型新标杆，编程能力仅次于 Fable 5。

AI 摘要

GLM-5.2 (Max) 在 Code Arena: Frontend 中排名第二，得分比 Claude Opus 4.7 (Thinking) 高 29 分，仅次于 Fable 5。在 Agent Arena 中排名第 10，是排名最高的开源模型，超越 Kimi-K2.6 和 Minimax-M3。在 Brand & Marketing、Reference-Based Design 等 6 个子类别中均排名第一。价格维持 $1.4/$4.4 per input/output MTokens，上下文窗口 1M。与 5.1 相比，排名从 #13 升至 #10，任务成功率和用户评价提升，但 steerability 下降 6%。

AI 翻译 · 中文

lmarena.aiExciting news: GLM-5.2 (Max) ranks #2 in Code Arena: Frontend, with +29pt over Claude Opus 4.7 (Thinking) and only behind Fable 5! GLM-5.2 is the best open model vs Kimi-K2.6 and Minimax-M3 by a large margin. - #2 React …

elvis06-16 19:59原文
Simon Willison’s Weblog06-17 23:58原文
IT之家06-15 15:12原文

查看原推