GLM 5.2 在 FrontierSWE 基准排名第三，超越 GPT-5.5

精选理由

GLM 5.2 在编码基准上干掉了 GPT-5.5，开源里最强，值得关注。

AI 摘要

GLM 5.2 在 FrontierSWE 基准上排名第 3，得分仅次于 Fable 5 和 Opus 4.8，并超越 GPT-5.5。这是首个缩小 Anthropic/OpenAI 与其他提供商之间差距的模型，同时也是目前最强的开源权重模型。该成绩展示了开源模型在编码任务上的竞争力。

AI 翻译 · 中文

elvisLooks strong at SWE too. x.com/ProximalHQ/sta… Proximal @ProximalHQ GLM 5.2 ranks #3 on FrontierSWE. It is only behind Fable 5 and Opus 4.8, and it outperforms GPT-5.5. This is the first model that closes the large gap b…

Decoder06-17 17:30原文
宝玉06-16 23:30原文
marktechpost06-15 06:10原文
arXiv: Anthropic06-15 10:37原文
lmarena.ai06-17 20:21原文
AI Will06-15 05:27原文
Simon Willison’s Weblog06-16 05:20原文
kimmonismus06-16 13:55原文
Fireworks AI06-16 22:11原文
ollama06-17 18:03原文

查看原推