Claude Fable 5 leads GPT-5.5 by 13 percentage points on the hardest FrontierMath problems

Why it was selected

Anthropic's new model decisively outperforms GPT-5.5 in math

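AI summary

Anthropic's Claude Fable 5 reaches 88% accuracy on the hardest FrontierMath tier, a sharp increase from Opus 4.5, which scored below 10% in early 2026. OpenAI's GPT-5.5 reaches about 75% on the same tier. The gap between the two is 13 percentage points, which points to accelerating progress in AI mathematical ability.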

AI translation · Chinese

Decoder: Anthropic's Claude Fable 5 hits 88 percent accuracy on the hardest FrontierMath tier, a massive jump from Opus 4.5, which sat below 10 percent in early 2026. OpenAI's GPT-5.5 reaches about 75 percent on the same tier. Th…

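lmarena.ai · 06-11 19:35 · Original
Artificial Analysis · 06-12 04:49 · Original
Epoch AI · 06-13 05:17 · Original
rohanpaul_ai · 06-11 13:00 · Original
向阳乔木 · 06-11 17:02 · Original
AI Will · 06-13 01:24 · Original
Cognition · 06-13 01:36 · Original
Simon Willison · 06-13 02:14 · Original
elvis · 06-13 03:04 · Original
marktechpost · 06-13 08:15 · Original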
