Arena launches Agent Mode, with support for GPT-5.5, Claude Opus 4.7, and other models

Why it was selected

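Arena's Agent Mode lets developers directly compare how frontier models perform on real-world tasks. Teams working on AI evaluation or model selection may find it worth trying.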

AI summary

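Arena officially launched Agent Mode today. It lets users test frontier models on real-world tasks, including deep research, report generation, website creation, and code debugging. The mode completes complex tasks through tool calls such as web search, sandboxed bash, image generation, and file writing. The first supported models include GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro, and top open-source models. Separately, Battle Mode has passed 50 million votes.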

lmarena.ai: As we launch Agent Mode on Arena today, we want to celebrate the community that brought us here. Battle Mode - where it all started - just passed 50 million votes. Thank you. …

rohanpaul_ai, 06-05 22:41 (original post)
Fireworks AI, 06-03 16:41 (original post)
