Mistral 3.5 joins Arena's Agent Mode, where models carry out complex real-world tasks

Why it was selected

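Arena's Agent Mode lets developers directly compare leading models on complex real-world tasks. Teams choosing a model for agent applications should test it hands-on, since their sessions feed directly into the leaderboard.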

AI summary

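The Arena platform has launched a new Agent Mode and added the Mistral 3.5 model to it. In this mode, models carry out complex tasks such as deep research, report generation, website creation, and code debugging by calling tools including web search, bash in a sandboxed environment, image generation, and file writing. Users can test frontier models themselves, including GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro, and top open-source models. Their test sessions will help shape the Agent Arena leaderboard, providing real-world data for evaluating agent capabilities.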

Original post

lmarena.ai: Mistral 3.5 by @MistralAI has been added to Arena's new Agent Mode! Put models to work on your most complex real-world tasks, and see how they perform. Your sessions will help shape the Agent Arena leaderboard. Arena…
