Qwen3.7 Max debuts at #4 on Code Arena Frontend, surpassing GLM-5.1 and matching Claude Opus 4.6

Why it was selected

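Qwen3.7 Max is on par with Claude Opus 4.6 on agentic coding tasks. Teams doing frontend development or building automation agents should give it a try, especially for workloads that need long-running autonomous execution.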

AI Summary

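Qwen3.7 Max ranks #4 on the Code Arena Frontend coding leaderboard, making it the highest-ranked model from a Chinese lab on the board. It surpasses GLM-5.1 and is on par with Claude Opus 4.6. The model is designed for the agentic era: it supports end-to-end coding, frontend prototyping, multi-file refactoring, and real-world debugging, and it can also complete office tasks through MCP integration and multi-agent orchestration. On long-running autonomous tasks, it can run continuously for 35 hours and execute more than 1,000 tool calls without human intervention. The API is available on Alibaba Cloud's Bailian (百炼) platform, and users can also try the model in Qwen Studio.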

lmarena.ai: Qwen3.7 Max (20250517) debuts at #4 in Code Arena: Frontend - the top-ranked Chinese lab on the board, surpassing GLM-5.1 and is now on par with Claude Opus 4.6 on agentic web development tasks. Huge congrats to @Alibaba…
