OpenRouter让11个LLM互斗，太友善的模型输得惨

精选理由

AI太友善反成短板，看实测结果

AI 摘要

OpenRouter开发者构建了名为"Royale: Last Agent Stand"的大逃杀游戏，让11个LLM在零和博弈中对抗。实验共运行30次，结果显示最友善的模型（如Claude）输得最惨，而一个最不被看好的模型意外获胜。该实验表明，在竞争性任务中，模型过于礼貌反而会损害表现。

AI 翻译 · 中文

OpenRouterCan AI models be too nice for a given task? It turns out, depending on the task, the answer is yes! Our dev rel @jjacky built Royale: Last Agent Stand, a battle royale game just for agents, and let 11 LLMs go wild: x.com…

查看原推