Memory Contagion: Evaluator Bias Propagates Across Time Through Agent Memory

Why It Was Selected

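This paper finds that when agents are trained with a biased evaluator, the bias spreads like a virus through memory to later agents. The older model DeepSeek V4-Chat is affected, while Claude and V4-Pro are not, and authority bias does not propagate at all.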

AI Summary

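LLM agent memory systems degrade during continuous consolidation, but existing research assumes that memories come from unbiased experience. This study introduces Memory Contagion, a phenomenon in which bias introduced by a biased evaluator propagates across time through memory. In the experiments, length-preference bias propagates on the older model DeepSeek V4-Chat (Gamma_A = 13.18), while the newer models V4-Pro and Claude are immune. Authority bias did not propagate in any of the 15 multi-seed experiments (Gamma_A = 0.00). Length-bias propagation remains detectable at contamination rates as low as p = 0.2, and no safe threshold was found.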

AI Translation · Chinese

arXiv: DeepSeek

Large Language Model (LLM) agents increasingly rely on memory systems to maintain long-term coherence. Recent work shows that agent memories degrade during continuous consolidation. However, existing research assumes mem…
