编码器vs解码器：LLM安全评估裁判系统对比研究

精选理由

ArXiv上新论文，用ModernBERT和Ettin编码器做安全裁判，比LlamaGuard快还便宜，准确率没差太多。

AI 摘要

该论文系统比较了ModernBERT、Ettin等现代编码器分类器与LlamaGuard 3、LlamaGuard 4等LLM裁判在识别有害输出上的性能。使用F1分数、假阴性率和精准率-召回率指标评估，并分解了单轮提示、分解、升级和上下文操纵四种攻击技术。实验发现编码器分类器在多数场景下性能接近LLM裁判，但成本和延迟显著更低。

AI 翻译 · 中文

arXiv cs.AIWith the widespread adoption of large language models (LLMs) in chatbots and everyday applications, companies increasingly need guardrails that are effective while remaining low-cost and low-latency. Safety evaluation of…

阅读原文