New OpenAI research: training models to maintain beneficial, safe behavior on long, high-stakes tasks

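Why it was selected

OpenAI has published a new paper on how to keep AI behaving well on long-horizon tasks that go beyond its training scenarios. Worth a look for anyone who follows AI safety.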

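AI summary

OpenAI has released new research aimed at training AI models to generalize beneficial and safe behavior to new domains beyond the scope of their training, and to maintain that behavior under pressure. The method focuses on giving models broad, durable altruism; the paper is titled "Beneficial RL". Using a reinforcement learning framework, the research has models learn to autonomously keep their behavior aligned with human intent on longer, higher-stakes tasks, rather than merely fitting the training data. The paper and code are available at alignment.openai.com/beneficial-rl/.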

AI translation · Chinese

OpenAI: As AI takes on longer, higher-stakes tasks, we want models to carry beneficial and safe behavior into new domains beyond their training, and maintain it under pressure. That’s the idea behind our new research on training …

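Decoder · 06-19 10:08 · original
orange.ai · 06-20 14:54 · original
marktechpost · 06-17 05:49 · original
arXiv: OpenAI · 06-17 08:04 · original
elvis · 06-19 15:04 · original
宝玉 · 06-16 23:30 · original
shao__meng · 06-17 00:53 · original
IT之家 · 06-17 02:06 · original
AI Will · 06-17 09:19 · original
Aadit Sheth · 06-17 19:22 · original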
