Fireworks launches RL fine-tuning for Nemotron 3, billed by GPU-hour

Why it was picked

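Fireworks has just launched RL fine-tuning for Nemotron 3. Billing is by GPU-hour, so long conversations do not run up the bill, and GRPO training and serving are handled in one place.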

AI summary

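Fireworks announced that reinforcement learning (RL) fine-tuning for NVIDIA Nemotron 3 is now live, starting with LoRA fine-tuning of Nemotron 3 Super. Training uses the GRPO algorithm, and both training and deployment can be done on the same platform. Billing is by GPU-hour rather than per token, which is meant to keep the cost of long multi-turn rollouts from becoming unpredictable.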
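For readers unfamiliar with GRPO, the core idea can be sketched in a few lines: several rollouts are sampled for the same prompt, and each rollout's reward is normalized against the mean and standard deviation of its own group. The function name, the reward values, and the use of the population standard deviation below are illustrative assumptions; this is not Fireworks' implementation, and real trainers differ in details.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each rollout's reward against its own sampling group.

    Hypothetical helper for illustration only. Uses the population
    standard deviation; some implementations use the sample one.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # eps avoids division by zero when every rollout gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four rollouts sampled from the same prompt.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Rollouts that score above their group's average get a positive advantage and are reinforced; those below get a negative one. Because the baseline comes from the group itself, no separate value model is needed.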

AI translation · Chinese

Fireworks AI: RL fine-tuning is now live for @nvidiaai Nemotron 3 on Fireworks, starting with Nemotron 3 Super (LoRA). Train with GRPO and serve the model in one place. We price by GPU-hour, not per token, so long multi-turn rollouts …

marktechpost · 06-27 00:02
IT之家 · 06-24 08:11
AI Will · 06-24 09:39
Hugging Face: Blog · 06-24 16:00
NVIDIA AI · 06-24 16:03
berryxia · 06-24 16:50
