Amazon SageMaker AI Async Inference Supports Inline Request Payloads

Why it was selected

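Amazon SageMaker AI Async Inference can now take the data directly in the request, so there is no need to upload it to S3 first, which saves a step.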

AI summary

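AWS has announced that Amazon SageMaker AI Async Inference now supports sending the inference payload directly in the request body of the InvokeEndpointAsync API (an inline payload), with no need to upload it to S3 beforehand. This simplifies the workflow, removes the extra S3 interaction, and lowers latency. Users can put up to 2MB of data in the request body, which suits lightweight inference scenarios.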
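As a rough illustration of the choice a client now has, the sketch below routes a payload either inline or through S3 depending on its size. It is a minimal sketch under stated assumptions, not AWS's documented usage: the summary does not name the request field that carries the inline payload, so `Body` here is a hypothetical placeholder, and "2MB" is read as 2 * 1024 * 1024 bytes. `EndpointName`, `ContentType`, and `InputLocation` are existing InvokeEndpointAsync parameters; the endpoint name and S3 URI are made up.

```python
import json

# Assumption: "2MB" is read as 2 * 1024 * 1024 bytes; the source does not say
# whether the limit is binary or decimal.
INLINE_LIMIT_BYTES = 2 * 1024 * 1024


def choose_delivery(payload: bytes, limit: int = INLINE_LIMIT_BYTES) -> str:
    """Return "inline" if the payload fits in the request body, else "s3"."""
    return "inline" if len(payload) <= limit else "s3"


def build_request(endpoint_name: str, payload: bytes, s3_uri: str) -> dict:
    """Build keyword arguments for an InvokeEndpointAsync-style call.

    "Body" is a hypothetical name for the inline payload field; check the
    current API reference for the real one before using this.
    """
    request = {"EndpointName": endpoint_name, "ContentType": "application/json"}
    if choose_delivery(payload) == "inline":
        request["Body"] = payload  # hypothetical parameter name
    else:
        # Fallback to the pre-existing flow: the payload must already be
        # uploaded to this S3 location by the caller.
        request["InputLocation"] = s3_uri
    return request


# Hypothetical example values.
small = json.dumps({"inputs": "hello"}).encode("utf-8")
large = b"x" * (INLINE_LIMIT_BYTES + 1)
small_request = build_request("demo-endpoint", small, "s3://example-bucket/in.json")
large_request = build_request("demo-endpoint", large, "s3://example-bucket/in.json")
```

The resulting dictionary would be passed to the client call once the real field name is confirmed; payloads over the limit keep using the S3 upload path.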

Image source: AWS Machine Learning Blog


AWS Machine Learning Blog: Today, we’re announcing inline payload support for Amazon SageMaker AI Async Inference. Customers can now send inference payloads directly in the request body of the InvokeEndpointAsync API, removing the need to upload i…

marktechpost · 06-16 09:21
