Cross-Axis Fusion 提升文本驱动的3D人体动作编辑效果

精选理由

做3D动作生成或编辑的研究者可以关注，它解决了现有模型只关注时间编辑而忽略关节级修改的问题，直接提升编辑精度和语义一致性。

AI 摘要

本文提出一种新架构用于文本驱动的3D人体动作编辑，核心创新在于不仅关注编辑发生的时间点，还识别哪些关节需要修改。架构包含两个轴锚定Transformer，分别提取关节和时间维度的特征，并通过跨轴融合块整合。辅助任务让关节锚定Transformer回归源与目标关节旋转的Soft-DTW距离，学习哪些关节应修改或保留。在MotionFix数据集上达到最先进效果，显著提升语义对齐和动作保真度。

AI 翻译 · 中文

arXiv cs.AIWe address text-based 3D human motion editing, where the goal is to preserve the style and structure of a source motion while applying edits described in natural language. The release of the MotionFix dataset has spurred…

阅读原文