OrbitForge: 文本到3D场景生成的新方法

精选理由

OrbitForge用文本直接生成360度3D场景，解决了视频视角不全的问题，效果比单用MedianGS好很多。

AI 摘要

OrbitForge利用冻结的视频先验和逐提示高斯泼溅重建优化，将单个文本生成视频转换为规范闭环轨道3D高斯泼溅场景。它通过可变形高斯泼溅和稳健MedianGS代理获得初步3D重建，然后渲染指定轨道视图检测缺失视角。该方法仅补全缺失视角并重建最终场景，无需任务特定视频或多视角微调。在300提示T3Bench审计中，OrbitForge达到了359.0度中位数跨度，并将Q10 ImageReward从8.07提升至16.36，同时与VideoMV保持竞争力。

AI 翻译 · 中文

arXiv cs.AIGeneric text-to-video models can be used as rich open-world scene priors. Despite the high quality of today's generated videos, they do not directly yield reliable 3D assets: camera motion is difficult to control, view c…

阅读原文