DEV Community

Papers Mache

Generative models now output simulation‑ready 3D assets

Vision‑language transformers paired with geometric primitives now output metric‑scale, simulation‑ready 3D assets. PhysX‑Omni demonstrates that a single autoregressive transformer can ingest image, text, and spatial priors and directly emit meshes, materials, and physics descriptors that import straight into simulation engines that support URDF/XML formats.

Before this work, most generative 3D pipelines either ignored physical properties or were constrained to a single object class (rigid, deformable, or articulated), forcing developers to stitch together separate tools for geometry and physics. The authors explicitly cite this fragmentation as the motivation for a unified approach [1].

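"On PhysXVerse, our method achieves a PSNR of 21.52, CD of 2.95, and F-score of 91.28, substantially surpassing the previous best results." These figures cover both rendering fidelity (PSNR, peak signal-to-noise ratio, higher is better) and geometric accuracy (CD, Chamfer distance, lower is better; F-score, higher is better). By the authors' account, the generated geometry is both visually faithful and quantitatively closer to the ground truth than any prior model evaluated on the same dataset.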
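For readers unfamiliar with the three metrics quoted above, here is a minimal pure-Python sketch of how they are commonly defined. It is an illustration only: the paper's exact conventions (squared versus unsquared distances, sampling density, the F-score threshold, scaling factors) are not stated here, so the function names, the `threshold` parameter, and the brute-force nearest-neighbour search are all assumptions.

```python
import math


def nearest_distances(src, dst):
    """For each point in src, the Euclidean distance to its nearest point in dst."""
    return [min(math.dist(p, q) for q in dst) for p in src]


def chamfer_distance(pred, gt):
    """Symmetric Chamfer distance: mean nearest-neighbour distance, summed over both directions.

    Assumption: unsquared distances; many papers use squared distances instead.
    """
    d_pred = nearest_distances(pred, gt)
    d_gt = nearest_distances(gt, pred)
    return sum(d_pred) / len(d_pred) + sum(d_gt) / len(d_gt)


def f_score(pred, gt, threshold):
    """Harmonic mean of precision and recall at a distance threshold (assumed value)."""
    d_pred = nearest_distances(pred, gt)
    d_gt = nearest_distances(gt, pred)
    precision = sum(d <= threshold for d in d_pred) / len(d_pred)
    recall = sum(d <= threshold for d in d_gt) / len(d_gt)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def psnr(mse, max_value=1.0):
    """Peak signal-to-noise ratio in dB from a mean squared error between two images."""
    return 10 * math.log10(max_value ** 2 / mse)


# Toy point clouds (hypothetical data, not from PhysXVerse).
predicted = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.5)]
print(chamfer_distance(predicted, reference))
print(f_score(predicted, reference, threshold=0.1))
```

The brute-force search is quadratic in the number of points, which is fine for a toy example; a real evaluation would typically use a spatial index such as a k-d tree.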

"On PhysXVerse, the absolute scale error is reduced from 309.31 in PhysXGen and 298.19 in PhysX-Anything to only 2.79 in PhysX-Omni." This roughly hundredfold reduction, about two orders of magnitude, suggests the objects come out close to real-world dimensions, which would remove much of the tedious manual rescaling step that has long plagued simulation pipelines.
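The size of that reduction can be checked directly from the quoted numbers. The sketch below only does the arithmetic; the helper name `improvement_factor` is made up for illustration.

```python
import math

# Absolute scale errors on PhysXVerse, as quoted from the paper.
baseline_errors = {"PhysXGen": 309.31, "PhysX-Anything": 298.19}
physx_omni_error = 2.79


def improvement_factor(baseline, improved):
    """How many times smaller the improved error is than the baseline error."""
    return baseline / improved


for name, error in baseline_errors.items():
    factor = improvement_factor(error, physx_omni_error)
    print(f"{name}: {factor:.1f}x smaller, {math.log10(factor):.2f} orders of magnitude")
```

Both baselines come out at a little over 100x, which is where the "two orders of magnitude" figure comes from.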

The released codebase ships an inference script that writes URDF and XML files, so the pipeline runs end-to-end from prompt to asset: a single inference run yields a physics-enabled asset intended for drop-in use with any simulator that understands these formats. Depending on the target simulator, some post-hoc rigging or parameter tuning may still be necessary.
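Before loading a generated asset into a simulator, it can help to sanity-check the URDF. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; it relies only on standard URDF elements (`robot`, `link`, `inertial/mass`, `joint`, `parent`, `child`). The `SAMPLE_URDF` string is a hand-written, hypothetical example and not actual PhysX-Omni output, and `summarize_urdf` is an illustrative helper, not part of the released codebase.

```python
import xml.etree.ElementTree as ET

# Hypothetical hand-written URDF for illustration; not produced by PhysX-Omni.
SAMPLE_URDF = """<robot name="cabinet">
  <link name="body">
    <inertial>
      <mass value="12.0"/>
    </inertial>
  </link>
  <link name="door"/>
  <joint name="door_hinge" type="revolute">
    <parent link="body"/>
    <child link="door"/>
    <limit lower="0.0" upper="1.57" effort="10" velocity="1"/>
  </joint>
</robot>
"""


def summarize_urdf(urdf_text):
    """List links (with mass, if declared) and joints found in a URDF string."""
    root = ET.fromstring(urdf_text)
    if root.tag != "robot":
        raise ValueError("expected a <robot> root element")

    links = {}
    for link in root.findall("link"):
        mass = link.find("inertial/mass")
        links[link.get("name")] = float(mass.get("value")) if mass is not None else None

    joints = []
    for joint in root.findall("joint"):
        joints.append({
            "name": joint.get("name"),
            "type": joint.get("type"),
            "parent": joint.find("parent").get("link"),
            "child": joint.find("child").get("link"),
        })

    # Links without a declared mass are worth reviewing before simulation.
    links_without_mass = [name for name, mass in links.items() if mass is None]
    return {"links": links, "joints": joints, "links_without_mass": links_without_mass}


summary = summarize_urdf(SAMPLE_URDF)
print(summary["links"])
print(summary["joints"])
print(summary["links_without_mass"])
```

A check like this will not catch bad physics parameters, but it flags structural gaps (missing masses, dangling joints) of the kind that may require the post-hoc tuning mentioned above.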

The paper does not report generation latency or memory consumption, leaving open whether PhysX‑Omni can serve interactive authoring tools where instant feedback is crucial. Moreover, although PhysXVerse spans indoor and outdoor categories and covers a range of object types, the model's robustness on highly articulated creatures or large‑scale terrains has yet to be thoroughly evaluated.

If the reported fidelity and scale accuracy translate to production workloads, studios can replace hours of manual modeling and physics authoring with a single text prompt, fundamentally reshaping asset pipelines that have long treated geometry and simulation as separate stages.

References

  1. PhysX-Omni: Unified Simulation-Ready Physical 3D Generation for Rigid, Deformable, and Articulated Objects
