VideoMat: Extracting PBR Materials from Video Diffusion Models

1 NVIDIA   2 University of Toronto   3 Vector Institute

Our method starts from a known 3D model and an HDR environment map. We first render videos of normal maps and three simple uniform shading conditions (diffuse, semi-specular, fully specular) lit by the provided probe. These conditions, together with a text prompt, are passed to our finetuned video model, which generates a coherent video of the object with a novel material while respecting the given lighting condition. We then pass this video to a second video model, which performs intrinsic decomposition and generates per-frame G-buffers of the material properties. Finally, the outputs of the two video models, alongside the given geometry and lighting, are passed to a differentiable path tracer, which performs multi-view reconstruction to extract high-quality PBR materials from the generated views.
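
As a rough illustration of the conditioning signals, the toy sketch below (our own simplification, not the paper's code) computes per-pixel diffuse and semi-specular shading from a normal map under a single assumed dominant light direction; the actual pipeline shades with the full HDR probe in a path tracer.

import numpy as np

def lambert(n, l):
    # Diffuse condition: clamped cosine max(n.l, 0)
    return np.clip((n * l).sum(-1, keepdims=True), 0.0, None)

def blinn_phong(n, l, v, shininess=32.0):
    # Semi-specular condition: Blinn-Phong lobe (n.h)^s
    h = (l + v) / np.linalg.norm(l + v)
    return np.clip((n * h).sum(-1, keepdims=True), 0.0, None) ** shininess

def mirror_reflect(n, v):
    # Fully specular condition: reflect the view vector about the
    # normal, then look it up in the HDR probe (lookup not shown).
    return 2.0 * (n * v).sum(-1, keepdims=True) * n - v

normals = np.random.randn(256, 256, 3)          # stand-in normal map
normals /= np.linalg.norm(normals, axis=-1, keepdims=True)
light = np.array([0.0, 0.7071, 0.7071])         # assumed dominant probe direction
view = np.array([0.0, 0.0, 1.0])

diffuse_cond = lambert(normals, light)
semi_specular_cond = blinn_phong(normals, light, view)
reflection_dirs = mirror_reflect(normals, view)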

Abstract


We leverage finetuned video diffusion models, intrinsic decomposition of videos, and physically-based differentiable rendering to generate high-quality materials for 3D models given a text prompt or a single image. First, we condition a video diffusion model to respect the input geometry and lighting condition; this model produces multiple views of a given 3D model with coherent material properties. Second, we use a recent intrinsic-decomposition model to extract intrinsics (base color, roughness, metallic) from the generated video. Finally, we use these intrinsics alongside the generated video in a differentiable path tracer to robustly extract PBR materials directly compatible with common content creation tools.
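
To make the final stage concrete, here is a minimal, self-contained sketch of the multi-view material optimization, with a toy differentiable shader standing in for the physically-based path tracer. All names, the toy shading model, and the single-frame setup are our illustrative assumptions; the actual method also supervises the material maps with the intrinsic G-buffers from the second video model.

import torch

def render_toy(basecolor, roughness, metallic, normals, light):
    # Toy differentiable "renderer": Lambertian diffuse plus a crude
    # gloss term. A stand-in for the differentiable path tracer.
    ndotl = (normals * light).sum(-1, keepdim=True).clamp(min=0.0)
    return basecolor * ndotl * (1 - metallic) + (1 - roughness) * ndotl * 0.2

# Per-pixel material parameters, squashed to [0,1] via sigmoid and
# optimized to match a frame of the generated video.
H, W = 128, 128
params = {k: torch.zeros(H, W, c, requires_grad=True)
          for k, c in [("basecolor", 3), ("roughness", 1), ("metallic", 1)]}
opt = torch.optim.Adam(params.values(), lr=5e-2)

normals = torch.nn.functional.normalize(torch.randn(H, W, 3), dim=-1)
light = torch.tensor([0.0, 0.0, 1.0])
target = torch.rand(H, W, 3)  # stand-in for one generated video frame

for _ in range(200):
    img = render_toy(params["basecolor"].sigmoid(),
                     params["roughness"].sigmoid(),
                     params["metallic"].sigmoid(), normals, light)
    loss = (img - target).abs().mean()  # photometric L1 loss
    opt.zero_grad()
    loss.backward()
    opt.step()

The sigmoid squashing keeps all material parameters in a valid [0,1] range during unconstrained gradient descent; in practice the loss would be accumulated over all frames and camera views of the generated video.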


Relightable, standard PBR materials from a text prompt


Given a 3D model and a prompt, we extract high quality PBR materials from finetuned video diffusion models. We show the extracted materials with three different HDR probes. 3D Model from the BlenderVault dataset. HDRI probes from Poly Haven.

Material variety


Our finetuned video model generates view-consistent multi-view images of diverse materials from text prompts, while closely respecting the input geometry and lighting.

Text-to-material generation


We compare against Paint-it [YOPM24] and DreamMat [ZLX∗24] on a 3D model from the BlenderVault dataset. We encourage the reader to compare the quality of the intrinsics (base color, roughness, metallic) to the reference. While significant deviations are expected in purely text-guided methods, the base color predicted by our method is significantly more demodulated, i.e., free of baked-in shading ("flat"), than that of the competing work. Our roughness and metallic predictions are also more faithful to the reference, though with a slight bias towards lower roughness.

Citation


@inproceedings{munkberg2025videomat,
    author    = {Jacob Munkberg and Zian Wang and Ruofan Liang and Tianchang Shen and Jon Hasselgren},
    title     = {{VideoMat: Extracting PBR Materials from Video Diffusion Models}},
    booktitle = {Eurographics Symposium on Rendering - CGF Track},
    year      = {2025}
}

Paper


VideoMat: Extracting PBR Materials from Video Diffusion Models

Jacob Munkberg, Zian Wang, Ruofan Liang, Tianchang Shen, and Jon Hasselgren

Preprint (arXiv)
Video
BibTeX