RL Dreams: Policy Gradient Optimization for Score Distillation 3D Generation

Under submission

Anonymous authors

Abstract

3D generation has rapidly accelerated in the past decade owing to the progress in the field of generative modeling. Score Distillation Sampling (SDS) based rendering has improved 3D asset generation to a great extent. Further, the recent work of Denoising Diffusion Policy Optimization (DDPO) demonstrates that the diffusion process is compatible with policy gradient methods and has been demonstrated to improve the 2D diffusion models using an aesthetic scoring function. We first show that this aesthetic scorer acts as a strong guide for a variety of SDS-based methods and demonstrates its effectiveness in text-to-3D synthesis. Further, we leverage the DDPO approach to improve the quality of the 3D rendering obtained from 2D diffusion models. Our approach called DDPO3D, employs the policy gradient method in tandem with aesthetic scoring. To the best of our knowledge, this is the first method that extends policy gradient methods to 3D score-based rendering and shows improvement across SDS-based methods such as DreamGaussian, which are currently driving research in text-to-3D synthesis. Our approach is compatible with score distillation-based methods, which would facilitate the integration of diverse reward functions into the generative process.

Architecture overview

We start with the SDS-based gradient calculation using a frozen UNet model calculated by performing the denoising operation on the image generated using the NeRF/Gaussian Splatting technique. Then, using the rendered image, we calculate the aesthetic score and perform another step of noising-denoising using the UNet as a policy network. We use the generated latents as the action and compute the policy gradient, which is then used to update the NeRF/Gaussian splat parameters to maximize the aesthetic score.

Generated 3D assets from textual prompts

A campfire

A small saguaro cactus planted in a clay pot
A photo of an icecream

A tulip

A parrot sitting on a basket of macarons
An imperial state crown of England

A photo of a fire hydrant, highly detailed
A bagel filled with cream cheese and lox

A delicious hamburger

A ripe strawberry

A pineapple

A fresh cinnamon roll covered in glaze, high resolution
A blue jay standing on a large basket of rainbow macarons

A delicious croissant


A photo of fries and a hamburger

A photo of a cupcake

Exported Meshes

There might be some artifacts (dark spots) on the models could be attributed to the HTML model viewer components in certain browsers and operating systems, as observed in the case of Chrome on Ubuntu. These artifacts are not a result of the exported meshes.
A pineapple
A parrot sitting on a basket of macarons
A photo of a fire hydrant, highly detailed
A bagel filled with cream cheese and lox
A photo of a cupcake
An imperial state crown of England
A blue jay standing on a large basket of rainbow macarons
A campfire