This is the official implementation of Nabla-R2D3, which is a highly effective and sample-efficient reinforcement learning alignment framework for 3D-native diffusion models using pure 2D rewards.
You can create a release to package software, along with release notes and links to binary files, for other people to use. Learn more about releases in our docs.