Add partial support for second-order derivatives (for grid.h's input) #69
Conversation
Hi Jianfei, thank you very much for this PR -- I think this is great functionality to have, even if implemented just for the encoding!
This is also a great implementation; I have very little to criticize. Here are just a few small requests before I feel comfortable merging.
Also, I just wanted to say that I very much appreciate the detailed derivations and explanations in the PR. This is really something to point other people to as an example of how it should be done!
Thanks for the nice tips and kind words :) I resolved some of the suggestions now. A few remain, mostly about the computations, which I'm not sure about.
Hi @Tom94,

Wow, this is incredible, thank you so much for adding this! By the way: do you want to add your test script to the repository?
forward.dy_dx.data(),
dL_dy_rm,
// outputs
dL_ddLdoutput->pitched_ptr()
Could you replace this pitched pointer with ->view() in order to also support row-major dL_ddLdoutput?
Turns out that was the only nitpick I could find in the code -- so I quickly made the change myself and merged. Thanks again for adding all this. I can't stress enough how much I appreciate this sort of high-quality code contribution!
@ventusff Thank you so much for your work on this. Are you still planning to do the double backward for fully_fused_mlp.cu? I would really like to test it on my thesis project.
Hi,
As discussed in issue #58, a backward_backward functionality is needed. For now, I managed to add partial support for second-order derivatives, only for grid.h, and only for d(dL_dinput)_d(...). This satisfies the need of backpropagating the gradients of nablas toward the grid parameters and the downstream modules' parameters.
edit: now supports d(dL_dinput)_d(input)
I tried my best to follow the code style of your original project, and I have run various tests to ensure that the code behaves correctly. I kept the changes as small as I could, and compilation passes with no errors.
Code tests
I provide three testing tools for the newly added backward_backward_input functionality, collected in this gist: https://gist.github.com/ventusff/57f47588eaff5f8b77a382260e7da8a3. Among them is a numerical check with torch.autograd.gradcheck.
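A minimal sketch of that gradcheck pattern (a reconstruction, not the gist's actual code; the stand-in function below replaces the real grid encoding): the function under test returns the first-order input gradient, so gradcheck's numerical Jacobian probes exactly the second-order derivatives d(dL_dinput)_d(input) and d(dL_dinput)_d(dL_doutput).

```python
import torch

def input_gradient(x, dL_dy):
    # Stand-in for the grid encoding; any smooth function works here.
    y = (x * x).sin()
    # create_graph=True keeps the backward graph, so the returned gradient
    # is itself differentiable; for the real encoding, this is the step
    # that would invoke the new backward_backward_input path.
    (dL_dx,) = torch.autograd.grad(y, x, grad_outputs=dL_dy, create_graph=True)
    return dL_dx

# gradcheck wants double precision; real half-precision kernels need
# relaxed tolerances instead.
x = torch.rand(8, 3, dtype=torch.double, requires_grad=True)
dL_dy = torch.rand(8, 3, dtype=torch.double, requires_grad=True)
assert torch.autograd.gradcheck(input_gradient, (x, dL_dy))
```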
Toy model compute graph (figure: toy_model):
You can see that the gradients of nablas are backpropagated towards grid_params (which needs d(dL_dx)_dgrid) and decoder.xxx (which needs d(dL_dx)_d(dL_doutput)) through backwardBackward.
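A hedged code sketch of this toy graph, assuming PyTorch bindings in the spirit of the tinycudann module (module name, Encoding signature, and config are illustrative; the gist uses its own bindings):

```python
import torch
import tinycudann as tcnn

# Hypothetical minimal config; real HashGrid setups need more parameters.
encoding = tcnn.Encoding(n_input_dims=3, encoding_config={"otype": "HashGrid"})
decoder = torch.nn.Linear(encoding.n_output_dims, 1).cuda()

x = torch.rand(1024, 3, device="cuda", requires_grad=True)
y = decoder(encoding(x).float())

# First-order input gradient (the "nablas"); create_graph=True keeps it
# differentiable so a loss on it can itself be backpropagated.
nablas = torch.autograd.grad(y.sum(), x, create_graph=True)[0]

# Any loss on nablas exercises backwardBackward of the encoding:
# d(dL_dx)_dgrid fills the grid-parameter gradients, and
# d(dL_dx)_d(dL_doutput) routes gradients on to the decoder weights.
loss = (nablas.norm(dim=-1) - 1.0).square().mean()  # e.g. an eikonal term
loss.backward()
```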
Theoretical derivation
From the gradients of dL_dx to the gradients of dL_dgrid and dL_d(dL_doutput).
edit: added theoretical derivations of d(dy_dx)_dx
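The detailed derivations themselves are not preserved in this extraction; what follows is a reconstructed sketch of the chain rule behind the three terms, with notation assumed rather than copied: y = f(x; θ) is the encoding, L the first loss, n = dL_dx the nablas, and v the incoming gradient of a second loss G defined on n.

```latex
\begin{align*}
n_i &= \frac{\partial L}{\partial x_i}
     = \sum_k \frac{\partial y_k}{\partial x_i}\,\frac{\partial L}{\partial y_k},
  \qquad v_i = \frac{\partial G}{\partial n_i} \\
\frac{\partial G}{\partial \theta_j}
    &= \sum_{i,k} v_i\,\frac{\partial^2 y_k}{\partial x_i\,\partial \theta_j}\,
       \frac{\partial L}{\partial y_k}
    && \text{(\texttt{d(dL\_dx)\_dgrid})} \\
\frac{\partial G}{\partial (\partial L/\partial y_k)}
    &= \sum_i v_i\,\frac{\partial y_k}{\partial x_i}
    && \text{(\texttt{d(dL\_dx)\_d(dL\_doutput)})} \\
\frac{\partial G}{\partial x_j}
    &= \sum_{i,k} v_i\,\frac{\partial^2 y_k}{\partial x_i\,\partial x_j}\,
       \frac{\partial L}{\partial y_k}
    && \text{(via \texttt{d(dy\_dx)\_dx})}
\end{align*}
```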
1. Since I added a virtual function backward_backward_input in object.h:DifferentiableObject, I had to put a dummy error function in network.h:Network and encoding.h:Encoding, which inherit from DifferentiableObject, to pass compilation. Currently this function just throws a NotImplementedError. I hope this won't hurt any current mechanisms, and you can remove this dummy function later once the remaining backward_backward support is finished.