
Would making the gradient "data" by detaching them implement first order MAML? #128

brando90 opened this issue Feb 1, 2022 · 0 comments


brando90 commented Feb 1, 2022

I didn't realize until now that the track_higher_grads flag existed.

But now I realize I might have a weird version of MAML going on in my code, and I wanted to make sure it is correct.

What I did is make the gradients raw tensors by detaching them from the computation graph, e.g.:

                if self.fo:  # first-order
                    g = g.detach()  # disallows flow of higher-order grads while still letting params track gradients.

I was wondering if this would be equivalent to track_higher_grads=False.
In particular, I keep the detach but leave track_higher_grads=True, which is the part that confuses me.
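
To make the question concrete, here is a minimal, self-contained sketch of the kind of inner step I mean (the names inner_sgd_step, params, inner_lr, and fo are illustrative stand-ins for my actual code, not higher's API):

    import torch

    def inner_sgd_step(params, loss, inner_lr, fo):
        # Gradients that themselves stay on the graph
        # (analogous to track_higher_grads=True / create_graph=True).
        grads = torch.autograd.grad(loss, params, create_graph=True)
        new_params = []
        for p, g in zip(params, grads):
            if fo:  # first-order
                g = g.detach()  # gradient becomes plain data; no second-order terms flow through it
            # The update is still a function of p, so the outer/meta gradient
            # can flow back through p even when g is detached.
            new_params.append(p - inner_lr * g)
        return new_params

My understanding is that detaching g drops the derivative-of-the-gradient term from the meta-gradient, which is exactly what first-order MAML ignores, but I'd like confirmation that this matches what track_higher_grads=False does.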


Related:

official first-order MAML: #63
docs: https://higher.readthedocs.io/en/latest/optim.html
cross: #128 , https://stackoverflow.com/questions/70947042/how-does-one-run-first-order-maml-with-pytorchs-higher-library
