Background
Backpropagation through random variables is no easy task. Two main methods are commonly adopted for derivative estimation: the score function (SF) estimator and the pathwise derivative estimator (see https://arxiv.org/abs/1506.05254 for more details). The former is widely used in reinforcement learning, while the pathwise derivative estimator appears frequently in variational-autoencoder-related models, where it is often referred to as the reparameterization trick. One of the key differences between the two methods is that the pathwise derivative estimator requires the derivative of the density function f(x;θ) with respect to the parameter, which in turn requires the sampling operation itself to have a gradient, whereas the SF estimator can bypass this calculation by using the log-derivative trick.
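For concreteness, here is a minimal NumPy sketch (not part of the proposal; f(x) = x² and the Gaussian mean μ are arbitrary choices) comparing the two estimators on a case where the true gradient 2μ is known in closed form:

```python
# Compare the score function estimator and the pathwise estimator for
# d/d_mu E[x^2] with x ~ N(mu, sigma^2); the true value is 2 * mu.
import numpy as np

rng = np.random.default_rng(0)
mu, sigma, n = 1.5, 2.0, 1_000_000

eps = rng.standard_normal(n)
x = mu + sigma * eps          # reparameterized samples
f = x ** 2                    # function whose expectation we differentiate

# Score function (REINFORCE) estimator: E[f(x) * d/d_mu log p(x; mu, sigma)]
score = (x - mu) / sigma ** 2
sf_grad = np.mean(f * score)

# Pathwise (reparameterization) estimator: E[f'(x) * dx/d_mu] with dx/d_mu = 1
pathwise_grad = np.mean(2 * x)

print(sf_grad, pathwise_grad, 2 * mu)   # both estimates should be close to 3.0
```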
Proposal
I'm planning to prototype the pathwise gradient for some of the sampling methods in Deep Numpy (Gaussian and Gamma for now) by applying the following modifications (a sketch of the equivalent manual reparameterization follows the list):
Add a require_grads parameter to the Python frontend.
Add a backward function in the backend.
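To make the intended behavior concrete, here is a minimal sketch of the manual reparameterization that these changes would fold into the sampling operators themselves (written against today's mx.nd + autograd API, not the proposed interface):

```python
# Manual reparameterization: sample = loc + scale * eps with eps ~ N(0, 1) kept
# outside the gradient path, so gradients reach loc and scale through the sample.
import mxnet as mx
from mxnet import autograd

loc = mx.nd.array([0.5])
scale = mx.nd.array([1.2])
loc.attach_grad()
scale.attach_grad()

with autograd.record():
    eps = mx.nd.random.normal(shape=loc.shape)  # constant; no gradient flows into it
    sample = loc + scale * eps                  # differentiable w.r.t. loc and scale
    loss = (sample ** 2).sum()
loss.backward()

print(loc.grad, scale.grad)  # pathwise gradients: 2 * sample and 2 * sample * eps
```

With require_grads on the sampling call, this eps bookkeeping would happen inside the operator's backward function instead of in user code.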
If my experiment goes well, these enhanced sampling methods could serve as the foundation for the distribution module mentioned in the MXNet 2.0 Roadmap #16167.
Also, differentiable sampling has been available in both TensorFlow (tf.distributions) and PyTorch (torch.distributions) for many years; I think it is necessary for MXNet to have such a feature as well.
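For reference, a short snippet of what that looks like in PyTorch (illustrative only, not part of the proposal):

```python
# torch.distributions.Normal.rsample() draws a reparameterized sample, so the
# backward pass produces gradients for loc and scale.
import torch

loc = torch.zeros(1, requires_grad=True)
scale = torch.ones(1, requires_grad=True)

sample = torch.distributions.Normal(loc, scale).rsample()
loss = (sample ** 2).sum()
loss.backward()

print(loc.grad, scale.grad)
```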
Update:
Gradient for Gaussian sampling has been added and is under review: #16330
Next, I will try to implement a vanilla VAE demo based on it to find out if the interface is easy to use in practice.
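As a rough, hypothetical sketch (assuming #16330 makes mx.np.random.normal differentiable and that it accepts ndarray loc/scale), the part of the VAE demo that exercises the new interface would be the latent sampling step:

```python
# Hypothetical VAE latent-sampling step; mu/sigma stand in for encoder outputs
# and (z ** 2).mean() stands in for the decoder / reconstruction term.
from mxnet import autograd, np, npx
npx.set_np()

mu = np.zeros((4, 8))
sigma = np.ones((4, 8))
mu.attach_grad()
sigma.attach_grad()

with autograd.record():
    z = np.random.normal(mu, sigma)   # differentiable sample (the proposed behavior)
    recon = (z ** 2).mean()           # placeholder reconstruction term
    kl = 0.5 * (sigma ** 2 + mu ** 2 - 2 * np.log(sigma) - 1).mean()
    loss = recon + kl
loss.backward()

print(mu.grad.shape, sigma.grad.shape)
```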
Hey, this is the MXNet Label Bot.
Thank you for submitting the issue! I will try and suggest some labels so that the appropriate MXNet community members can help resolve it.
Here are my recommended label(s): Feature