What?

Enabling the reparameterisation trick to work for discrete random variables.

Why?

We want low-variance gradient estimators (i.e. reparameterisation) to work with discrete random variables.

How?

source: original paper

source: original paper
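
Since the figures are not reproduced here, below is a minimal sketch of one common way to reparameterise a categorical variable. It assumes a Gumbel-Softmax (Concrete) style relaxation, which this note does not name explicitly. The function names `sample_gumbel` and `gumbel_softmax_sample`, the `temperature` parameter, and the example probabilities are all illustrative assumptions, not taken from the paper.

```python
import math
import random


def sample_gumbel(rng):
    """Draw one Gumbel(0, 1) sample via inverse transform: -log(-log(U))."""
    # random() returns values in [0.0, 1.0); clamp away from 0 to keep log finite.
    u = max(rng.random(), 1e-12)
    return -math.log(-math.log(u))


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def gumbel_softmax_sample(logits, temperature, rng):
    """Relaxed one-hot sample (assumed Gumbel-Softmax form).

    The randomness lives in the Gumbel noise only, so the output is a
    deterministic, differentiable function of `logits` given that noise.
    """
    noisy = [(logit + sample_gumbel(rng)) / temperature for logit in logits]
    return softmax(noisy)


rng = random.Random(0)
probs = (0.1, 0.6, 0.3)  # illustrative class probabilities
logits = [math.log(p) for p in probs]

# A relaxed sample: a point on the probability simplex, not a hard one-hot vector.
relaxed = gumbel_softmax_sample(logits, temperature=0.5, rng=rng)

# Sanity check: the argmax of the noisy logits follows the original categorical
# distribution (the Gumbel-max trick), so class 1 should win about 60% of the time.
n = 20000
wins = [0, 0, 0]
for _ in range(n):
    sample = gumbel_softmax_sample(logits, temperature=0.5, rng=rng)
    wins[sample.index(max(sample))] += 1
frequencies = [w / n for w in wins]
```

In this sketch, lowering `temperature` pushes each sample closer to a one-hot vector, while raising it makes samples smoother. Because the noise is sampled independently of `logits`, an autodiff framework could backpropagate through the sample to the logits.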

And?

https://s3-us-west-2.amazonaws.com/secure.notion-static.com/1443187c-aa6b-4c61-aa28-26d7d89822f3/Untitled.png


This note is part of my paper notes series. You can find more here or on Twitter. I also have a blog.