What?

Sampled MuZero, a MuZero extension for continuous and (large) discrete action spaces.

Why?

Would be cool to have one algorithm that works with arbitrary action spaces.

How?

[Two figures omitted; source: original paper]

And?


This note is a part of my paper notes series. You can find more here or on Twitter. I also have a blog.