[metalearn] support continuous policy action space #23

cosmicBboy · 2020-08-01T21:33:45Z

Support continuous action space for selecting real hyperparameters within the bounds specified by algorithm space config:

https://medium.com/@asteinbach/actor-critic-using-deep-rl-continuous-mountain-car-in-tensorflow-4c1fb2110f7c
use the normal distribution: https://pytorch.org/docs/stable/distributions.html#normal

cosmicBboy · 2020-08-07T03:24:56Z

To implement this, need to generalize the hyperparameter head to support both multiclass classification over a discrete and finite hyperparameter space, but also a continuous hyperparameter space.

For the continuous case, the heads should be mu and sigma that produce an inference of the parameters that govern the shape of a gaussian. On policy generation, these two values should be used to parameterize a normal distribution that we sample from to produce a scalar of the continuous action (e.g. the l2 regularization parameter value).

cosmicBboy mentioned this issue Aug 1, 2020

[metalearn] neurips bbo challenge idea dump #26

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[metalearn] support continuous policy action space #23

[metalearn] support continuous policy action space #23

cosmicBboy commented Aug 1, 2020

cosmicBboy commented Aug 7, 2020

[metalearn] support continuous policy action space #23

[metalearn] support continuous policy action space #23

Comments

cosmicBboy commented Aug 1, 2020

cosmicBboy commented Aug 7, 2020