Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[metalearn] support continuous policy action space #23

Open
cosmicBboy opened this issue Aug 1, 2020 · 1 comment
Open

[metalearn] support continuous policy action space #23

cosmicBboy opened this issue Aug 1, 2020 · 1 comment

Comments

@cosmicBboy
Copy link
Owner

Support continuous action space for selecting real hyperparameters within the bounds specified by algorithm space config:

@cosmicBboy
Copy link
Owner Author

To implement this, need to generalize the hyperparameter head to support both multiclass classification over a discrete and finite hyperparameter space, but also a continuous hyperparameter space.

For the continuous case, the heads should be mu and sigma that produce an inference of the parameters that govern the shape of a gaussian. On policy generation, these two values should be used to parameterize a normal distribution that we sample from to produce a scalar of the continuous action (e.g. the l2 regularization parameter value).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant