You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I just noticed that the code is using TruncNormal as the actor distribution instead of TanhNormal as in v1. I wonder did you make some ablations on these two choices and see TruncNormal provide better results? Or the change is only because the entropy of TruncNormal is easier to compute than TanhNormal for the entropy regularizer?
The text was updated successfully, but these errors were encountered:
IcarusWizard
changed the title
Performance different between TruncNormal and TanhNormal
Performance difference between TruncNormal and TanhNormal
Jan 19, 2023
Hey @danijar.
I just noticed that the code is using
TruncNormal
as the actor distribution instead ofTanhNormal
as in v1. I wonder did you make some ablations on these two choices and seeTruncNormal
provide better results? Or the change is only because the entropy ofTruncNormal
is easier to compute thanTanhNormal
for the entropy regularizer?The text was updated successfully, but these errors were encountered: