Experiments on Mania's dataset #28

Open · wants to merge 18 commits into base: main
Conversation

dtch1997 (Owner) commented May 25, 2023

Summary of changes (a config sketch illustrating these options follows the list):

  • Make URDF configurable, add Mania's URDF
  • Add discriminator temperature config option
  • Change sigma to be learnable by default (I may end up reverting this)
  • Increase default kP, kD to match simulation
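
For concreteness, here is a minimal sketch of what the configurable surface described above might look like. All field names and default values are hypothetical and not taken from the repo.

```python
# Hypothetical config sketch; field names and defaults are illustrative only
# and do not reflect the repo's actual config schema.
from dataclasses import dataclass


@dataclass
class AMPTrainConfig:
    # URDF is now configurable, so Mania's robot description can be swapped in.
    urdf_path: str = "assets/default.urdf"
    # Temperature applied to the discriminator logit when computing the style reward.
    disc_temperature: float = 1.0
    # Whether the policy's Gaussian sigma is a learnable parameter (new default).
    learn_sigma: bool = True
    # PD gains raised to match the values used in simulation.
    kp: float = 100.0
    kd: float = 2.0
```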

dtch1997 (Owner, Author) commented May 26, 2023

According to VIPER (https://arxiv.org/abs/2305.14343), simply training a likelihood-based model instead of an adversarial one may improve performance.
What that would look like for our repo: instead of predicting a binary logit for real/fake, predict a Gaussian distribution and maximize its likelihood on dataset examples.
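
Concretely, that would mean replacing the binary discriminator with a density model trained only on demonstration transitions and using its log-likelihood as the style reward. A minimal PyTorch sketch, assuming a next-observation density model; the class and method names are hypothetical and not taken from the repo or from VIPER:

```python
# Sketch of the likelihood-based alternative: fit a Gaussian density model to
# demonstration transitions and use its log-likelihood as the style reward.
# All names here are hypothetical, not the repo's actual modules.
import torch
import torch.nn as nn


class GaussianRewardModel(nn.Module):
    def __init__(self, obs_dim: int, hidden_dim: int = 256):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Linear(obs_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim), nn.ReLU(),
        )
        self.mean_head = nn.Linear(hidden_dim, obs_dim)
        self.log_std_head = nn.Linear(hidden_dim, obs_dim)

    def log_prob(self, obs: torch.Tensor, next_obs: torch.Tensor) -> torch.Tensor:
        # Model p(next_obs | obs) as a diagonal Gaussian.
        h = self.backbone(obs)
        mean = self.mean_head(h)
        std = self.log_std_head(h).clamp(-5.0, 2.0).exp()
        dist = torch.distributions.Normal(mean, std)
        return dist.log_prob(next_obs).sum(dim=-1)

    def training_loss(self, demo_obs: torch.Tensor, demo_next_obs: torch.Tensor) -> torch.Tensor:
        # Maximize likelihood on dataset examples: no adversarial game, no fake batch.
        return -self.log_prob(demo_obs, demo_next_obs).mean()

    def reward(self, obs: torch.Tensor, next_obs: torch.Tensor) -> torch.Tensor:
        # Use the (detached) log-likelihood of the agent's transition as its style reward.
        return self.log_prob(obs, next_obs).detach()
```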

dtch1997 (Owner, Author) commented May 28, 2023

The results of the experiment are actually somewhat confusing. Refer to this run: the demo logit is high (about +5) while the agent logit is low (about -15); we would expect the agent to receive essentially no reward from this, and yet the average reward is quite good.

Edit: I think I understand why now: the rewards/frame metric is actually reporting the task reward, not the AMP reward, so the two don't line up.

In this light, the reason the policy isn't learning may be that the reward provided by the discriminator is 'too sparse', so we should actually decrease the temperature.
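
To make the sparsity point concrete, here is a rough sketch of how a temperature on the discriminator logit reshapes an AMP-style reward. The reward formula and the direction in which the temperature is applied (dividing vs. multiplying the logit) are assumptions rather than the repo's actual convention, so only the qualitative saturation effect matters here:

```python
# Assumed AMP-style reward r = -log(1 - sigmoid(logit / T)); the repo's actual
# convention may differ (e.g. multiplying by T instead of dividing), in which
# case "decrease the temperature" is the direction that flattens the sigmoid.
import torch


def amp_style_reward(logit: torch.Tensor, temperature: float = 1.0) -> torch.Tensor:
    prob = torch.sigmoid(logit / temperature)
    # Clamp to avoid log(0) when the discriminator is very confident.
    return -torch.log((1.0 - prob).clamp(min=1e-6))


# With an agent logit around -15, the untempered reward is ~3e-7, i.e. essentially
# zero everywhere -- the "too sparse" regime described above. Rescaling the logit
# so the sigmoid is flatter gives the agent a usable reward signal even when the
# discriminator cleanly separates it from the demos.
agent_logit = torch.tensor([-15.0])
for temperature in (1.0, 5.0, 15.0):
    print(temperature, amp_style_reward(agent_logit, temperature=temperature).item())
```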
