Experiments on cheetah-dir and ant-dir #2

Open
Lagrant opened this issue Feb 2, 2023 · 2 comments

Comments

@Lagrant

Lagrant commented Feb 2, 2023

Hi,

I've tried normalizing the environments, revising the reward functions, and upgrading and downgrading MuJoCo versions, but I still cannot reproduce the ant-dir performance reported in your paper. The average return just fluctuates at a very low level, around 10. In addition, experiments on cheetah-dir reach an average training return of 1300 but an average testing return of -1400, which never happens on the other environments. There seems to be something wrong with the environment. Could you also check it out?
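
One way to narrow down a train/test gap like the cheetah-dir one is to break the average return down by task direction, since a change of sign between training and testing may point at how the test tasks are constructed. The sketch below assumes each evaluation episode is logged as a `(direction, episode_return)` pair with `direction` equal to `+1` or `-1`. That log format, the function name `mean_return_by_direction`, and the numbers are hypothetical and not taken from the repository.

```python
from collections import defaultdict
from statistics import mean


def mean_return_by_direction(episodes):
    """Average episode return per task direction.

    `episodes` is an iterable of (direction, episode_return) pairs.
    This log format is an assumption made for illustration.
    """
    grouped = defaultdict(list)
    for direction, episode_return in episodes:
        grouped[direction].append(float(episode_return))
    return {direction: mean(values) for direction, values in grouped.items()}


# Placeholder values only, not measured results.
train_log = [(1, 10.0), (1, 12.0), (-1, 9.0)]
test_log = [(1, 11.0), (-1, -8.0), (-1, -10.0)]

print("train:", mean_return_by_direction(train_log))
print("test: ", mean_return_by_direction(test_log))
```

If the per-direction averages differ in sign between the two logs, the task sampling or reward sign used at test time would be the first thing to inspect.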

My experiment logs, models, and configurations have been uploaded as a Google file for your reference.

Any reply would be much appreciated!

@nanzhu2003

Hello, I tried hopper_param and have not gotten good results yet. Also, may I ask which model you loaded when running test_ood_context.py? When I run this script, it always reports that the input size is incompatible with the parameters I trained through train_offpolicy_with_trained_encoder. Thanks a lot!
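
An incompatible-size error at load time usually means the checkpoint was saved from a network built with different layer sizes than the one constructed at test time. Below is a minimal sketch for locating the offending layers, assuming both sides have been reduced to `{parameter_name: shape}` dictionaries (if the project uses PyTorch, `{k: tuple(v.shape) for k, v in state_dict.items()}` produces one). The helper name `find_shape_mismatches` and the example keys and sizes are hypothetical.

```python
def find_shape_mismatches(checkpoint_shapes, model_shapes):
    """Compare two {name: shape} mappings and report the differences."""
    report = {"mismatched": {}, "missing_in_model": [], "missing_in_checkpoint": []}
    for name, ckpt_shape in checkpoint_shapes.items():
        if name not in model_shapes:
            report["missing_in_model"].append(name)
        elif tuple(model_shapes[name]) != tuple(ckpt_shape):
            # Store (shape in checkpoint, shape expected by the model).
            report["mismatched"][name] = (tuple(ckpt_shape), tuple(model_shapes[name]))
    for name in model_shapes:
        if name not in checkpoint_shapes:
            report["missing_in_checkpoint"].append(name)
    return report


# Hypothetical parameter names and sizes, for illustration only.
checkpoint = {"encoder.fc1.weight": (64, 27), "encoder.fc1.bias": (64,)}
model = {"encoder.fc1.weight": (64, 20), "encoder.fc1.bias": (64,)}

print(find_shape_mismatches(checkpoint, model))
```

A mismatch in the first layer's input dimension would suggest the test script builds the network with a different observation or context size than the training script used.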

@Foo1szz

Foo1szz commented Apr 7, 2024

I have the same problem. When I ran experiments on ant-dir, the average return also fluctuated at a very low level, around 10. Could the authors offer some advice?
