feature(wrh): add harmony dream in unizero #255

ruiheng123 · 2024-07-31T09:40:31Z

Add HarmonyDream to balance the loss weights among observation, reward, policy, and value.
Paper reference: https://arxiv.org/abs/2310.00344
Code reference: https://github.com/thuml/HarmonyDream/blob/main/wmlib-torch/wmlib/agents/dreamerv2.py#L117

puyuan1996 · 2024-08-05T13:22:25Z

lzero/model/unizero_world_models/utils.py

@@ -164,19 +176,60 @@ def __init__(self, latent_recon_loss_weight=0, perceptual_loss_weight=0, **kwarg
        self.loss_total = torch.tensor(0., device=device)
        for k, v in kwargs.items():


        # Define a dictionary for loss weights and harmony_s variables
        loss_weights = {
            'loss_obs': (self.obs_loss_weight, 'loss_obs_harmony_s'),
            'loss_rewards': (self.reward_loss_weight, 'loss_rewards_harmony_s'),
            'loss_policy': (self.policy_loss_weight, 'loss_policy_harmony_s'),
            'loss_value': (self.value_loss_weight, 'loss_value_harmony_s'),
            'loss_ends': (self.ends_loss_weight, 'loss_ends_harmony_s'),
            'latent_recon_loss': (self.latent_recon_loss_weight, 'latent_recon_loss_harmony_s'),
            'perceptual_loss': (self.perceptual_loss_weight, 'perceptual_loss_harmony_s')
        }

        # Iterate through kwargs to process the losses
        for k, v in kwargs.items():
            if k in loss_weights:
                weight, harmony_var_name = loss_weights[k]
                harmony_s = globals().get(harmony_var_name)  # Get the harmony_s variable by name
                if harmony_s_dict is None:
                    self.loss_total += weight * v
                elif harmony_s is not None:
                    self.loss_total += (v / torch.exp(harmony_s)) + torch.log(torch.exp(harmony_s) + 1)
                else:
                    self.loss_total += weight * v
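
The per-term weighting in the suggestion above, `v / exp(s) + log(exp(s) + 1)`, can be checked by hand with plain floats. The following is only a minimal sketch under stated assumptions: the function name `harmonized_loss`, the example loss values, and passing the `s` values in a dictionary are hypothetical and not taken from the PR, where `s` would be a learnable tensor per loss term rather than a fixed float.

```python
import math


def harmonized_loss(losses, harmony_s):
    """Sum loss / exp(s) + log(exp(s) + 1) over the named loss terms.

    `losses` and `harmony_s` are plain dicts of floats keyed by loss name
    (hypothetical inputs for illustration only).
    """
    total = 0.0
    for name, loss in losses.items():
        sigma = math.exp(harmony_s[name])  # scale for this loss term
        # Dividing by sigma shrinks large losses; the log term keeps
        # sigma from growing without bound.
        total += loss / sigma + math.log(sigma + 1.0)
    return total


# Made-up loss magnitudes, with every s initialised to 0 (so sigma = 1).
losses = {"loss_obs": 4.0, "loss_rewards": 0.5, "loss_policy": 1.0, "loss_value": 2.0}
harmony_s = {name: 0.0 for name in losses}
total = harmonized_loss(losses, harmony_s)
```

With every `s` at 0, each term reduces to `loss + log(2)`, so the total is just the unweighted sum plus a constant. Raising `s` for a dominant term reduces its contribution, which is the balancing effect the PR description refers to.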

puyuan1996 · 2024-08-05T13:22:46Z

lzero/policy/unizero.py

        )
-
+
+        # else:


ruiheng123 added 3 commits July 31, 2024 09:00

feature(wrh): add harmony dream in unizero

78967ed

feature(wrh): add harmony dream in unizero

98cd24a

feature(wrh): add harmony dream in unizero

f46efd1

puyuan1996 added the enhancement label (New feature or request) Aug 5, 2024

ruiheng123 added 2 commits August 5, 2024 13:08

feature(wrh): add harmony dream in unizero

ad1d215

feature(wrh): add harmony dream in unizero

6e27ddd

puyuan1996 reviewed Aug 5, 2024

lzero/policy/unizero.py Outdated

)

# else:

puyuan1996 Aug 5, 2024

Delete this.

feature(wrh): add harmony dream in unizero

2f751ab
