Towards SoftAdapt loss balancing for tf.compat.v1 #1586

Draft
wants to merge 4 commits into master
Conversation

@pescap (Contributor) commented Dec 6, 2023

Work in progress!

Comment on lines 596 to 598
loss_weights = dde.Variable(loss_weights, trainable=False, dtype=loss_weights.dtype)
loss_weights *= 0

@pescap (Contributor, Author) Dec 6, 2023

I am trying to allow loss_weights to be a Variable, so that the loss function updates automatically every time the weights change. Any clue, @lululxvi?

Here, I was trying to set loss_weights to 0. The loss should therefore evaluate to 0 in the following epochs (which is not the case so far).

Shall we define loss_weights differently in model.compile?

Maybe we need to work here in:

deepxde/deepxde/model.py

Lines 169 to 183 in 3b08fe3

def losses(losses_fn):
    # Data losses
    losses = losses_fn(
        self.net.targets, self.net.outputs, loss_fn, self.net.inputs, self
    )
    if not isinstance(losses, list):
        losses = [losses]
    # Regularization loss
    if self.net.regularizer is not None:
        losses.append(tf.losses.get_regularization_loss())
    losses = tf.convert_to_tensor(losses)
    # Weighted losses
    if loss_weights is not None:
        losses *= loss_weights
    return losses

Thank you!
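For what it's worth, a minimal sketch of the idea above, assuming the tensorflow.compat.v1 backend. The names (raw_losses, loss_weights_var) are illustrative stand-ins, not the actual deepxde internals: a non-trainable tf.Variable keeps the weighting inside the graph, so it can be reassigned (e.g. zeroed, as attempted above) through the session without recompiling.

import numpy as np
import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

# Stand-ins for the per-term losses returned by losses_fn above.
raw_losses = tf.constant([0.5, 2.0], dtype=tf.float32)

# Non-trainable variable holding the weights inside the graph.
loss_weights_var = tf.Variable(
    np.ones(2, dtype=np.float32), trainable=False, name="loss_weights"
)

# This would take the place of `losses *= loss_weights` in the closure above.
weighted_losses = raw_losses * loss_weights_var

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print(sess.run(weighted_losses))               # [0.5 2. ]
    sess.run(loss_weights_var.assign([0.0, 0.0]))  # zero the weights mid-run
    print(sess.run(weighted_losses))               # [0. 0.]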

@lululxvi (Owner)

How do you plan to update loss_weights?

@lululxvi (Owner) commented Dec 6, 2023

Implementing this in TensorFlow is tricky, as it uses a static graph. It should be much easier to implement in PyTorch, where you can directly change the loss_weights value.

@pescap (Contributor, Author) commented Dec 6, 2023

Implementing this in TensorFlow is tricky, as it uses a static graph. It should be much easier to implement in PyTorch, where you can directly change the loss_weights value.

Thank you for your feedback. I would really prefer to implement this adaptive loss callback in tensorflow.compat.v1.

I think I'll start with a simple two-term loss (and one weighting parameter).

@haison19952013 commented Feb 1, 2024

Implementing this in TensorFlow is tricky, as it uses a static graph. It should be much easier to implement in PyTorch, where you can directly change the loss_weights value.

  • It can be done if loss_weights is an argument of train_step() (see the sketch below).
  • If not, then every time we change the loss weights iteratively, TensorFlow has to build the graph all over again. In other words, the model will .compile() again and training might be slow.
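One way to read the first bullet, as a standalone tf.compat.v1 sketch (the placeholder and names are illustrative, not the deepxde internals): the weights enter the graph through a placeholder and a fresh value is fed on every step, so the graph never has to be rebuilt.

import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

losses = tf.constant([1.0, 2.0, 3.0])  # stand-in for the per-term losses
loss_weights = tf.placeholder(tf.float32, shape=[3], name="loss_weights")
total_loss = tf.reduce_sum(loss_weights * losses)

with tf.Session() as sess:
    # A new weight vector can be supplied on every call without recompiling.
    print(sess.run(total_loss, feed_dict={loss_weights: [1.0, 0.5, 0.1]}))  # 2.3
    print(sess.run(total_loss, feed_dict={loss_weights: [1.0, 1.0, 1.0]}))  # 6.0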

@pescap (Contributor, Author) commented Feb 1, 2024

Hi, if we define loss_weights as a Variable, there is no need to compile several times, right?

Next, we have to define the total_loss appropriately.

@haison19952013 commented Feb 2, 2024

Hi, if we define loss_weights as a Variable, there is no need to compile several times, right?

Next, we have to define the total_loss appropriately.

total_loss = tf.math.reduce_sum(losses)
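A toy tf.compat.v1 sketch of that combination (illustrative names only): the optimizer is built once on total_loss, and later weight changes go through the variable rather than through a recompile.

import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

w = tf.Variable(0.0)                                   # toy trainable parameter
losses = tf.stack([(w - 1.0) ** 2, (w + 1.0) ** 2])    # two loss terms
loss_weights = tf.Variable([1.0, 1.0], trainable=False)
total_loss = tf.math.reduce_sum(loss_weights * losses)

train_step = tf.train.AdamOptimizer(1e-2).minimize(total_loss, var_list=[w])

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    sess.run(train_step)                               # trains with weights [1, 1]
    sess.run(loss_weights.assign([1.0, 0.0]))          # change the weights mid-training
    sess.run(train_step)                               # same graph, new weights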

@ThomasHuangZA

Hi @pescap @lululxvi @haison19952013

I have recently been working on the adaptive weights, and here is what worked for me (I think).
Take this code with a big grain of salt, but do let me know what you think.
I've adapted it from the code I see here and have used it for my own problem (which has 3 governing-equation losses, 8 boundary losses, and 3 data losses).

import deepxde as dde

class SoftAdapt(dde.callbacks.Callback):
    """Use adaptive loss balancing.

    Args:
        beta: If beta > 0, SoftAdapt pays more attention to the worst-performing
            loss component. If beta < 0, SoftAdapt assigns higher weights to the
            better-performing components. beta == 0 is the trivial case, where
            all loss components get coefficient 1.
        epsilon: Parameter to prevent overflows.
    """

    def __init__(self, beta=0.1, epsilon=1e-8, losshistory=None, lr=0.01):
        super().__init__()
        self.beta = beta          # kept for the general SoftAdapt rule; unused below
        self.epsilon = epsilon    # kept for the general SoftAdapt rule; unused below
        self.losshistory = losshistory
        self.lr = lr              # relaxation factor for the weight updates

    def on_epoch_end(self):
        loss_weights = self.losshistory.loss_weights
        weight_bc = loss_weights[3]
        weight_data = loss_weights[-1]
        current_loss = self.losshistory.loss_train[-1]

        # Loss layout assumed here: 3 PDE terms, 8 BC terms, 3 data terms.
        bc_avg = current_loss[3:11].mean()
        data_avg = current_loss[11:].mean()

        # Relax each group weight towards (max PDE loss) / (group average loss).
        weight_bc = (1 - self.lr) * weight_bc + self.lr * current_loss[0:3].max() / bc_avg
        weight_data = (1 - self.lr) * weight_data + self.lr * current_loss[0:3].max() / data_avg

        loss_weights = [1] * 3 + [weight_bc] * 8 + [weight_data] * 3
        self.losshistory.set_loss_weights(loss_weights)

        print(self.losshistory.loss_weights, "loss_weights")
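If it helps, this is how I imagine the callback being attached (illustrative only; it assumes data and net are already built, and whether the updated weights actually reach the tf.compat.v1 graph is exactly the question raised below):

import deepxde as dde

# ... build `data` and `net` as usual, then:
model = dde.Model(data, net)
model.compile("adam", lr=1e-3, loss_weights=[1] * 3 + [1] * 8 + [1] * 3)

losshistory, train_state = model.train(
    iterations=10000,
    callbacks=[SoftAdapt(losshistory=model.losshistory, lr=0.01)],
)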

@lululxvi (Owner) commented Sep 2, 2024

Which backend have you tested this with?

@ThomasHuangZA

I've used tensorflow.compat.v1 for this code.

@lululxvi (Owner) commented Sep 4, 2024

I don't think the code would change the weights during training. For TF v1, if you don't recompile, it always uses the original computational graph.
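A toy illustration of that point in tf.compat.v1 (names illustrative): weights baked in as Python numbers are frozen into the graph, whereas a tf.Variable assigned through the session does take effect.

import tensorflow.compat.v1 as tf

tf.disable_eager_execution()

losses = tf.constant([1.0, 2.0])

weights_py = [1.0, 1.0]                        # baked into the graph as constants
total_baked = tf.reduce_sum(weights_py * losses)

weights_var = tf.Variable([1.0, 1.0], trainable=False)
total_var = tf.reduce_sum(weights_var * losses)

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    weights_py[1] = 10.0                       # no effect: the graph kept the old constants
    sess.run(weights_var.assign([1.0, 10.0]))  # does change the graph's output
    print(sess.run([total_baked, total_var]))  # [3.0, 21.0]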
