I've found what I believe to be a bug in the implementation of the fine-tuning baseline, which would yield incorrect results when the target is longer than one token.
Looking at the code, the fine-tuning baseline appears to obtain the logits on which to backpropagate by calling `model(**inputs)`, where `inputs` contains the prompt with the subject but excludes the target. It then takes the logits associated with the last token of the input and maximises the probability of all the target tokens simultaneously, each treated as a direct continuation of that single last position. This is not the standard fine-tuning behaviour, which would maximise the probability of the first target token as a continuation of the input, then the probability of the second target token as a continuation of the input plus the first target token, and so on.
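To make the difference concrete, here is a minimal sketch of the two loss computations using a toy stand-in for the language model (the embedding, head, and token IDs below are illustrative assumptions, not the repository's actual code):

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
vocab, d = 50, 16

# Toy stand-in for a language model: maps token ids to per-position logits.
emb = torch.nn.Embedding(vocab, d)
head = torch.nn.Linear(d, vocab)

def lm_logits(ids):
    # ids: (seq,) -> (seq, vocab); stands in for model(**inputs).logits
    return head(emb(ids))

prompt = torch.tensor([3, 7, 11])  # hypothetical tokenised prompt (with subject)
target = torch.tensor([20, 21])    # hypothetical multi-token target

# Behaviour as described in the issue: run the model on the prompt only,
# then score *every* target token against the single distribution at the
# last prompt position, as if they were simultaneous continuations.
last = lm_logits(prompt)[-1]                                # (vocab,)
simultaneous_loss = F.cross_entropy(last.expand(len(target), -1), target)

# Standard fine-tuning: teacher forcing over prompt + target, so the k-th
# target token is predicted from everything that precedes it.
full = lm_logits(torch.cat([prompt, target]))               # (seq, vocab)
# positions len(prompt)-1 .. len(full)-2 predict the target tokens
pred = full[len(prompt) - 1 : len(full) - 1]                # (len(target), vocab)
teacher_forced_loss = F.cross_entropy(pred, target)
```

Note that the loss on the first target token is identical in both variants; the two only diverge from the second target token onwards, which is why a one-token target masks the problem.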
Thank you for your assistance; I look forward to hearing back and to understanding whether I may have misunderstood an aspect of the implementation.