[Bug]: UNIMO模型的resize_token_embeddings方法不会修改decoder的vocab_size，导致报错 #8651

JasonCZH4 · 2024-06-24T08:57:48Z

软件环境

- paddlepaddle: 2.5.2
- paddlepaddle-gpu: 2.5.2
- paddlenlp: 2.8.0

重复问题

I have searched the existing issues

错误描述

UNIMO模型的resize_token_embeddings方法不会修改decoder的vocab_size，导致input_embeddings_size和output_embeddings_size没法对齐

稳定复现步骤 & 代码

tokenizer = UNIMOTokenizer.from_pretrained('./unimo-text-1.0-large')
model.resize_token_embeddings(len(tokenizer)) 
print(model.get_input_embeddings().weight.shape, model.lm_head.weight.shape)

The text was updated successfully, but these errors were encountered:

JasonCZH4 · 2024-06-24T08:59:34Z

GPT2模型也有类似问题，但是他已经被修复了，参考link，我使用类似方法修改unimo/modeling.py后可以修复，后续会提个PR。

github-actions · 2024-08-24T00:17:33Z

This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动，被标记为stale。

github-actions · 2024-09-07T00:18:35Z

This issue was closed because it has been inactive for 14 days since being marked as stale. 当前issue 被标记为stale已有14天，即将关闭。

JasonCZH4 added the bug Something isn't working label Jun 24, 2024

paddle-bot bot assigned lugimzzz Jun 24, 2024

JasonCZH4 mentioned this issue Jun 24, 2024

fix unimo bug #8653

Open

github-actions bot added the stale label Aug 24, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Sep 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: UNIMO模型的resize_token_embeddings方法不会修改decoder的vocab_size，导致报错 #8651

[Bug]: UNIMO模型的resize_token_embeddings方法不会修改decoder的vocab_size，导致报错 #8651

JasonCZH4 commented Jun 24, 2024

JasonCZH4 commented Jun 24, 2024

github-actions bot commented Aug 24, 2024

github-actions bot commented Sep 7, 2024

[Bug]: UNIMO模型的resize_token_embeddings方法不会修改decoder的vocab_size，导致报错 #8651

[Bug]: UNIMO模型的resize_token_embeddings方法不会修改decoder的vocab_size，导致报错 #8651

Comments

JasonCZH4 commented Jun 24, 2024

软件环境

重复问题

错误描述

稳定复现步骤 & 代码

JasonCZH4 commented Jun 24, 2024

github-actions bot commented Aug 24, 2024

github-actions bot commented Sep 7, 2024