
Merge LoRA Qwen-MoE adapter_model.pt into base model weights #2920

@iqiancheng

Description

model:
  _component_: torchtune.models.qwen3.lora_qwen3_moe_30b_a3b_instruct
  lora_attn_modules: ['q_proj', 'k_proj', 'v_proj', 'output_proj', 'w1', 'w2', 'w3', 'output']
  apply_lora_to_mlp: True
  apply_lora_to_output: True

With the config above, the LoRA-trained layers extend from the attention projections ['q_proj', 'v_proj', 'output_proj'] to every linear layer ['q_proj', 'k_proj', 'v_proj', 'output_proj', 'w1', 'w2', 'w3', 'output']. How do I merge the resulting adapter_model.pt into the base model weights? A sketch of a manual merge follows.
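For a standard linear layer the merge is just W' = W + (alpha / rank) * B @ A for each LoRA pair, so one route is to apply that directly across the two state dicts; recent torchtune versions also ship a helper for this (torchtune.modules.peft.get_merged_lora_ckpt). Below is a minimal sketch of the manual route. The file names are placeholders, RANK/ALPHA must match the lora_rank/lora_alpha used in training, and it assumes both checkpoints are single unsharded .pt state dicts whose module key prefixes line up; none of that is confirmed by this issue.

import torch

# --- Hypothetical paths and hyperparameters (adjust to your run) ---
BASE_CKPT = "base_model.pt"        # full base-model state dict
ADAPTER_CKPT = "adapter_model.pt"  # LoRA-only state dict saved by training
RANK, ALPHA = 16, 32               # must match lora_rank / lora_alpha in your config

base_sd = torch.load(BASE_CKPT, map_location="cpu")
adapter_sd = torch.load(ADAPTER_CKPT, map_location="cpu")

scale = ALPHA / RANK
for key in adapter_sd:
    if not key.endswith("lora_a.weight"):
        continue
    # torchtune names the LoRA factors <module>.lora_a.weight / <module>.lora_b.weight,
    # and the frozen base weight lives at <module>.weight.
    prefix = key[: -len("lora_a.weight")]
    lora_a = adapter_sd[key]                       # shape [rank, in_dim]
    lora_b = adapter_sd[prefix + "lora_b.weight"]  # shape [out_dim, rank]
    base_key = prefix + "weight"
    if base_sd[base_key].dim() != 2:
        # Grouped MoE expert weights (w1/w2/w3) may be stored as 3D
        # [num_experts, ...] tensors; those need a per-expert merge,
        # which this sketch does not attempt.
        raise NotImplementedError(f"non-2D weight at {base_key}")
    # Merged weight: W' = W + (alpha / rank) * B @ A
    delta = scale * (lora_b @ lora_a)
    base_sd[base_key] = base_sd[base_key] + delta.to(base_sd[base_key].dtype)

torch.save(base_sd, "merged_model.pt")

If the expert layers are indeed stored as grouped 3D tensors in the Qwen3-MoE checkpoint, the same W + (alpha / rank) * B @ A update applies per expert slice; checking how get_merged_lora_ckpt handles the MoE modules in your torchtune version would be the safer path.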
