Feature/support qwenvl glm4-v (tested) #4377

marko1616 · 2024-06-19T12:55:07Z

What does this PR do?

Fixes #4375

Before submitting

Did you read the contributor guideline?
Did you write any new necessary tests?

marko1616 · 2024-06-20T13:22:07Z

终于还差一个image的padding处理就能做好训练支持了。

marko1616 · 2024-06-20T13:27:09Z

@hiyouga 改的比较多捏，有空帮忙看看这个实现思路行不行。谢谢。

marko1616 · 2024-06-20T13:33:55Z

src/llamafactory/chat/hf_engine.py

+            Image.fromarray(image).convert("RGB").save(image_path)
+            messages[-1]["content"] = template.format_image.apply(content=os.fspath(image_path))[0] + messages[-1]["content"]
+        elif image is not None and model_args.visual_inputs_type == "vision_message_embed":
+            messages[-1]["content"] = template.format_image.apply()[0] + messages[-1]["content"]


如果不是内嵌在文本的image url默认放在最后一个的开头（Qwenvl如果不是开头效果不好）

marko1616 · 2024-06-20T13:34:29Z

src/llamafactory/data/loader.py

+        if model_args.visual_inputs_type == "vision_message_embed":
+            dataset = dataset.rename_column("image_inputs","images")
+        print(dataset["images"])
+


dataset.map不能重用删除的column_name

marko1616 · 2024-06-20T13:34:44Z

src/llamafactory/data/processors/processor_utils.py

+                transforms.Normalize((0.48145466, 0.4578275, 0.40821073), (0.26862954, 0.26130258, 0.27577711)),
+            ]
+        )
+        return transform(images[0]) if len(images) != 0 else transform(Image.new("RGB", (1120, 1120), (255, 255, 255)))


数据集加载

marko1616 · 2024-06-20T13:35:29Z

src/llamafactory/model/adapter.py

-        if model_args.visual_inputs and finetuning_args.freeze_vision_tower:
-            target_modules = "^(?!.*vision_tower).*(?:{}).*".format("|".join(target_modules))
+        if model_args.visual_inputs and finetuning_args.freeze_vision:
+            target_modules = f"^(?!.*{VISION_FREEZE_MAP[model_args.visual_inputs_type]})."+"*(?:{}).*".format("|".join(target_modules))


其实还有点小问题，可能会把GLM4的视觉模块附加了

marko1616 · 2024-06-21T08:27:13Z

成功跑了训练。

marko1616 and others added 4 commits June 19, 2024 14:11

Basic support for webui.

fbf19f8

Basic support for GLM4V

95b8a1d

Merge branch 'hiyouga:main' into feature/Support-Qwenvl

61a0880

Pass ruff check.

8044804

hiyouga added the pending This problem is yet to be addressed label Jun 19, 2024

Half of sft support and bug fix.

c58be83

marko1616 commented Jun 20, 2024

View reviewed changes

GLM4v lora sft support

4b01584

Little fix

c233520

marko1616 changed the title Feature/support qwenvl glm4-v *WORKING DO NOT MERGE* Feature/support qwenvl glm4-v (tested) Jun 23, 2024

hiyouga and others added 2 commits June 25, 2024 02:58

Merge branch 'main' into feature/Support-Qwenvl

078c85d

Fix requirements.txt

67542a0

BUAADreamer self-requested a review June 28, 2024 17:05

BUAADreamer and others added 2 commits June 29, 2024 01:45

fix conflict

e6aa967

QwenVL sft & webui train buxfix.

f698b43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/support qwenvl glm4-v (tested) #4377

Feature/support qwenvl glm4-v (tested) #4377

marko1616 commented Jun 19, 2024 •

edited

Loading

marko1616 commented Jun 20, 2024

marko1616 commented Jun 20, 2024

marko1616 Jun 20, 2024

marko1616 Jun 20, 2024

marko1616 Jun 20, 2024

marko1616 Jun 20, 2024

marko1616 commented Jun 21, 2024

Feature/support qwenvl glm4-v (tested) #4377

Are you sure you want to change the base?

Feature/support qwenvl glm4-v (tested) #4377

Conversation

marko1616 commented Jun 19, 2024 • edited Loading

What does this PR do?

Before submitting

marko1616 commented Jun 20, 2024

marko1616 commented Jun 20, 2024

marko1616 Jun 20, 2024

Choose a reason for hiding this comment

marko1616 Jun 20, 2024

Choose a reason for hiding this comment

marko1616 Jun 20, 2024

Choose a reason for hiding this comment

marko1616 Jun 20, 2024

Choose a reason for hiding this comment

marko1616 commented Jun 21, 2024

marko1616 commented Jun 19, 2024 •

edited

Loading