
Adjust reward model's score module and pooler module order for reducing computation #1956

Merged
9 commits merged into sgl-project:main from gemma2-rm on Nov 8, 2024

Conversation

aqweteddy
Contributor

Motivation

Modifications

  • Adjust the order of the pooler and score modules in LlamaForSequenceClassification and Gemma2ForSequenceClassification so that pooling happens before the score projection, reducing computation (see the sketch after this list).
  • Remove redundant model-loading code in the Gemma2 reward model.
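
For illustration, here is a minimal sketch of the reordering idea; the shapes, last-token pooling, and module names below are placeholder assumptions, not the exact sglang code. Applying the score projection after pooling means the linear head runs on one pooled token per sequence instead of on every token.

```python
import torch
from torch import nn

# Placeholder dimensions; the real models use their configured hidden size
# and number of labels.
hidden_size, num_labels = 4096, 1
score = nn.Linear(hidden_size, num_labels, bias=False)

def last_token_pool(hidden_states: torch.Tensor) -> torch.Tensor:
    # Keep only the final token's hidden state for each sequence.
    return hidden_states[:, -1, :]

hidden_states = torch.randn(2, 512, hidden_size)  # (batch, seq_len, hidden)

# Before: project every token's hidden state, then pool -> seq_len projections.
scores_before = last_token_pool(score(hidden_states))

# After: pool first, then project only the pooled token -> one projection per sequence.
scores_after = score(last_token_pool(hidden_states))

# Both orderings give the same scores; only the amount of work differs.
assert torch.allclose(scores_before, scores_after, atol=1e-5)
```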

Checklist

  • [x] Format your code according to the Contributor Guide.
  • [x] Add unit tests as outlined in the Contributor Guide.
  • [x] Update documentation as needed, including docstrings or example tutorials.


def load_weights(self, weights: Iterable[Tuple[str, torch.Tensor]]):
Contributor

can you also simplify the weight loader of LlamaForSequenceClassification?

Contributor Author

Done & verified.
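
For context, one way such a weight loader could be simplified is sketched below; the delegation to self.model.load_weights and the "score." / "lm_head" name prefixes are assumptions about the wrapper class, not necessarily the exact code that was merged.

```python
from typing import Iterable, Tuple
import torch

def load_weights(self, weights: Iterable[Tuple[str, torch.Tensor]]):
    # Sketch: load the classification head directly and hand every other
    # checkpoint tensor to the base model's existing loader.
    params_dict = dict(self.named_parameters())
    base_weights = []
    for name, loaded_weight in weights:
        if name.startswith("score."):
            # Classification head parameters are copied in place.
            params_dict[name].data.copy_(loaded_weight)
        elif not name.startswith("lm_head"):
            # The causal-LM head is not needed for sequence classification,
            # so it is skipped; everything else goes to the base model.
            base_weights.append((name, loaded_weight))
    self.model.load_weights(base_weights)
```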

@merrymercy
Contributor

Can you fix the lint error?

.pre-commit-config.yaml — review thread (outdated, resolved)
merrymercy merged commit 4ade15d into sgl-project:main on Nov 8, 2024
11 of 12 checks passed
@merrymercy
Contributor

@aqweteddy Thanks for the contribution. It is merged.

aqweteddy deleted the gemma2-rm branch on November 8, 2024 at 08:18