Skip to content

Commit

Permalink
Add a new tp overlap cfg for tp2 llama2 70B (#435)
Browse files Browse the repository at this point in the history
* Add a new tp overlap cfg for tp2 llama2 70B

Signed-off-by: Guyue Huang <[email protected]>

* Fix a name in comment

Signed-off-by: Guyue Huang <[email protected]>

---------

Signed-off-by: Guyue Huang <[email protected]>
Co-authored-by: Guyue Huang <[email protected]>
  • Loading branch information
guyueh1 and Guyue Huang authored Oct 3, 2024
1 parent 35e4393 commit 65c1b74
Showing 1 changed file with 55 additions and 0 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,55 @@
# UB communicator configurations
# Model configs: H100/70B/TP2/MBS1/SeqLen4K/FP8

# Bulk overlap with AllGather / ReduceScatter
qkv_dgrad:
method: bulk
num_sm: 8
cga_size: 2
set_sm_margin: 0

qkv_wgrad:
method: bulk
num_sm: 32
cga_size: 2
set_sm_margin: 0

fc1_dgrad:
method: bulk
num_sm: 2
cga_size: 2
set_sm_margin: 0

fc1_wgrad:
method: bulk
num_sm: 8
cga_size: 2
set_sm_margin: 0

## Ring-exchange overlap with AllGather
qkv_fprop:
method: ring_exchange
aggregate: 0

proj_dgrad:
method: ring_exchange
aggregate: 0

fc1_fprop:
method: ring_exchange
aggregate: 0

fc2_dgrad:
method: ring_exchange
aggregate: 0

proj_fprop:
method: ring_exchange
num_sm: 1
set_sm_margin: 1

fc2_fprop:
method: ring_exchange
num_sm: 1
set_sm_margin: 1

0 comments on commit 65c1b74

Please sign in to comment.