Add a new tp overlap cfg for tp2 llama2 70B (#435)
* Add a new tp overlap cfg for tp2 llama2 70B

Signed-off-by: Guyue Huang <[email protected]>

* Fix a name in comment

Signed-off-by: Guyue Huang <[email protected]>

---------

Signed-off-by: Guyue Huang <[email protected]>
Co-authored-by: Guyue Huang <[email protected]>
Showing 1 changed file with 55 additions and 0 deletions.
launcher_scripts/conf/training/tp_overlap/ub_cfg_h100_h8192_tp2_mbs1_seqlen4096.yaml
@@ -0,0 +1,55 @@
+# UB communicator configurations
+# Model configs: H100/70B/TP2/MBS1/SeqLen4K/FP8
+
+# Bulk overlap with AllGather / ReduceScatter
+qkv_dgrad:
+  method: bulk
+  num_sm: 8
+  cga_size: 2
+  set_sm_margin: 0
+
+qkv_wgrad:
+  method: bulk
+  num_sm: 32
+  cga_size: 2
+  set_sm_margin: 0
+
+fc1_dgrad:
+  method: bulk
+  num_sm: 2
+  cga_size: 2
+  set_sm_margin: 0
+
+fc1_wgrad:
+  method: bulk
+  num_sm: 8
+  cga_size: 2
+  set_sm_margin: 0
+
+## Ring-exchange overlap with AllGather
+qkv_fprop:
+  method: ring_exchange
+  aggregate: 0
+
+proj_dgrad:
+  method: ring_exchange
+  aggregate: 0
+
+fc1_fprop:
+  method: ring_exchange
+  aggregate: 0
+
+fc2_dgrad:
+  method: ring_exchange
+  aggregate: 0
+
+proj_fprop:
+  method: ring_exchange
+  num_sm: 1
+  set_sm_margin: 1
+
+fc2_fprop:
+  method: ring_exchange
+  num_sm: 1
+  set_sm_margin: 1
+
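For reviewers who want to sanity-check a config like this one, here is a minimal sketch that groups the per-GEMM entries by overlap method. It is not how the launcher or Transformer Engine actually loads the file. The helper names `parse_ub_cfg` and `group_by_method` are hypothetical, the parser assumes only the two-level `section:` / `key: value` layout seen in this file, and the embedded text is a two-entry excerpt rather than the full config.

```python
# Sketch only: a tiny stdlib parser for the two-level layout used by this file.
# A real loader would use a YAML library; these helper names are hypothetical.

# Two entries excerpted from the new config, one per overlap method.
CFG_TEXT = """\
# Bulk overlap with AllGather / ReduceScatter
qkv_dgrad:
  method: bulk
  num_sm: 8
  cga_size: 2
  set_sm_margin: 0

proj_fprop:
  method: ring_exchange
  num_sm: 1
  set_sm_margin: 1
"""


def parse_ub_cfg(text):
    """Parse unindented 'section:' headers followed by indented 'key: value' lines."""
    cfg, current = {}, None
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].rstrip()  # drop comments and trailing spaces
        if not line.strip():
            continue  # skip blank and comment-only lines
        key, _, value = line.strip().partition(":")
        value = value.strip()
        if not raw.startswith(" "):
            # Unindented line: start a new section such as 'qkv_dgrad'.
            current = cfg.setdefault(key, {})
        else:
            if current is None:
                raise ValueError(f"key outside any section: {raw!r}")
            # Store integers as int, everything else as a plain string.
            current[key] = int(value) if value.lstrip("-").isdigit() else value
    return cfg


def group_by_method(cfg):
    """Map each overlap method to the list of sections that use it."""
    groups = {}
    for name, opts in cfg.items():
        groups.setdefault(opts["method"], []).append(name)
    return groups


cfg = parse_ub_cfg(CFG_TEXT)
by_method = group_by_method(cfg)
print(by_method)
```

Running the same two helpers over the full 55-line file would give a quick view of which GEMMs use `bulk` and which use `ring_exchange`, which can help when comparing this TP2 config against other tp_overlap configs.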