Skip to content

Commit

Permalink
Update crash step in ft launcher test
Browse files Browse the repository at this point in the history
Signed-off-by: Shriya Palsamudram <[email protected]>
  • Loading branch information
ShriyaPalsamudram committed Dec 3, 2024
1 parent fa6a773 commit 6deffa9
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion .github/workflows/cicd-main.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3934,7 +3934,7 @@ jobs:
export FAULT_TOL_FINISHED_FLAG_FILE="/tmp/llm_tests/llama_pretrain_results/sample_job_finished_flag"; \
python tests/collections/llm/test_fault_nvrx.py \
--devices=2 \
--crash-step=4 \
--crash-step=16 \
--experiment-dir=/tmp/llm_tests/llama_pretrain_results \
--data-path=/home/TestData/nlp/megatron_llama/data/rp2_sample_sentencepiece_preproc_text_document \
--tokenizer-path=/home/TestData/nlp/megatron_llama/tokenizer.model \
Expand Down

0 comments on commit 6deffa9

Please sign in to comment.