Skip to content

Commit

Permalink
Add test sweep for mlp parallelization
Browse files Browse the repository at this point in the history
  • Loading branch information
gkielian committed Apr 9, 2024
1 parent 8a3be39 commit 6b4c05c
Showing 1 changed file with 20 additions and 0 deletions.
20 changes: 20 additions & 0 deletions explorations/mlp_par_sweep.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
[
{
"max_iters": ["3500"],
"n_layer": ["6"],
"n_head": ["6"],
"n_embd": ["384"],
"block_size":["256"],
"use_post_ln": [true, false],
"use_parallel_mlp": [true, false],
"device": ["cuda"],
"dtype": ["bfloat16"],
"dataset": ["shakespeare_char"],
"use_rotary_embeddings": [false],
"use_abs_pos_embeddings": [true],
"compile": [true],
"softmax_variant_attn": ["softmax", "polymax", "saturatingconsmax"],
"tensorboard_run_name": ["mlp_parallelization"]
}
]

0 comments on commit 6b4c05c

Please sign in to comment.