You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+4-2Lines changed: 4 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,12 +24,14 @@ You can use it like so: `--model gpt2-xl`
24
24
### Train - Run, run, run...
25
25
26
26
Train for X epochs using input.model and save to output.model Then train again for more epochs until coherent.
27
-
`out/input.model` and `out/output.model` can be the same model to resume and continue training.
27
+
--model `out/output.model` and `out/output.model` should be the same model to resume and continue training.
28
+
29
+
If you wish to save a newly trained model to a new `out/output2.model/` path you should copy the tokenized output from the first step into the new output directory
28
30
29
31
Note: change `--batch-size` for smaller/larger GPUs, default is 4.
30
32
31
33
```bash
32
-
python train.py --model [out/input.model] --epochs X [out/output.model]
34
+
python train.py --model [out/output.model] --epochs X [out/output.model]
0 commit comments