Skip to content

Actions: axolotl-ai-cloud/axolotl

Publish Docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
203 workflow runs
203 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

feat: validate sample packing requires flash_attention (#1465)
Publish Docs #28: Commit bf4cd67 pushed by NanoCode012
April 5, 2024 03:47 57s main
April 5, 2024 03:47 57s
add support for cohere chat template (#1478)
Publish Docs #27: Commit 05b0b7e pushed by winglian
April 5, 2024 01:20 1m 3s main
April 5, 2024 01:20 1m 3s
don't use deepspeed or fsdp when merging loras (#1479)
Publish Docs #26: Commit 87ca3f9 pushed by winglian
April 5, 2024 01:20 51s main
April 5, 2024 01:20 51s
refactor utils.data module for line count linter (#1476)
Publish Docs #25: Commit e0fcef4 pushed by winglian
April 4, 2024 23:33 53s main
April 4, 2024 23:33 53s
fix toc
Publish Docs #24: Commit 5760099 pushed by hamelsmu
April 3, 2024 19:05 51s main
April 3, 2024 19:05 51s
Pretrain multipack v2 (#1470)
Publish Docs #23: Commit 5aa5097 pushed by winglian
April 2, 2024 12:42 59s main
April 2, 2024 12:42 59s
April 2, 2024 08:36 52s
fix pretraining_ on odd datasets (#1463)
Publish Docs #21: Commit 586bd8d pushed by winglian
April 2, 2024 03:49 56s main
April 2, 2024 03:49 56s
Reorganize Docs (#1468)
Publish Docs #20: Commit 86b7d22 pushed by hamelsmu
April 1, 2024 15:00 1m 17s main
April 1, 2024 15:00 1m 17s
reduce verbosity of the special tokens (#1472)
Publish Docs #19: Commit 0b10377 pushed by NanoCode012
April 1, 2024 12:47 1m 7s main
April 1, 2024 12:47 1m 7s
feat: add deepspeed 3 with cpuoffload (#1466)
Publish Docs #18: Commit 946b497 pushed by NanoCode012
April 1, 2024 12:42 51s main
April 1, 2024 12:42 51s
LISA (#1469)
Publish Docs #17: Commit 0ddfb24 pushed by winglian
April 1, 2024 11:54 59s main
April 1, 2024 11:54 59s
make sure to install causal_conv1d in docker (#1459)
Publish Docs #16: Commit 89134f2 pushed by winglian
March 29, 2024 20:43 51s main
March 29, 2024 20:43 51s
qwen2_moe support w multipack (#1455)
Publish Docs #15: Commit 6086be8 pushed by winglian
March 29, 2024 15:04 50s main
March 29, 2024 15:04 50s
fix some of the edge cases for Jamba (#1452)
Publish Docs #14: Commit 05b398a pushed by winglian
March 29, 2024 06:38 1m 12s main
March 29, 2024 06:38 1m 12s
Support loading datasets saved via save_to_disk (#1432)
Publish Docs #13: Commit e634118 pushed by winglian
March 29, 2024 04:19 57s main
March 29, 2024 04:19 57s
Jamba (#1451)
Publish Docs #12: Commit 02af082 pushed by winglian
March 29, 2024 01:03 1m 12s main
March 29, 2024 01:03 1m 12s
fix layer_replication arg to peft (#1446)
Publish Docs #11: Commit 4155e99 pushed by winglian
March 27, 2024 14:19 1m 19s main
March 27, 2024 14:19 1m 19s
support layer replication for peft and fix rslora integration (#1445)
Publish Docs #10: Commit 25afd35 pushed by winglian
March 27, 2024 14:16 1m 0s main
March 27, 2024 14:16 1m 0s
fix for accelerate env var for auto bf16, add new base image and expa…
Publish Docs #9: Commit da265dd pushed by winglian
March 26, 2024 20:46 1m 31s main
March 26, 2024 20:46 1m 31s
Remove seq_len arg in rotary_emb (#1443)
Publish Docs #8: Commit e07347b pushed by winglian
March 26, 2024 19:19 53s main
March 26, 2024 19:19 53s
make sure to capture non-null defaults from config validation (#1415)
Publish Docs #7: Commit 601b77b pushed by winglian
March 26, 2024 19:18 1m 0s main
March 26, 2024 19:18 1m 0s
March 25, 2024 06:34 55s
docs: update link to docs of advance topic in README.md (#1437)
Publish Docs #5: Commit 324d59e pushed by winglian
March 25, 2024 04:49 49s main
March 25, 2024 04:49 49s
chore(config): refactor old mistral config (#1435)
Publish Docs #4: Commit f1ebaa0 pushed by NanoCode012
March 25, 2024 03:00 52s main
March 25, 2024 03:00 52s