Add CFormer adapter and input_kl loss #93

Draft: wants to merge 127 commits into base: main

127 commits
9fe560b  Update  (Sep 6, 2024)
7e08492  Update  (Sep 6, 2024)
5c3c446  Update  (Sep 6, 2024)
094700e  Update  (Sep 6, 2024)
32053b9  Update  (Sep 6, 2024)
caaa0e7  Update  (Sep 6, 2024)
0979ec1  Update  (Sep 6, 2024)
42c4417  Update  (Sep 6, 2024)
37a1680  Update  (Sep 6, 2024)
697dbbd  Update  (Sep 6, 2024)
16bf95a  Update  (Sep 6, 2024)
025aa61  Update  (Sep 6, 2024)
efab072  Update  (Sep 6, 2024)
2d37350  Update  (Sep 6, 2024)
a0bfa65  Update  (Sep 7, 2024)
48eb79d  Update  (Sep 11, 2024)
49dd14a  Update  (Sep 11, 2024)
3eb5a1b  Update  (Sep 11, 2024)
b1428a1  Update  (Sep 11, 2024)
d996cff  Update  (Sep 11, 2024)
37d208d  Update  (Sep 11, 2024)
587a8b8  Update  (Sep 11, 2024)
8aa9819  Update  (Sep 11, 2024)
5d03d9c  Update  (Sep 11, 2024)
7b52dab  Update  (Sep 12, 2024)
4429811  Update  (Sep 12, 2024)
ec7857a  Update  (Sep 12, 2024)
6dcb51b  Rebase  (Aug 28, 2024)
c2824c2  Update  (Aug 22, 2024)
c4bb276  Update  (Aug 22, 2024)
4e09457  Update  (Aug 22, 2024)
0a795aa  Update  (Aug 22, 2024)
0ede13e  Update  (Aug 28, 2024)
cdc592b  Update  (Aug 28, 2024)
b1f0009  Update  (Aug 28, 2024)
1d412e4  output loss  (Aug 28, 2024)
3a7f1e1  Update  (Aug 29, 2024)
2496af8  Update  (Aug 29, 2024)
ac0ecae  Update  (Aug 29, 2024)
1fdd8f0  Update  (Aug 29, 2024)
31cb31f  Update  (Aug 29, 2024)
78bbda8  Update  (Aug 29, 2024)
8ed68f3  Update  (Aug 30, 2024)
c9f44f6  Update  (Sep 1, 2024)
e98389e  Update  (Sep 1, 2024)
2d3777d  Update  (Sep 2, 2024)
f60745f  Update  (Sep 5, 2024)
4b531d6  Update  (Sep 5, 2024)
3dd8d76  Update  (Sep 5, 2024)
662a1b8  Update  (Sep 5, 2024)
75d8c94  Update  (Sep 5, 2024)
b31a894  Update  (Sep 5, 2024)
16605b1  Update  (Sep 5, 2024)
df55511  Update  (Sep 5, 2024)
27364a9  Update  (Sep 5, 2024)
fcd9e64  Update  (Sep 5, 2024)
1d0c528  Update  (Sep 5, 2024)
a21161e  Update  (Sep 5, 2024)
412a60e  Update  (Sep 5, 2024)
a1e3be8  Update  (Sep 5, 2024)
c5afef0  Update  (Sep 5, 2024)
c2f6cb5  Update  (Sep 6, 2024)
5142756  Update  (Sep 6, 2024)
c408412  Update  (Sep 6, 2024)
d996024  Update  (Sep 6, 2024)
77756ef  Update  (Sep 6, 2024)
f7cc41a  Update  (Sep 6, 2024)
ddc85d8  Update  (Sep 6, 2024)
06bc0d5  Update  (Sep 13, 2024)
fa45043  Update  (Sep 13, 2024)
34b1606  Update  (Sep 13, 2024)
9e07d28  Update  (Sep 13, 2024)
51d398f  Update  (Sep 13, 2024)
d96e01f  Update  (Sep 13, 2024)
a42d02c  Update  (Sep 13, 2024)
5085046  Update  (Sep 13, 2024)
d357790  Update  (Sep 13, 2024)
407bfc5  Update  (Sep 13, 2024)
c5c5412  Update  (Sep 13, 2024)
2294b68  Update  (Sep 13, 2024)
28816f0  Update  (Sep 13, 2024)
6d6da81  Update  (Sep 13, 2024)
06e62cc  Update  (Sep 13, 2024)
51545d0  Update  (Sep 13, 2024)
04504bd  Update  (Sep 13, 2024)
e3b51d8  Update  (Sep 14, 2024)
989f8fd  Update  (Sep 14, 2024)
713780e  Update  (Sep 14, 2024)
a6c83e6  Update  (Sep 14, 2024)
b1bd1f9  Update  (Sep 14, 2024)
7e7a744  Update  (Sep 14, 2024)
ac1f520  Update  (Sep 15, 2024)
cf7cef2  Update  (Sep 15, 2024)
be1c619  Update  (Sep 16, 2024)
7cbbe5a  Update  (Sep 16, 2024)
43366ea  Update  (Sep 16, 2024)
e08a3e1  Update  (Sep 16, 2024)
cbedfca  Update  (Sep 16, 2024)
5163a64  Update  (Sep 16, 2024)
d7748d7  Update  (Sep 16, 2024)
994fdbf  Update  (Sep 17, 2024)
626f368  Update  (Sep 17, 2024)
275e2c3  Update  (Sep 17, 2024)
7c2c08c  Update  (Sep 17, 2024)
42a982c  Update  (Sep 17, 2024)
4e7d012  Update  (Sep 17, 2024)
967f22f  Update  (Sep 17, 2024)
fddbd80  Update  (Sep 17, 2024)
b8dde1f  Update  (Sep 17, 2024)
8179654  Update  (Sep 17, 2024)
7dd453a  Update  (Sep 17, 2024)
ad3b607  Update  (Sep 17, 2024)
f5950d5  UPdate  (Sep 17, 2024)
d6257f4  Update  (Sep 18, 2024)
59fb439  Update  (Sep 18, 2024)
064942c  Update  (Sep 19, 2024)
f489ff0  Update  (Sep 19, 2024)
350d42c  Update  (Sep 19, 2024)
ce87ef7  Update  (Sep 19, 2024)
c81b814  Update  (Sep 19, 2024)
32b4e8e  Update  (Sep 19, 2024)
6fd5285  Update  (Sep 19, 2024)
03ac67e  Update  (Sep 19, 2024)
7dcb52b  Update  (Sep 19, 2024)
c59c1f1  Update  (Sep 19, 2024)
2fe6bbe  Update  (Sep 19, 2024)
d6ee889  Update  (Sep 19, 2024)
1 change: 1 addition & 0 deletions .gitignore
@@ -190,4 +190,5 @@ cython_debug/
mds_output/
mlruns/
output/
.run_configs/

17 changes: 0 additions & 17 deletions mcloud.yaml

This file was deleted.

20 changes: 20 additions & 0 deletions mcloud_eval.yaml
@@ -0,0 +1,20 @@
# Ultravox evaluation configuration
name: ultravox
image: mosaicml/composer:latest
compute:
  gpus: 8
  cluster: r14z3p1
integrations:
  - integration_type: git_repo
    git_repo: fixie-ai/ultravox-expts
    git_branch: $UV_BRANCH
    pip_install: poetry==1.7.1
scheduling:
  max_duration: 2 # 2 hours max for jobs to avoid hanging jobs
command: >-
  cd ultravox-expts && poetry install --no-dev && poetry run torchrun --nproc_per_node=8 -m ultravox.tools.eval_tool $EVAL_ARGS --exp_name $UV_BRANCH
env_variables:
  MLFLOW_TRACKING_URI: databricks
  UV_BRANCH: zhuang.2024-08-12-ultravox.batch_infer_1a
  EVAL_ARGS: --config_path ultravox/evaluation/configs/eval_config_2k.yaml
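
For reviewers tracing what this job ends up running: the `command` string depends on `$EVAL_ARGS` and `$UV_BRANCH` being expanded from `env_variables`. The Python sketch below only imitates that expansion with `string.Template`, using the values from this file. It assumes the platform exports `env_variables` into the job's shell, which the diff itself does not show, and it does not reproduce full shell semantics.

```python
from string import Template

# Values copied from the env_variables block of mcloud_eval.yaml.
env = {
    "UV_BRANCH": "zhuang.2024-08-12-ultravox.batch_infer_1a",
    "EVAL_ARGS": "--config_path ultravox/evaluation/configs/eval_config_2k.yaml",
}

# The command from the config, split across lines only for readability.
command = (
    "cd ultravox-expts && poetry install --no-dev && "
    "poetry run torchrun --nproc_per_node=8 -m ultravox.tools.eval_tool "
    "$EVAL_ARGS --exp_name $UV_BRANCH"
)

# Template.substitute mimics simple $NAME expansion and raises KeyError
# if a variable referenced in the command is missing from env.
expanded = Template(command).substitute(env)
print(expanded)
```

Because `substitute` raises on a missing variable, a check like this could catch a command that references a name absent from `env_variables` before a job is submitted.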

20 changes: 20 additions & 0 deletions mcloud_train.yaml
@@ -0,0 +1,20 @@
# Ultravox training configuration
name: <RUN_NAME>
image: mosaicml/composer:latest
compute:
  gpus: 8
  cluster: r14z3p1
integrations:
  - integration_type: git_repo
    git_repo: fixie-ai/ultravox-expts
    git_branch: $UV_BRANCH
    pip_install: poetry==1.7.1
scheduling:
  max_duration: 6 # 6 hours max for jobs to avoid hanging jobs
command: >-
  cd ultravox-expts && poetry install --no-dev && poetry run torchrun --nproc_per_node=8 -m ultravox.training.train $TRAIN_ARGS --exp_name $UV_BRANCH
env_variables:
  MLFLOW_TRACKING_URI: databricks
  UV_BRANCH: <BRANCH_NAME>
  TRAIN_ARGS: --config_path ultravox/training/configs/expt_config.yaml
  HF_HUB_DOWNLOAD_TIMEOUT: "300" # Set timeout to 300 seconds (5 minutes)
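
The `<RUN_NAME>` and `<BRANCH_NAME>` placeholders in this template need real values before the file can be submitted. The PR does not show how they get filled, so the following is only a minimal sketch of one possible approach, assuming plain text substitution is sufficient. The helper `render_config` and the example values are hypothetical and not part of the PR.

```python
import re

# Matches placeholders of the form <UPPER_CASE_NAME>, as used in mcloud_train.yaml.
PLACEHOLDER = re.compile(r"<([A-Z_]+)>")


def render_config(template: str, values: dict) -> str:
    """Fill <NAME> placeholders from values (hypothetical helper).

    Raises KeyError if the template contains a placeholder with no value,
    so an unfilled template is never submitted by accident.
    """
    def fill(match):
        key = match.group(1)
        if key not in values:
            raise KeyError(f"no value supplied for placeholder <{key}>")
        return values[key]

    return PLACEHOLDER.sub(fill, template)


# A shortened stand-in for the template; the real file has more keys.
template = "name: <RUN_NAME>\nenv_variables:\n  UV_BRANCH: <BRANCH_NAME>\n"

# Example values are made up for illustration.
rendered = render_config(
    template,
    {"RUN_NAME": "example-run", "BRANCH_NAME": "example-branch"},
)
print(rendered)
```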
23 changes: 0 additions & 23 deletions ultravox/data/dataset_config.py

This file was deleted.
