Examples for E2E Mistral training & deployment #673
base: main
Conversation
Coming soon to AWS Machine Learning blog
Overall it is quite different from our examples, which are the equivalent of the examples in Transformers.
Should we create a directory like "aws-examples" or something where we would put them?
Is it the run_clm.py script we already provide?
The distinction is in dataset loading: it uses load_from_disk to load a saved DatasetDict before downloading from the HF Hub.
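That load-from-disk-first pattern could be sketched roughly like this (a hypothetical helper, not the actual script; the directory check and fallback logic are assumptions):

```python
import os


def get_dataset(path_or_id: str):
    """Load a DatasetDict saved with save_to_disk if the argument is a
    local directory; otherwise treat it as a Hub dataset id."""
    # Imported lazily so the helper has no hard dependency at module
    # import time.
    from datasets import load_dataset, load_from_disk

    if os.path.isdir(path_or_id):
        # A preprocessed DatasetDict was saved here earlier.
        return load_from_disk(path_or_id)
    # Fall back to downloading from the Hugging Face Hub.
    return load_dataset(path_or_id)
```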
Alright, I do not think this is really needed as a distinction. But that's fair.
Yes, we should rather update the general one to make it support loading datasets from disk.
Honestly I think we should keep examples easy. Saving the dataset on disk is not really needed. There is a caching mechanism, we should just push them to use the dataset id and let the Datasets library handle the rest.
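The suggestion here — pass the Hub dataset id and let the Datasets cache do the work — would look roughly like the following sketch (the gsm8k config name "main" comes from the Hub; this is not the example's actual code):

```python
def load_gsm8k():
    # The first call downloads from the Hugging Face Hub; subsequent
    # calls are served from the local cache (by default under
    # ~/.cache/huggingface/datasets), so no manual save_to_disk step
    # is needed.
    from datasets import load_dataset

    return load_dataset("gsm8k", "main")
```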
In our example, gsm8k is not pre-formatted for run_clm.py, which uses a single text column for training data, so the dataset needs to be formatted.
The reason we're loading from disk is to show customers (in the blog post) how the dataset needs to be formatted by leading them through it step-by-step (which ends up being dataset.py). While we can modify the training script to shape the dataset as necessary, we thought it'd be easier to show it in a standalone file - for the purposes of our blog post, run_clm.py isn't discussed in detail (the idea being it's plug and play with any LLM or dataset) so simplicity isn't necessarily our concern there.
Technically speaking, we could modify run_clm.py to shape the data itself, but I'd also argue it's better to have a training script that can take in any pre-formatted dataset, rather than having to modify it every time for new datasets. Hope this helps you see my point of view.
@pagezyhf @philschmid wdyt?
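The step-by-step formatting described above (what ends up as dataset.py) might look something like this sketch. The column names "question" and "answer" reflect the public gsm8k dataset, but the concrete prompt template is an assumption, not the blog's actual code:

```python
def format_example(example: dict) -> dict:
    # gsm8k rows carry "question" and "answer" columns; run_clm.py
    # trains on a single text column, so concatenate them into one.
    text = (
        "### Question\n" + example["question"]
        + "\n\n### Answer\n" + example["answer"]
    )
    return {"text": text}


def build_dataset(save_path: str = "./formatted_gsm8k"):
    # Download gsm8k, map every row to the single "text" column, and
    # save the result so the training script can load it from disk.
    from datasets import load_dataset

    ds = load_dataset("gsm8k", "main")
    formatted = ds.map(format_example, remove_columns=["question", "answer"])
    formatted.save_to_disk(save_path)
    return formatted
```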
Thank you for opening the PR. I'm missing some context: what is the goal here? I cannot see instructions on what to do with the YAML file or how the scripts are used. If we want to make this an official example, we should make sure it's understandable and usable for every user.
So this is for an AWS Machine Learning blog, to give beginners and intermediates an idea of how to use AWS Trainium and AWS Inferentia2 for simple end-to-end training and inference. The YAML file is an Amazon CloudFormation template - it doesn't necessarily need to be included, so I can take that out. Overall, similar to the other examples in this directory, we (will) have documentation elsewhere that refers to these examples. Michael suggested we might create an aws-examples directory, which would be fine as well.
I would say let's make the tutorial as minimalist as possible and put it in an aws-examples directory.
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Thank you!
What does this PR do?
Adds examples that are used in an upcoming AWS Machine Learning blog post.