Skip to content

Conversation

@XenonMolecule
Copy link

Description

While I was onboarding to Marin I noticed that some of the parameters to ray_run have changed. I updated the commands in the tutorial to reflect the changes I observed.

Before:

uv run lib/marin/src/marin/run/ray_run.py --no_wait --env_vars WANDB_API_KEY=${WANDB_API_KEY} -- python experiments/hello_world.py

After

uv run lib/marin/src/marin/run/ray_run.py  --cluster infra/marin-us-central1.yaml --no_wait -e WANDB_API_KEY ${WANDB_API_KEY} -- python experiments/tutorials/hello_world.py

For testing I also personally found it valuable to view the logs for a run so I added this line to the tutorial:

# Get Job Logs for a specific job
ray job logs --address "http://127.0.0.1:8265" <JOB_ID>

I would appreciate if a maintainer more familiar with the cluster/ray would double check that this is the recommended way of doing this!

Copilot AI review requested due to automatic review settings January 7, 2026 18:12
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates the internal development guidelines to reflect changes in the ray_run.py command-line interface parameters. The documentation now uses the current parameter syntax and adds a helpful command for viewing job logs.

Key Changes

  • Updated ray_run.py command to use --cluster parameter and shortened -e flag for environment variables
  • Added ray job logs command to help developers view logs for specific jobs
  • Updated the path to the example script from experiments/hello_world.py to experiments/tutorials/hello_world.py

Copy link
Collaborator

@rjpower rjpower left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for the cleanup!

uv run scripts/ray/cluster.py --config infra/marin-us-central1.yaml list-jobs
# Get Job Logs for a specific job
ray job logs --address "http://127.0.0.1:8265" <JOB_ID>
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For this, let's make sure the user know they should connect to the dashboard first:

# connect to dashboard
# `uv run ...cluster.py dashboard`
ray job logs...

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated to this!

# Get Job Logs for a specific job
# Ensure that the dashboard for the correct cluster is running (run this in another terminal)
# > uv run scripts/ray/cluster.py --config infra/marin-us-central1.yaml dashboard
ray job logs --address "http://127.0.0.1:8265" <JOB_ID>

Thanks for the suggestion!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants